Job Description

ABOUT THE ROLE

We are looking for a Senior Engineer to join our AI team at the intersection of evaluation science, post-training, and foundation model development. You will own our end-to-end eval and benchmarking infrastructure — the critical feedback loop that drives every major model improvement — while contributing hands-on to post-training pipelines for industry-specific vertical foundation models.

This role is ideal for someone who has worked directly inside an LLM lab and understands what rigorous evaluation looks like at scale: designing the taxonomy of skills being measured, identifying failure modes, engineering synthetic data to close capability gaps, and translating eval signals into actionable training decisions.

WHAT YOU'LL DO

Evaluation & Benchmarking

  • Design and own task-level evaluation frameworks for LLM agents and base models, covering multi-step reasoning, tool/API use, instructio...

Ready to Apply?

Take the next step in your AI career. Submit your application to Chegg India today.