Job Description

  • Title:
    Senior Software Engineer – LLM Evaluation (Remote)
  • Engagement:
    Hourly contract (independent contractor)
  • Location:
    Remote

About the Opportunity

One of our global AI research clients is developing advanced evaluation and benchmarking datasets to improve the performance of large language models in real-world software engineering scenarios. This role focuses on assessing AI-generated code and strengthening model reliability across production-grade engineering workflows.

Role Overview

As a Senior Software Engineer supporting AI model evaluation, you will contribute to building high-quality datasets used for training and benchmarking large language models. You will work closely with researchers to curate code examples, provide precise technical solutions, and refine AI-generated outputs across multiple programming languages.

This role blends hands-on software engineering expertise with structured AI evalua...

Ready to Apply?

Take the next step in your AI career. Submit your application to Confidential today.

Submit Application