Job Description
- Title: Senior Software Engineer – LLM Evaluation (Remote)
- Engagement: Hourly contract (independent contractor)
- Location: Remote
About the Opportunity
One of our global AI research clients is developing advanced evaluation and benchmarking datasets to improve the performance of large language models in real-world software engineering scenarios. This role focuses on assessing AI-generated code and strengthening model reliability across production-grade engineering workflows.
Role Overview
As a Senior Software Engineer supporting AI model evaluation, you will contribute to building high-quality datasets used for training and benchmarking large language models. You will work closely with researchers to curate code examples, provide precise technical solutions, and refine AI-generated outputs across multiple programming languages.
This role blends hands-on software engineering expertise with structured AI evalua...