Job Description
- Title: Senior Software Engineer – LLM Evaluation (Remote)
- Engagement: Hourly contract (independent contractor)
- Location: Remote
About the Opportunity
One of our global AI research clients is building advanced evaluation and training datasets to improve large language models on realistic software engineering tasks. This project focuses on creating verifiable software engineering challenges derived from public repository histories using a structured, human-in-the-loop approach. The goal is to expand dataset coverage across programming languages, complexity levels, and real-world development scenarios.
Role Overview
We are seeking experienced, tech lead–level software engineers who are comfortable working with high-quality public GitHub repositories (500+ stars). This role combines hands-on engineering work with AI model evaluation, contributing directly to how AI systems interact with real-world codebases.
Wh...
Ready to Apply?
Take the next step in your AI career. Submit your application to Nexus Consulting today.