Job Description
Key Responsibilities
- Systematically analyze, solve, and document benchmark tasks involving Docker, shell scripting, and Linux system administration
- Evaluate agent outputs for correctness, reproducibility, and reliability across complex multi-step CLI workflows
- Provide detailed, evidence-based reasoning grounded in code structure and terminal behavior
- Synthesize information across files and configurations to assess end-to-end architecture
- Contribute high-quality reference solutions and diagnostic insights to improve agent performance metrics
Ideal Qualifications
- 1-3 years of software engineering experience
- Bachelor’s or Master’s in Computer Science or related field from a top 50–100 global university
Based in one of the Five Eyes countries: United States, United Kingdom, Canada, Australia, or New Zealand
- Deep familiarity with terminal workflows, Linux e...
Ready to Apply?
Take the next step in your AI career. Submit your application to Mercor today.
Submit Application