Job Description

Key Responsibilities

  • Systematically analyze, solve, and document benchmark tasks involving Docker, shell scripting, and Linux system administration
  • Evaluate agent outputs for correctness, reproducibility, and reliability across complex multi-step CLI workflows
  • Provide detailed, evidence-based reasoning grounded in code structure and terminal behavior
  • Synthesize information across files and configurations to assess end-to-end architecture
  • Contribute high-quality reference solutions and diagnostic insights to improve agent performance metrics

Ideal Qualifications

  • 1–3 years of software engineering experience
  • Bachelor’s or Master’s degree in Computer Science or a related field from a top 50–100 global university
  • Based in one of the Five Eyes countries: United States, United Kingdom, Canada, Australia, or New Zealand
  • Deep familiarity with terminal workflows, Linux e...

Ready to Apply?

Take the next step in your AI career. Submit your application to Mercor today.
