Agent Quality / Evals Engineer 1754

SOFTGIC

📍 Colombia, Colombia, Colombia

Full-time Engineers Posted June 25, 2026

Apply Now Similar Jobs

Job Description

                       This is a remote position.
 Owns the eval harness and quality gate from the beginning. This role replaces the old late-stage “Evals Specialist” model with a standing owner for measurable agent quality. 
 
 Key Responsibilities
 
 • Build and maintain the MVP eval harness: golden tasks, exception tasks, scorecard metrics, and regression packs.
 
 • Wire evals into CI so quality regressions fail builds and releases.
 
 • Define and maintain release-gate thresholds with Product and the Tech Lead.
 
 • Lay the path for later adversarial and drift-testing expansion without overbuilding MVP scope.
 
Requirements Must-Have Qualifications 
 
 • Experience evaluating ML, LLM, or non-deterministic systems.
 
 • Strong tes...

Ready to Apply?

Take the next step in your AI career. Submit your application to SOFTGIC today.

Submit Application

Job Details

Location

Colombia, Colombia, Colombia

Job Type

Full-time