Job Description

About The Job
Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: General Chat Behavior Evaluator
Type: Full-time or Part-time Contract Work
Compensation: $36/hour
Location: Geography restricted to Taiwan, Malaysia, USA

Role Responsibilities

  • Evaluate LLM-generated responses for effectiveness in answering user queries.
  • Conduct fact-checking using trusted public sources and external tools.
  • Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies.
  • Assess reasoning quality, clarity, tone, and completeness of responses.
  • Ensure model responses align with expected conversational behavior and system guidelines.
