Job Description
Senior Research Scientist, Model Evaluation
Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises building AI systems that power content generation, semantic search, RAG, and agents. We believe our work is instrumental to the widespread adoption of AI and that each person on the team contributes to increasing the capabilities of our models and the value they bring to customers.
Why this role?
Evaluation is critical to making progress in scaling intelligence. As models become superhuman in many real-world use cases, we continue to develop new evaluation techniques that accurately reflect current capabilities and set the agenda for future progress. In this role you will create next‑generation evaluation methods and infrastructure to measure LLM progress.
Responsibilities
- Create ambitious new evaluation benchmarks that push the limits ...
Ready to Apply?
Take the next step in your AI career. Submit your application to Cohere today.
Submit Application