Job Description

Member of Technical Staff, Model Efficiency

1 day ago Be among the first 25 applicants

Get AI-powered advice on this job and more exclusive features.

Who are we?

Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

Why this role?

Our team is a fast-growing group of researchers and engineers focused on building reliable ML systems and pushing the boundaries of LLM inference efficiency. We develop techniques that improve how models execute in production, driving lower latency, higher throughput, and consistent quality across diverse workloads.

As an engineer on this team, you’ll work across the inference stack to improve core performance metric...

Ready to Apply?

Take the next step in your AI career. Submit your application to Cohere today.

Submit Application