Job Description
About The Role
As a Compute / Server Platform Architect on the Cluster Architecture Team, you will own the server-side platform architecture that enables Cerebras CS3-based AI clusters (training and inference) to deliver predictable performance, scalability, and reliability. Our accelerators are network-attached, so the x86 server fleet is a first‑class part of the end-to-end system: it runs critical‑path runtime functions (for example orchestration, prompt caching, and IO/control services) and must be co‑designed with software for token‑level latency, throughput, and cost efficiency. You will translate workload behavior into CPU, memory, IO, PCIe, and host‑networking requirements, drive platform evaluations with vendors, and provide technical leadership through qualification and production adoption in close partnership with other function leaders and TPMs.
Responsibilities
- Own the architecture for all server roles in Cerebras clusters, including...
Ready to Apply?
Take the next step in your AI career. Submit your application to Cerebras Systems today.
Submit Application