Job Description
Job Responsibilities :
We are looking for Site Reliability Engineers (SRE) and Platform Engineers to join our AI Software team. In this role, you will ensure the reliability, performance, scalability, and operational excellence of AI products, model-serving infrastructure, and backend API systems.As a Platform Engineer, you could also design, build, and operate the core platforms that enable scalable AI model serving, data pipelines, and microservices across our organization. This role focuses on Kubernetes-based systems, cloud infrastructure, developer productivity tooling, automation, and the reliability of shared services.
You’ll work closely with software engineers, AI teams and release teams to automate operations, enhance observability, and streamline deployments in a cloud-scale environment. This role is ideal for someone who enjoys building resilient systems, solving complex infrastructure problems, and supporting AI workloads in p...
Ready to Apply?
Take the next step in your AI career. Submit your application to Razer today.
Submit Application