Job Description
CirrusLabs is searching for a Platform Site Reliability Engineer (SRE) in Mexico City. The role focuses on supporting the reliability and observability of AI platform environments. Candidates should have hands-on experience in Linux troubleshooting, operational automation, and incident response, particularly in Kubernetes and GPU operations.
Applicants should also be proficient with monitoring tools such as Prometheus and Grafana for tracking platform health. The role demands strong collaboration skills and the ability to automate operational tasks.
Ready to Apply?
Take the next step in your AI career. Submit your application to CirrusLabs today.