Job Description
Amplify your influence as a Senior Site Reliability Engineer specializing in AWS cloud systems. Ensure secure deployments and strong operational efficiency to empower developers worldwide.
In this senior position, you will apply software and systems engineering skills to build scalable, self-healing infrastructure. You will use SLIs and SLOs to measure and improve performance, and drive reliability through effective incident response and thorough postmortems that continuously improve systems.
Key Responsibilities:
• Define and manage SLIs, SLOs, and error budgets
• Reduce MTTD, MTTA, and MTTR through effective incident response
• Conduct blameless postmortems for ongoing improvement
• Champion reliability during architectural assessments
• Design actionable alerts and insightful dashboards
Requirements:
• Experience with AWS services like EC2 and EKS
• Proven knowledge of Terraform or CloudFormation
• Strong skills with observability tools like Gr...
Ready to Apply?
Take the next step in your AI career. Submit your application to Devopie today.