Job Description

SRE Devops aws - CREQ194972 Description

System Reliability Design, build, and maintain reliable and scalable infrastructure solutions to support our applications and services.
Automation: Develop automation tools and processes to improve efficiency, streamline operations, and reduce manual intervention.
Monitoring and Alerting: Implement robust monitoring and alerting systems to proactively identify and resolve issues before they impact customers.
Incident Management: Participate in incident response, troubleshooting, and resolution to ensure minimal downtime and optimal performance.
Performance Optimization: Continuously optimize system performance, capacity, and resource utilization.
Reliability Engineering: Apply engineering principles to design resilient systems, implement fault-tolerant solutions, and conduct post incident reviews.
Collaboration: Work closely with cross functional teams including software engineers, DevOps, and product managers to...

Ready to Apply?

Take the next step in your AI career. Submit your application to Virtusa today.

Submit Application