Job Description
Role Summary
We are looking for a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our cloud-native infrastructure. The ideal candidate will bring strong hands-on experience in AWS, Kubernetes, Docker, CI/CD pipelines, monitoring, and automation using Python, and will work closely with development and operations teams to build resilient, highly available systems.
Key Responsibilities
- Design, deploy, and maintain highly available and scalable systems on AWS
- Manage and operate containerized applications using Docker and Kubernetes (EKS)
- Build, maintain, and optimize CI/CD pipelines using Jenkins
- Automate operational workflows and routine tasks using Python scripting
- Implement and manage monitoring, alerting, a...
Ready to Apply?
Take the next step in your AI career. Submit your application to TRDFIN Support Services Pvt Ltd today.