Job Description

Role Summary

We are looking for a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our cloud-native infrastructure. The ideal candidate has strong hands-on experience with AWS, Kubernetes, Docker, CI/CD pipelines, monitoring, and automation in Python, and will work closely with development and operations teams to build resilient, highly available systems.

Key Responsibilities

  • Design, deploy, and maintain highly available and scalable systems on AWS
  • Manage and operate containerized applications using Docker and Kubernetes (EKS)
  • Build, maintain, and optimize CI/CD pipelines using Jenkins
  • Automate operational workflows and routine tasks using Python scripting
  • Implement and manage monitoring, alerting, a...

Ready to Apply?

Take the next step in your AI career. Submit your application to TRDFIN Support Services Pvt Ltd today.
