Job Description

Required Experience & Skills

  • Relevant industry experience with 3+ years as a Site Reliability Engineering or as Lead.
  • Strong knowledge of system architecture, networking, and microservice-based distributed systems with cloud platforms (Azure) and related services
  • Expertise in Linux, Docker, Kubernetes/K8, and container orchestration for reliable, scalable systems.
  • Implement GitOps workflows using ArgoCD for continuous delivery and infrastructure automation.
  • Proficiency in monitoring/alerting/logging tools: Prometheus, Grafana, OpenTelemetry Collector.
  • Skilled in Terraform and other infrastructure automation scripting.
  • Knowledge of disaster recovery planning and execution for cloud systems.
  • Commitment to staying updated with latest SRE trends, tools, and technologies.

Ready to Apply?

Take the next step in your AI career. Submit your application to ZEISS India today.

Submit Application