Job Description

We are seeking an experienced **Senior Observability DevOps Engineer** to join our dynamic team.
RESPONSIBILITIES
- Manage AWS infrastructure using Terraform and CloudFormation, including tasks like EKS version upgrades, blue/green deployments, and scaling
- Set up, tune, and modernize various observability services including Cortex/Mimir, Loki, Tempo, OpenTelemetry, Grafana, and Alertmanager
- Automate operations programmatically using Python or Golang and Gitlab CI, plus develop custom self-service solutions based on AWS Service Catalog
- Build Docker images for multiple architectures including arm64 and amd64
- Troubleshoot issues related to microservices in Kubernetes, AWS connectivity, service performance, Lambda functions, and Kafka
- Participate in hypercare events and on-call shifts
**REQUIREMENTS**:
- Proficiency in version control using Git, GitHub, and GitLab alongside CI/CD pipelines
- Strong experience with Infrastructure as Code (IaC) tools l...

Ready to Apply?

Take the next step in your AI career. Submit your application to EPAM Systems today.

Submit Application