Job Description

Key Responsibilities

Infrastructure Management

  • Design, build, and maintain cloud infrastructure on AWS (EKS, Aurora, S3, CloudFront)
  • Manage Kubernetes clusters for containerized application deployments
  • Implement infrastructure as code using Terraform or CloudFormation
  • Ensure high availability, disaster recovery, and system resilience
  • Optimize infrastructure costs while maintaining performance requirements

CI/CD & Automation

  • Own and continuously improve CI/CD pipelines using GitHub Actions and ArgoCD
  • Achieve and maintain a build success rate of 95% or higher
  • Automate repetitive operational tasks
  • Implement automated testing and quality gates in deployment pipelines
  • Enable developers to deploy confidently and frequently

Monitoring & Observability

  • Implement and maintain monitoring using ELK Stack, Prometheus, and Grafana
  • Set up proactive alerting for ...

Ready to Apply?

Take the next step in your AI career. Submit your application to ONE ENGINE today.
