Job Description

RESPONSIBILITIES

  • Operate and optimize Kubernetes-based infrastructure using HELM/ kustomize for deployment and configuration management.
  • Build and maintain CI/CD pipelines for infrastructure and application deployments.
  • Manage and monitor cloud infrastructure on AWS (EKS, EC2, S3, IAM, VPC, etc.). and on premise infrastructure
  • Ensure observability through logging, monitoring, and alerting systems (e.g., Prometheus, Grafana, Cloudwatch, DataDog ).
  • Implement and enforce security best practices across infrastructure components.
  • Participate in on-call rotations, incident response, and root cause analysis.
  • Support scaling of systems to meet demand while maintaining reliability.
  • Collaborate with engineering and security teams on architecture and deployment strategies.
  • Ensure the implementation of security standards and compliance requirements across all operational aspects of the cloud platforms. <...

Ready to Apply?

Take the next step in your AI career. Submit your application to Tricog Health today.

Submit Application