Job Description
RESPONSIBILITIES
- Operate and optimize Kubernetes-based infrastructure using HELM/ kustomize for deployment and configuration management.
- Build and maintain CI/CD pipelines for infrastructure and application deployments.
- Manage and monitor cloud infrastructure on AWS (EKS, EC2, S3, IAM, VPC, etc.). and on premise infrastructure
- Ensure observability through logging, monitoring, and alerting systems (e.g., Prometheus, Grafana, Cloudwatch, DataDog ).
- Implement and enforce security best practices across infrastructure components.
- Participate in on-call rotations, incident response, and root cause analysis.
- Support scaling of systems to meet demand while maintaining reliability.
- Collaborate with engineering and security teams on architecture and deployment strategies.
- Ensure the implementation of security standards and compliance requirements across all operational aspects of the cloud platforms. <...
Ready to Apply?
Take the next step in your AI career. Submit your application to Tricog Health today.
Submit Application