Job Description
Key Responsibilities
- Operate, monitor, and maintain cloud infrastructure (AWS/GCP/Azure) and supporting services (compute, networking, storage, IAM).
- Maintain system availability, perform incident response and on-call rotations, and participate in post-mortems with actionable remediation.
- Build, maintain, and run automation for provisioning, configuration management, patching, and repetitive operational tasks (IaC, scripts, configuration management).
- Manage and tune CI/CD pipelines and deployment tooling in coordination with engineering teams.
- Administer core services: DNS, load balancers, VPN, firewalls, identity providers (SSO/Okta), email systems, and backup solutions.
- Implement and operate monitoring, alerting, and logging (Prometheus, Grafana, ELK/Opensearch, Datadog, or similar).
- Perform capacity planning, resource optimization, and cost management for cloud resources.
- Maintain system hardening, securit...
Ready to Apply?
Take the next step in your AI career. Submit your application to MUVR today.
Submit Application