Job Description
ROLE AND RESPONSIBILITIES:
A Site Reliability Engineer (SRE) is expected to own the operational stability and performance of hybrid cloud infrastructure (Nutanix, AWS/GCP). This involves leading automation efforts, architecting for reliability, and acting as the final escalation point for critical incidents to ensure the platform is scalable and efficient.
Nutanix Platform Management
- Design, deploy, and maintain enterprise-scale Nutanix AHV clusters and Prism Central for multi-cluster management
- Expert-level proficiency with Nutanix CLI (nCLI and acli) for advanced operations, troubleshooting, and automation
- Develop automation scripts using Nutanix REST APIs, Python SDK, PowerShell, and Terraform for infrastructure-as-code
- Create and manage VM templates, golden images, and standardized deployment catalogs for consistent provisioning
- Design disaster recovery solutions using Leap, Pr...
Ready to Apply?
Take the next step in your AI career. Submit your application to Proglite today.
Submit Application