Job Description

ROLE AND RESPONSIBILITIES:

A Site Reliability Engineer (SRE) is expected to own the operational stability and performance of hybrid cloud infrastructure (Nutanix, AWS/GCP). This involves leading automation efforts, architecting for reliability, and acting as the final escalation point for critical incidents to ensure the platform is scalable and efficient.

Nutanix Platform Management

  • Design, deploy, and maintain enterprise-scale Nutanix AHV clusters and Prism Central for multi-cluster management
  • Expert-level proficiency with Nutanix CLI (nCLI and acli) for advanced operations, troubleshooting, and automation
  • Develop automation scripts using Nutanix REST APIs, Python SDK, PowerShell, and Terraform for infrastructure-as-code
  • Create and manage VM templates, golden images, and standardized deployment catalogs for consistent provisioning
  • Design disaster recovery solutions using Leap, Pr...

Ready to Apply?

Take the next step in your AI career. Submit your application to Proglite today.

Submit Application