Job Description

Be a critical part of enterprise operations as a L1 SRE Operations Engineer. Focus on monitoring and triaging incidents using cloud technologies, Kubernetes, and various APIs.

In this essential role, you will monitor system health and troubleshoot alerts across both cloud and on-prem infrastructure. With responsibilities including executing standardized runbooks and providing clear communication during incidents, this position is pivotal for operational continuity. Experience with Kubernetes, incident triage, and cloud operations are mandatory. Moreover, your contributions will improve automation and streamline the onboarding of applications into the operations framework.

Key Responsibilities:
• Monitor health and alerts across applications and infrastructure
• Execute runbooks for timely incident resolution
• Perform initial triage of incidents and escalate when needed
• Document new issues and automation opportunities
• Support ...

Ready to Apply?

Take the next step in your AI career. Submit your application to Hitachi today.

Submit Application