Job Description

About the Role

As a Site Reliability Engineer, you'll be at the heart of our production health. You won't just be keeping the lights on; you'll be diagnosing complex issues, leveraging cutting-edge observability tools, and building the automation that makes downtime a thing of the past.

Key Responsibilities

  • Rapid Response:
    Swiftly diagnose and resolve production issues for both web and mobile applications.
  • Root Cause Analysis:
    Identify patterns in recurring issues and implement strategic, long-term stability solutions.
  • Modern Observability:
    Leverage tools like Grafana and AppDynamics to gain deep system insights and enable early issue detection.
  • Drive Automation:
    Reduce manual toil by developing scripts and refining workflows to accelerate incident response.
  • Collaborate:
    Work within Agile/Scrum teams to tune performance across the cloud ecosystem.

Must-Have Qualifications <...

Ready to Apply?

Take the next step in your career. Submit your application to Nezda Global today.
