Job Description

Your Role:

We are seeking a Staff Site Reliability Engineer (Infrastructure & Site Reliability Engineering) with extensive experience in AWS, Azure, Kubernetes, and GitOps to lead our Site Reliability Engineering (SRE) team. The successful candidate will have a deep understanding of SRE practices and a track record of implementing them to a high standard (SLAs, SLOs, proactive alert management, incident response and review, postmortems, etc.).


In this role, you will work with our SRE and cross-functional engineering teams to develop and operate our development and production infrastructure and operations.

Responsibilities:

  • Work collaboratively with software engineering on infrastructure and deployment requirements

  • Actively contribute to our automation and observability initiatives

  • Build and maintain operational tools for deployment, monitoring, and analysis of cloud (AWS & Azure) infrastructure...