Job Description

The Head of Site Reliability Engineering is a hybrid technical‑leadership role. You will:

  • Own reliability of production services running on AWS while steering the roadmap for platform resilience and building out the SRE team.
  • Lead and grow a remote team of SREs—coaching, hiring, performance‑managing, and fostering a blameless culture.
  • Set and enforce Service Level Objectives (SLOs), error budgets, and incident response processes.
  • Drive automation via Infrastructure‑as‑Code (Pulumi / TypeScript), CI/CD, and observability pipelines.
  • Represent the SRE discipline to product, engineering, and senior leadership across our global business.
  • Take a hands‑on role in monitoring and incident response, which will be critical as the team grows.

Key Responsibilities

  • Leadership & People Management
      • Build an initial SRE team of 3-6 engineers: goal setting, career development, regular 1:1s...

Ready to Apply?

Take the next step in your AI career. Submit your application to Acquire Intelligence today.
