Job Description
Job Description:
The Head of Site Reliability Engineering is a hybrid technical‑leadership role. You will:
- Own reliability of production services running on AWS while steering the roadmap for platform resilience and building out the SRE team.
- Lead and grow a remote team of SREs—coaching, hiring, performance‑managing, and fostering a blameless culture.
- Set and enforce Service Level Objectives (SLOs), error budgets, and incident response processes.
- Drive automation via Infrastructure‑as‑Code (Pulumi / TypeScript), CI/CD, and observability pipelines.
- Represent the SRE discipline to product, engineering, and senior leadership across our global business.
- Hands on monitoring and incident response will be critical as the team grows.
Key Responsibilities
- Leadership & People Management
- Build an SRE team of initially 3-6 engineers: goal setting, career development, regular 1:1s...
Ready to Apply?
Take the next step in your AI career. Submit your application to Acquire Intelligence today.
Submit Application