Job Description
The position is hands-on and establishes SRE best practices, automation standards and observability capabilities for Spire. The role contributes to the long-term evolution of reliability engineering at EquiLend, starting with Spire and potentially expanding to the wider product base over time.
Key Responsibilities
Build and maintain the reliability, availability and performance posture of the Spire platform through application of SRE principles and software engineering practices
Establish and embed operational standards, governance models and reliability processes across engineering and operations teams
Design, implement and optimise observability capabilities - including metrics, logs, traces and alerting - to proactively identify performance risk and capacity constraints
Define and maintain SLIs, SLOs and SLAs for critical Spire services and ensure alignment across engineering and product functions
Lead capa...
Ready to Apply?
Take the next step in your AI career. Submit your application to Confidential today.
Submit Application