Job Description
Site Reliability Engineer (SRE)
Position Summary
The Site Reliability Engineer (SRE) will be a hands-on contributor within the Site Reliability Engineering Center of Excellence (CoE), responsible for building monitoring and observability solutions, troubleshooting production issues, and participating in 24x7 on-call operations.
This role focuses on the execution of reliability practices, implementing observability tooling, improving MTTR/MTTD through automation, and ensuring production systems are resilient, observable, and performant. The SRE will collaborate closely with Principal and Senior Staff SREs, adopting best practices and frameworks defined by the CoE while directly contributing to enterprise reliability goals. This role reports to the Sr. Manager, SRE.
Key Responsibilities
Execution & CoE Alignment
Position Summary
The Site Reliability Engineer (SRE) will be a hands-on contributor within the Site Reliability Engineering Center of Excellence (CoE), responsible for building monitoring and observability solutions, troubleshooting production issues, and participating in 24x7 on-call operations.
This role focuses on the execution of reliability practices, implementing observability tooling, improving MTTR/MTTD through automation, and ensuring production systems are resilient, observable, and performant. The SRE will collaborate closely with Principal and Senior Staff SREs, adopting best practices and frameworks defined by the CoE while directly contributing to enterprise reliability goals. This role reports to the Sr. Manager, SRE.
Key Responsibilities
Execution & CoE Alignment
- Implement SRE frameworks, best practices, and playbooks provided...
Ready to Apply?
Take the next step in your AI career. Submit your application to GHX today.
Submit Application