Job Description
We are seeking a highly skilled Site Reliability Engineering (SRE) Subject Matter Expert (SME) to lead and advance our observability, performance engineering, reliability, and AIOps practices. The SME will be responsible for designing, implementing, and evangelizing modern SRE capabilities that improve system reliability, scalability, and efficiency across our IT ecosystem. This role requires deep technical expertise, hands-on problem-solving skills, and the ability to influence cross-functional teams.
Key Responsibilities
Observability & Monitoring
Define and implement observability frameworks across logs, metrics, traces, and events.
- Architect monitoring platforms (e.g., Prometheus, Grafana, ELK, Splunk, Datadog, Dynatrace, New Relic) to deliver actionable insights.
- Establish SLOs, SLIs, and error budgets in collaboration with product and engineering teams.
- Drive proactive incident detection and root cause anal...
Ready to Apply?
Take the next step in your AI career. Submit your application to Quess Philippines Corp. today.
Submit Application