Job Description
Job Summary:
We’re looking for a Site Reliability Engineer (SRE) with 4-6 years of experience with strong technical and analytical skills to ensure the reliability, scalability, and performance of our core applications. This role focuses on improving the stability and efficiency of distributed systems built on Java and microservices architecture, driving operational excellence through monitoring, automation, and incident management.
Key Responsibilities:
Application Reliability & Performance
- Monitor and maintain the health, performance, and reliability of production applications.
- Define, measure, and track SLIs/SLOs for key services, driving improvements proactively.
- Identify performance bottlenecks, memory leaks, and slow transactions in Java-based microservices.
Ready to Apply?
Take the next step in your AI career. Submit your application to Landmark Group today.
Submit Application