Job Description

Job Description
We are seeking a Mid-Level Site Reliability Engineer (SRE) to support a modern, cloud-based application platform. This role will focus on improving reliability, scalability, and observability across core systems while helping transition teams from reactive support to proactive engineering practices.
The ideal candidate will bring solid experience in cloud environments, application monitoring, and automation, along with the ability to collaborate closely with development teams and contribute to the maturation of SRE practices.

Key Responsibilities

Support and enhance monitoring, alerting, and observability frameworks across production environments
Troubleshoot production issues and participate in root cause analysis to reduce recurrence
Support cloud-based applications and contribute to platform reliability improvements
Partner with engineering teams to identify and resolve performance, scalability, and resiliency gaps
Automate operati...

Ready to Apply?

Take the next step in your AI career. Submit your application to Insight Global today.

Submit Application