Job Description
Join our team as a **Lead Site Reliability Engineer** to drive system reliability, observability, and performance monitoring for mission-critical digital trading products.
You will lead monitoring initiatives in a high-availability trading environment, ensuring stable connectivity to external partners while proactively identifying opportunities for continuous improvement. At EPAM, you'll work on cutting-edge technologies, solve complex challenges, and shape the future of digital innovation. With access to continuous learning, mentorship, and global projects, your expertise will drive meaningful change.
Req# 968473077
**Responsibilities**
+ Define and implement a strategic reliability vision for the trading portfolio, covering infrastructure, network connectivity, application performance, and throughput
+ Lead and oversee a team of SRE engineers, providing technical direction, mentorship, and performance guidance
+ Own and evolve the SLA/SLO...
You will lead monitoring initiatives in a high-availability trading environment, ensuring stable connectivity to external partners while proactively identifying opportunities for continuous improvement. At EPAM, you'll work on cutting-edge technologies, solve complex challenges, and shape the future of digital innovation. With access to continuous learning, mentorship, and global projects, your expertise will drive meaningful change.
Req# 968473077
**Responsibilities**
+ Define and implement a strategic reliability vision for the trading portfolio, covering infrastructure, network connectivity, application performance, and throughput
+ Lead and oversee a team of SRE engineers, providing technical direction, mentorship, and performance guidance
+ Own and evolve the SLA/SLO...
Ready to Apply?
Take the next step in your AI career. Submit your application to EPAM Systems today.
Submit Application