Job Description

Join our team as a Lead Site Reliability Engineer to drive system reliability, observability, and performance monitoring for mission-critical digital trading products.

You will lead monitoring initiatives in a high-availability trading environment, ensuring stable connectivity to external partners while proactively identifying opportunities for continuous improvement. At EPAM, you'll work on cutting-edge technologies, solve complex challenges, and shape the future of digital innovation. With access to continuous learning, mentorship, and global projects, your expertise will drive meaningful change.

Req#

Responsibilities

  • Define and implement a strategic reliability vision for the trading portfolio, covering infrastructure, network connectivity, application performance, and throughput
  • Lead and oversee a team of SRE engineers, providing technical direction, mentorship, and performance guidance
  • Own and evolve the SLA/SLO/...

Ready to Apply?

Take the next step in your AI career. Submit your application to EPAM Systems today.

Submit Application