Job Description
Join our team as a Lead Site Reliability Engineer to drive system reliability, observability, and performance monitoring for mission-critical digital trading products.
You will lead monitoring initiatives in a high-availability trading environment, ensuring stable connectivity to external partners while proactively identifying opportunities for continuous improvement. At EPAM, you'll work on cutting-edge technologies, solve complex challenges, and shape the future of digital innovation. With access to continuous learning, mentorship, and global projects, your expertise will drive meaningful change.
Req#
Responsibilities
- Define and implement a strategic reliability vision for the trading portfolio, covering infrastructure, network connectivity, application performance, and throughput
- Lead and oversee a team of SRE engineers, providing technical direction, mentorship, and performance guidance
- Own and evolve the SLA/SLO/...
Ready to Apply?
Take the next step in your AI career. Submit your application to EPAM Systems today.
Submit Application