Job Description

We are looking for a
Site Reliability Engineer
to join a new team at one of our clients, a major American pet care retailer offering supplies, services, and care solutions. This is an opportunity to join a large, well-established organization that combines retail, services, and digital solutions to improve the lives of pets and their owners, in a collaborative environment with the chance to work on impactful, customer-facing products at scale.

Responsibilities

  • Ensure high availability, reliability, and performance of retail systems (e-commerce, checkout, inventory), especially during peak sales events.
  • Monitor systems using SLIs/SLOs, lead incident response, and perform root cause analysis to reduce downtime and customer impact.
  • Design and maintain scalable, fault-tolerant infrastructure using cloud platforms, containers, and Infrastructure as Code.
  • Automate deployments, testing, and operational tasks through CI/CD pipelines and se...

Ready to Apply?

Take the next step in your AI career. Submit your application to Grid Dynamics today.

Submit Application