Job Description

We are looking for a Principal Site Reliability Engineer to join our dynamic Services team. In this role, you will contribute to the reliability and scalability of our cutting-edge platform, ensuring exceptional solutions tailored to our customers’ unique needs. This is a highly technical, hands‑on role that requires deep expertise in system reliability and automation.

Key Responsibilities:

  • Reliability Engineering: Design and build automated systems that ensure the reliability and scalability of our Kubernetes clusters and Hydrolix deployments across multiple cloud platforms, eliminating manual operational tasks.
  • Automation and Efficiency : Identify, quantify, and systematically eliminate repetitive manual work through automation and improved tooling, eliminating toil and freeing the team to focus on high‑value work.
  • Observability Infrastructure : Build and enhance comprehensive observability systems ...

Ready to Apply?

Take the next step in your AI career. Submit your application to Hydrolix today.

Submit Application