Job Description

Site Reliability Engineer

OpenMinds® is looking for a Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of critical services by bridging development and operations.

Job Description

The SRE focuses on scalable infrastructure, SRE practices such as SLOs and SLIs, reducing operational toil, and fostering a continuous learning culture.

Day‑to‑Day Duties

  • Design and implement resilient system architectures for high availability and scalability.
  • Develop automation tools and scripts to improve operational efficiency.
  • Define, track, and analyze SLOs and SLIs for performance and reliability.
  • Conduct post‑mortem analyses and implement improvements based on findings.
  • Collaborate on best practices for system reliability and incident management.
  • Troubleshoot and resolve database, network, and deployment issues.
  • Ensure issue resolution meets Servi...

Ready to Apply?

Take the next step in your AI career. Submit your application to OpenMinds® today.

Submit Application