Job Description

We are looking for a highly experienced

DevOps / Site Reliability Engineer (SRE)

to support and operate

mission-critical production systems

across hybrid environments. The ideal candidate will have strong expertise in

incident management, CI/CD, Kubernetes operations, and cloud infrastructure (AWS/Azure) .
You will play a key role in

ensuring system reliability, deployment stability, and rapid incident resolution , working closely with engineering and support teams.

Key Responsibilities
Production Operations & Incident Response (Primary)
Support

24x7 production systems

for services and integrations
Participate in

on-call rotation (primarily weekdays)
Troubleshoot incidents across:
CI/CD pipelines
Kubernetes clusters
API Gateway
Networking and applications
Perform

incident triage, mitigation, and recovery
Ensure

safe deployments with rollback mecha...

Ready to Apply?

Take the next step in your AI career. Submit your application to Neurealm today.

Submit Application