Job Description

Must‑Have Skills
Incident management, RCA frameworks
Kafka consumer tuning and Temporal worker health monitoring
SLIs, SLOs, and SLA compliance

Responsibilities
Operate Import PO flows across multiple markets
Maintain observability dashboards, alerts, and service health
Lead reliability improvements and automate recovery patterns

Nice‑to‑Have
Chaos/resiliency testing

Ready to Apply?

Take the next step in your AI career. Submit your application to Insight Global today.
