Job Description
Must‑Have Skills
- Incident management, RCA frameworks
- Kafka consumer tuning and Temporal worker health monitoring
- SLIs/SLOs/SLA compliance
Responsibilities
- Operate Import PO flows across multiple markets
- Maintain observability dashboards, alerts, service health
- Lead reliability improvements and automate recovery patterns
Nice‑to‑Have
- Chaos/resiliency testing
Ready to Apply?
Take the next step in your AI career. Submit your application to Insight Global today.
Submit Application