Job Description

Must‑Have Skills

  • Incident management and root cause analysis (RCA) frameworks
  • Kafka consumer tuning and Temporal worker health monitoring
  • SLIs, SLOs, and SLA compliance


Responsibilities

  • Operate Import PO flows across multiple markets
  • Maintain observability dashboards, alerts, and service health
  • Lead reliability improvements and automate recovery patterns


Nice‑to‑Have

  • Chaos/resiliency testing

Ready to Apply?

Take the next step in your AI career. Submit your application to Insight Global today.
