Job Description

Overview
Will be responsible for Eyes on glass Monitoring, Triage & Incident Ownership, Troubleshooting & Restoration, Cross-Team Collaboration, Platform & Application Stack Awareness and Service Quality & Process Excellence. Responsibilities
Perform rapid intake, triage, and prioritization of alerts, tickets, and incidents. Act as Incident Owner during high-severity events, ensuring clear communication, timely updates, and swift restoration of service. Maintain accurate, real-time incident timelines and post-incident documentation. Execute root-cause isolation across application, middleware, APIs, data, and infrastructure layers. Use observability/monitoring tools (e.g., Kibana, Dynatrace, Cloud Watch, Grafana) to correlate logs, metrics, and traces; identify anomalies, performance bottlenecks, and failure patterns. Perform targeted mitigations, rollbacks, config fixes, and coordinate hotfixes to restore service quickly. Engage with App Dev, Dev Ops, Database, Network, Security,...

Ready to Apply?

Take the next step in your AI career. Submit your application to Luxoft today.

Submit Application