Job Description
The primary mission of this role is to ensure the high availability and stability of enterprise applications running on IBM WebSphere and Red Hat OpenShift. This is a support-centric position focused on proactive monitoring, rapid incident resolution (L3), and the continuous optimization of production environments to meet strict Service Level Agreements (SLAs).
Key Responsibilities
- Incident & Problem Management (L3): Act as the final point of technical escalation for complex outages involving WebSphere Application Server (WAS) and OpenShift clusters.
- Production Stability: Monitor environment health 24/7 using enterprise observability tools and execute immediate recovery actions during critical failures.
- Automation & Scripting: Develop and maintain Bash and Python scripts to automate repetitive support tasks, log collection, and automated health checks across the platform.
- Root Cause Analysis (RCA): Lead deep-dive investigation...
Ready to Apply?
Take the next step in your AI career. Submit your application to BONbLOC today.
Submit Application