Job Description
About the role:
The MLOps Support Engineer is an operations-first role, focused on ensuring AI/ML systems remain stable, observable, and supportable in production environments. This is not a data science or feature development role.
The primary objective is to maintain continuous performance of ML models and associated pipelines with minimal disruption to both internal and client-facing services. You will provide Tier 1 and Tier 2 support, escalating to Tier 3 Engineering as needed.
What you’ll do:
- Provide Tier 1 / Tier 2 operational support for AI/ML solutions.
- Identify failed jobs, degraded pipelines, or performance anomalies.
- Triage incidents, investigate issues, and coordinate escalation to Tier 3 Engineering.
- Participate in on-call rotas once established.
- Validate that pipelines and jobs complete successfully.
- Monitor data pipeline health, model execution, and basic performance metric...
Ready to Apply?
Take the next step in your AI career. Submit your application to CloudFactory Limited today.
Submit Application