Job Description

We are looking for an AI/ML Ops Engineer to support the deployment, monitoring, and operational reliability of AI-powered systems in production environments.

This role combines elements of DevOps, cloud engineering, and AI system support. The ideal candidate is comfortable working with cloud infrastructure, monitoring tools, and modern AI workflows, and collaborates closely with engineering and AI teams.


Key Responsibilities


  • Support deployment and operational management of AI/ML applications and services
  • Monitor AI systems using logs, metrics, tracing, and observability tools
  • Troubleshoot and debug AI workflows, pipelines, and runtime failures
  • Assist in maintaining scalable, secure, and reliable cloud infrastructure
  • Support prompt experimentation, version tracking, and A/B testing activities
  • Collaborate with engineering teams to improve system reliability, performance, and ...

Ready to Apply?

Take the next step in your AI career. Submit your application to 99x today.
