Job Description

Description


Seeking an AI/ML Operations professional for the following role -


Overall Responsibilities

  • Manage operational workflows for model deployments, updates, and versioning across GCP, Azure, and AWS.
  • Monitor model performance metrics: latency, throughput, error rates, token usage, and inference quality
  • Track model drift, accuracy degradation, and performance anomalies - escalating to engineering as needed.
  • Support knowledge base operations including vector embedding pipeline health, chunk quality, and refresh cycles in Vertex AI.
  • Maintain model inventory and documentation across multi-cloud environments.
  • Coordinate model evaluation cycles with Responsible AI and Core Engineering teams


Agent & MCP Server Operations

  • Monitor AI agent health, performance, and reliability (AutoGen-based agents, MCP servers)
  • T...

Ready to Apply?

Take the next step in your AI career. Submit your application to Milestone Technologies today.

Submit Application