Job Description
Description
Seeking an AI/ML Operations professional for the following role -
Overall Responsibilities
- Manage operational workflows for model deployments, updates, and versioning across GCP, Azure, and AWS.
- Monitor model performance metrics: latency, throughput, error rates, token usage, and inference quality
- Track model drift, accuracy degradation, and performance anomalies - escalating to engineering as needed.
- Support knowledge base operations including vector embedding pipeline health, chunk quality, and refresh cycles in Vertex AI.
- Maintain model inventory and documentation across multi-cloud environments.
- Coordinate model evaluation cycles with Responsible AI and Core Engineering teams
Agent & MCP Server Operations
- Monitor AI agent health, performance, and reliability (AutoGen-based agents, MCP servers)
- T...
Ready to Apply?
Take the next step in your AI career. Submit your application to Milestone Technologies today.
Submit Application