Job Description

Capability Requirements
Execute performance tuning activities for model serving infrastructure to maintain optimal latency and throughput.
Conduct post-deployment validation checks to ensure model prediction stability, API responsiveness, and overall service quality.
Support the enhancement of operational pipelines, including CI/CD workflows, configuration templates, and automated monitoring scripts.
Participate in service reliability reviews to improve platform uptime, incident response processes, and operational readiness.
Coordinate closely with DevOps and Platform Engineering to address infrastructure-level concerns related to model hosting and deployment.
Assist in the rollout of platform-level improvements, including model registry enhancements, container optimization, and new monitoring tools.
Minimum Qualifications
Related Work Experience:
Minimum of 2+ years hands-on experience in a production environment covering MLOps, Data Engineering, or Software...

Ready to Apply?

Take the next step in your AI career. Submit your application to Yondu, Inc. today.

Submit Application