Job Description
Apply for the DevOps & ML Ops Engineer role at TransPerfect .
Responsibilities
- Manage resource allocation and workload scheduling for multiple ML services, ensuring efficient utilization of CPU/GPU resources and creating reliable queues based on service priorities.
- Maintain VM environments and manage OS updates, keeping up-to-date VM inventory.
- Work alongside the Dev and QA teams to detect hot spots in our applications and set preventative measures before they become live issues.
- Troubleshoot and provide solutions for system configurations.
- Plan, execute, and test disaster recovery.
- Monitor and examine all application, performance, event, and system logs to assist in troubleshooting.
- File all IT/Colocation tickets, ensuring fulfillment of requests and escalating to the right person if necessary.
- Design, develop, and maintain the infrastructure required for deploying and sc...
Ready to Apply?
Take the next step in your AI career. Submit your application to TransPerfect today.
Submit Application