Job Description

Apply for the DevOps & ML Ops Engineer role at TransPerfect.

Responsibilities

  • Manage resource allocation and workload scheduling for multiple ML services, ensuring efficient utilization of CPU/GPU resources and creating reliable queues based on service priorities.
  • Maintain VM environments, manage OS updates, and keep the VM inventory up to date.
  • Work alongside the Dev and QA teams to detect hot spots in our applications and put preventive measures in place before they become live issues.
  • Troubleshoot and provide solutions for system configurations.
  • Plan, execute, and test disaster recovery.
  • Monitor and examine all application, performance, event, and system logs to assist in troubleshooting.
  • File all IT/Colocation tickets, ensuring fulfillment of requests and escalating to the right person if necessary.
  • Design, develop, and maintain the infrastructure required for deploying and sc...
