Job Description

# Lead Machine Learning EngineerSingapore, Singapore## Job responsibilities* Lead the design and implementation of advanced model optimization pipelines, including quantization, pruning, and distillation.Architect and tune inference runtimes and serving frameworks to achieve optimal performance across deployments.* Guide teams in implementing high-throughput serving strategies (continuous batching, KV caching, speculative decoding, asynchronous scheduling).* Develop benchmarks and performance dashboards to measure and communicate system-level efficiency improvements (throughput, latency, GPU utilization, cost).* Evaluate trade-offs across accuracy, performance, and cost, and design architectures to meet target SLAs across varied hardware environments (cloud, on-prem, edge).* Collaborate with infrastructure, MLOps, and product teams to embed inference optimization into production workflows and platform designs.* Provide technical leadership and mentorship to engineers, fostering a cultu...

Ready to Apply?

Take the next step in your AI career. Submit your application to Thoughtworks Inc. today.

Submit Application