Job Description
Our client is currently seeking a Sr. Machine Learning Engineer
Design, develop, and optimize a wide range of machine learning models, including deep learning architectures, LLMs, and transformer‑based models (., BERT classifiers). Build and manage distributed training workflows using PyTorch and related frameworks. Fine‑tune large language models and enhance inference performance using tools such as the Neuron Compiler, ONNX, and vLLM. Optimize models for diverse hardware platforms including GPUs, TPUs, AWS Inferentia, and Trainium. Design end‑to‑end AI service architectures supporting real‑time streaming and offline batch processing. Lead the development of ML infrastructure encompassing data ingestion, feature engineering, model training, deployment, and monitoring. Build scalable inference systems for both real‑time...
Machine Learning Model Development & Optimization
AI Infrastructure & Services Architecture
Ready to Apply?
Take the next step in your AI career. Submit your application to The Judge Group today.
Submit Application