Job Description

Machine Learning Engineer III – LLM Training (RL + PEFT)


📍 On-site, Bangalore


🏢 LatentForce


About the Role

We are building specialized LLMs that understand and reason over massive enterprise codebases. This is real model training (RL loops, PEFT, verifiable rewards, long-context modeling), not API integration. You’ll own end-to-end experimentation and work directly with the founders.


Responsibilities

  • Train LLMs using RL (PPO/GRPO/RLHF/RLVR) and PEFT (LoRA, QLoRA, DoRA, IA3).
  • Build custom training loops with PyTorch, HuggingFace, TRL, Unsloth.
  • Design reward functions and verifiers for code-understanding tasks.
  • Run full-stack ML experiments: data → training → eval → iteration.
  • Develop scalable training infra (FSDP/DeepS...
