Job Description

Machine Learning Engineer III – LLM Training (RL + PEFT)

On-site, Bangalore

LatentForce

About the Role
We are building specialized LLMs that understand and reason over massive enterprise codebases. This is

real model training

— RL loops, PEFT, verifiable rewards, long-context modeling — not API integration. You’ll own end-to-end experimentation and work directly with founders.

Responsibilities
Train LLMs using

RL (PPO/GRPO/RLHF/RLVR)

and

PEFT

(LoRA, QLoRA, DoRA, IA3).
Build custom training loops with

PyTorch, HuggingFace, TRL, Unsloth .
Design reward functions and verifiers for code-understanding tasks.
Run full-stack ML experiments: data → training → eval → iteration.
Develop scalable training infra (FSDP/DeepSpeed, distributed training).
Build evaluation suites for reasoning and code comprehension.

Minimum Qualifications
3+ years

of real deep le...

Ready to Apply?

Take the next step in your AI career. Submit your application to LatentForce today.

Submit Application