Job Description

Join a European applied machine learning team focused on building the next generation of large-scale training infrastructure for foundation models. You will contribute to the design and development of high-performance distributed systems that enable cutting-edge research in machine learning and reinforcement learning at scale.

The team’s mission is to create robust, efficient, and scalable training frameworks that accelerate experimentation and push the boundaries of model performance and system efficiency.

Key Responsibilities

As a senior member of the machine learning infrastructure team, you will work across the full training systems stack:

  • Design, develop, and scale distributed reinforcement learning training systems
  • Build high-performance RL pipelines supporting actor/learner architectures
  • Optimize large-scale training on accelerators (GPU/TPU), with a strong focus on JAX
  • Improve performance, reliability, ...

Ready to Apply?

Take the next step in your AI career. Submit your application to Cleeven group today.

Submit Application