Job Description

NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence.


We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack—from server-level request batching to GPU kernel fusion—and collaborate with teams across diverse disciplines to push Nvidia's hardware to its full potential.


What you’ll be doing:
+ Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes
+ Optimize inference workloads using sophisticated profiling and simulation tools
+ Build SOLID, extendab...

Ready to Apply?

Take the next step in your AI career. Submit your application to NVIDIA today.

Submit Application