Job Description

NVIDIA has continuously reinvented itself over two decades. NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world.


This is our life’s work — to amplify human imagination and intelligence. AI becomes more and more important in Auto Driving and AI City. NVIDIA is at the forefront of the Auto Driving and AI City revolution and providing powerful solutions for them. All these solutions are based on GPU-accelerated libraries, such as CUDA, TensorRT and V/LLM inference framework etc. Now, we are now looking for an LLM inference framework developer engineer based in Shanghai.


What you’ll be doing:
+ Craft and develop robust inferencing software that can be scale...

Ready to Apply?

Take the next step in your AI career. Submit your application to NVIDIA today.

Submit Application