Job Description
At NVIDIA, we are at the forefront of the constantly evolving field of large language models, and their application in agentic and reasoning use cases. As the scale and complexity of these LLM systems continues to increase, we are seeking outstanding engineers to join our team and help shape the future of LLM inference.
Our team is dedicated to pushing the boundaries of what's possible with LLMs by improving the algorithmic performance and efficiency of systems that represent them. We constantly reflect on how to improve these systems, developing new inference algorithms and protocols, improving existing models, and seamlessly integrating improvements to ensure NVIDIA's solutions can efficiently handle large-scale, sophisticated tasks.
What you'll be doing:
+ Research and Development: Explore and incorporate contemporary research on generative AI, agents, and inference systems into the NVIDIA LLM software stack.
+ Workload Analysis and Optimizatio...
Our team is dedicated to pushing the boundaries of what's possible with LLMs by improving the algorithmic performance and efficiency of systems that represent them. We constantly reflect on how to improve these systems, developing new inference algorithms and protocols, improving existing models, and seamlessly integrating improvements to ensure NVIDIA's solutions can efficiently handle large-scale, sophisticated tasks.
What you'll be doing:
+ Research and Development: Explore and incorporate contemporary research on generative AI, agents, and inference systems into the NVIDIA LLM software stack.
+ Workload Analysis and Optimizatio...
Ready to Apply?
Take the next step in your AI career. Submit your application to NVIDIA today.
Submit Application