Job Description
We are now looking for a HPC Operations Engineer to join our mission and continue improving our HPC infrastructure. A meaningful part of NVIDIA’s strength is our unique and advanced development tools and environments that enable our incredible pace of innovation. We are looking for architects to help us evolve the way our private compute cloud is architected and optimized.
What you’ll be doing:
+ Troubleshoot incoming support requests in a large-scale HPC environment.
+ Contribute enhancements to existing deployment automation, configuration management, observability, and operational monitoring and day to day operation through automation.
+ Ensure compute servers are running correct Operating System and configuration.
+ Troubleshoot Complex Issues: Perform comprehensive troubleshooting from bare metal to application level, ensuring system reliability and efficiency.
+ Collaborate with special...
Ready to Apply?
Take the next step in your AI career. Submit your application to Nvidia today.
Submit Application