Job Description

Description
Shape the Future of AI Accelerators at AWS Neuron

Join the elite team behind AWS Neuron—the software stack powering AWS's next-generation AI accelerators Inferentia and Trainium. As a Senior Software Engineer in our Machine Learning Applications team, you'll be at the forefront of deploying and optimizing some of the world's most sophisticated AI models at unprecedented scale.

What You'll Impact:
• Pioneer distributed inference solutions for industry-leading LLMs such as GPT, Llama, Qwen
• Optimize breakthrough language and vision generative AI models
• Collaborate directly with silicon architects and compiler teams to push the boundaries of AI acceleration
• Drive performance benchmarking and tuning that directly impacts millions of inference calls globally

Key job responsibilities
You will drive the Evolution of Distributed AI at AWS Neuron

As a Technical Leader at the forefront of AWS's AI Accelerator, you'll archite...

Ready to Apply?

Take the next step in your AI career. Submit your application to Amazon today.

Submit Application