Talent.com
Software Development Manager, LLM Inference Model Enablement, Neuron SDK
Annapurna Labs (U.S.) Inc. • Cupertino, California, USA
Job type: Full-time
DESCRIPTION
AWS Utility Computing (UC) provides product innovations, ranging from foundational services such as Amazon Elastic Compute Cloud (EC2) to new products that continue to set AWS’s services and features apart in the industry.

We develop AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and GPT-OSS to run really fast on Trainium.

As the SDM for the LLM Inference Model Enablement team, you will lead a team of expert AI/ML engineers to onboard and optimize state-of-the-art open-source and customer LLMs, both dense and MoE, for inference with Neuron on Trainium and Inferentia accelerators. You will also drive improvements in model enablement speed and experience, while advancing inference usability and quality through inference features, infrastructure optimization, tools, and automation.

The ideal candidate will have a strong background in LLM architectures, model performance optimization, and inference techniques, such as delivering high-performance models using distributed inference libraries. You should be capable of managing demanding, fast-changing priorities. You should also have the technical depth to understand and deliver within a vertically integrated system stack consisting of the PyTorch inference library, Neuron compiler, runtime, and collectives.

A day in the life
You will work with senior management and technical leaders to define the model enablement and performance optimization work for the latest SOTA LLMs, and then build and deliver those models to customers.

Meanwhile, you will lead the team in continuing to improve the model onboarding experience and in enhancing inference usability and quality for Neuron-supported models.

You will manage changing priorities as new models and new technologies emerge, and you will adapt your team’s work accordingly. You will dive deep to help your team solve technical challenges.

About the team
About AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.

Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

Mentorship & Career Growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

BASIC QUALIFICATIONS

- 3+ years of engineering team management experience
- 7+ years of experience working directly within engineering teams
- 3+ years of experience designing or architecting (design patterns, reliability, and scaling) new and existing systems
- Experience partnering with product or program management teams

PREFERRED QUALIFICATIONS

- Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy
- Experience in recruiting, hiring, mentoring/coaching, and managing teams of Software Engineers to improve their skills and make them more effective product software engineers

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers.