Software Development Engineer, Annapurna Labs, Trainium Collectives
6 days ago
We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important. Experience with embedded systems is valued, and experience with high-speed networking or HPC interconnects is valued highly.
If you like solving hard problems, want to work with HPC and ML customers, iterate fast and deliver meaningful solutions at scale, then come join us This truly is a role on the forefront of AI/ML, you'll be working on features for the largest clusters, with the largest customers, for the largest AI models.
The org you would be joining is Annapurna Labs, an integral part of AWS and develops hardware and software components that are critical building blocks for EC2 infrastructure. Every instance in EC2 is running some type of hardware designed in Annapurna Labs. We specialize in designing software, systems and chips that optimize the AWS customer experience.
A day in the life
Annapurna Labs, a crucial part of AWS, is responsible for developing hardware and software components for EC2 infrastructure. Our team focuses on building networking solutions that for Machine Learning (ML) and High-Performance Computing (HPC) workloads on AWS.
We have mixed discipline orgs, you'd be working side by side with infrastructure experts, hardware engineers, RTL engineers, scientists & architects. Our workforce spans the globe and is truly international, you'll find yourself working side by side with individuals from numerous countries. We take mentorship seriously, you can both expect senior mentorship and will be expected to mentor new and junior engineers.
The pace is fast as we work on the latest advancements of AI/ML, but we take the time to bond as a team and enjoy the successes. We offer flexibility in working hours, and respect WLB as a core org tenet. The team enjoys working with numerous principal-level engineers and closely with directors, career growth opportunities are certainly available. This is a role where you will always be encouraged to keep learning, the AI/ML field is fast moving and constantly evolving.
About the team
Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.
About AWS
Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
Here at AWS, it's in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there's nothing we can't achieve in the cloud.
Mentorship & Career Growth
We're continuously raising our performance bar as we strive to become Earth's Best Employer. That's why you'll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
- 3+ years of non-internship professional software development experience
- 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Experience programming with at least one software programming language
- 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Bachelor's degree in computer science or equivalent
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit This position will remain posted until filled. Applicants should apply via our internal or external career site.
-
Cupertino, California, United States Amazon Web Services (AWS) Full timeDescriptionWe are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important....
-
Cupertino, California, United States Amazon Full time $129,300 - $223,600We are seeking an experienced engineer to work on distributed AI/ML systems. This role involves working on collective operations - the fundamental operations that enable AI to scale across multiple accelerators & servers. Most of our stack is C/C++ and relatively low level, so solid knowledge of Linux, kernels, and performant code is important. Experience...
-
Software Development Manager
3 days ago
Cupertino, California, United States Haystack Full timeSoftware Development Manager, LLM Inference Model Enablement, Neuron SDK | Cupertino, California | Remote-Friendly | $166,400 - $287,700We're working with Annapurna Labs (U.S.) Inc. on this exciting opportunity.This role offers a unique opportunity to lead a team of expert AI/ML engineers in optimizing and enabling state-of-the-art open-source and customer...
-
Sr. Product Manager
1 week ago
Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full timeDESCRIPTIONAWS Neuron is looking for an experienced Technical Product Manager to define and drive product strategy for the Neuron Kernel Interface (NKI), a compiler library enabling custom kernel development and optimization. You will be part of the AWS Neuron Product Management team, driving innovation in machine learning acceleration software. AWS Neuron...
-
Cupertino, California, United States Amazon Full time $129,300 - $223,600Annapurna Labs builds custom Machine Learning accelerators that are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Neuron Compiler Engineering team is searching for a Software Development Engineer to support the development infrastructure of a compiler to enable the world's largest ML workloads to...
-
C/C++ Hardware
1 day ago
Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full timeDESCRIPTIONAnnapurna Labs stands at the forefront of hardware/software co-design, leading innovation not just within Amazon Web Services (AWS) but across the entire industry. We design and build every component of our hardware and software to create best-in-class machine learning products that accelerate customer workloads through industry leading hardware...
-
Cupertino, California, United States Amazon Web Services Full timeDESCRIPTIONThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and...
-
Cupertino, California, United States Amazon Full time $166,400 - $287,700AWS Trainium servers are complex supercomputers, with both hardware and software built entirely in-house from the ground-up. We're looking for someone to lead our SoC (System on a Chip) Hardware Abstraction Layer (HAL) team. You'll be responsible for directing the team both technically and managerially, getting into the details of both. You'll dig in to...
-
Cupertino, California, United States Amazon Full time $133,900 - $231,400We're seeking a Senior Engineering Technical Program Manager for the Annapurna ML networking teams. In this role you will be responsible for scoping and delivering large projects end-to-end. Responsibilities include collection of business and systems requirements from engineering leaders, driving project schedules from design to release, and managing the...
-
Senior GTM Specialist
2 weeks ago
Cupertino, California, United States Amazon Full time $133,200 - $220,200Senior Strategic Go to Market (GTM) Specialist, role on the Amazon Annapurna Labs ML team is responsible for driving growth with Trainium and Neuron partners, ISVs and customers, . This position focuses on understanding influential partners and solutions that will expand the scope and abilities of Trianium and develop strategies to on board them to AWS AI...