AWS Neuron Software Development Lead

3 weeks ago


Cupertino, California, United States Amazon Full time

About the Role

We are seeking a skilled software development engineer to join our AWS Neuron team. As a software development lead, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families.

Key responsibilities include:

  • Tuning large language models like Llama2, GPT2, and GPT3 for highest performance and efficiency on AWS Trainium and Inferentia silicon.
  • Developing distributed inference solutions with PyTorch, TensorFlow, and XLA.
  • Collaborating with compiler engineers and runtime engineers to create and build these solutions.

Requirements

3+ years of professional software development experience, 2+ years of design or architecture experience, and experience programming in at least one programming language.

Salary Range: $129,300 - $223,600 per year based on location, depending on job-related knowledge, skills, and experience.

Benefits

We offer a comprehensive benefits package, including medical, financial, and other benefits. For more information, please visit Amazon's employee benefits page.



  • Cupertino, California, United States Annapurna Labs Full time

    Unlock the power of machine learning with AWS Neuron, a cutting-edge software stack designed to optimize performance on the Trainium and Inferentia accelerators. As a skilled developer, you'll join our team at Annapurna Labs to develop and enhance support for PyTorch and JAX frameworks, driving innovation and excellence in ML model development. Key...

  • AWS Neuron Engineer

    6 days ago


    Cupertino, California, United States Annapurna Labs Full time

    About This Role: We are seeking a highly skilled Senior Cloud Accelerator Software Developer to join our team at Annapurna Labs. As a Senior Software Development Engineer, you will be responsible for developing and enhancing support for PyTorch and JAX on our cloud-scale machine learning accelerators. Job Description: You will work closely with our team to...


  • Cupertino, California, United States Amazon Full time

    Job Description: We are looking for a talented AWS Neuron Runtime Optimization Engineer to join our team. In this role, you will work closely with our team to design, develop, and deploy high-performance software solutions that optimize the performance of complex neural net models executed on AWS Inferentia. The ideal candidate will have 5+ years of experience...


  • Cupertino, California, United States Amazon Full time

    **Job Details:** As a Software Development Engineer for Neuron, you will be part of the Machine Learning Applications team at Amazon. This role involves developing and tuning machine learning models for cloud-scale applications, working closely with compiler engineers and runtime engineers to optimize inference performance and develop distributed inference...


  • Cupertino, California, United States Amazon Full time

    About the Role: This role involves developing, enabling, and performance-tuning various machine learning model families. Responsibilities: Developing distributed inference support into PyTorch and TensorFlow using XLA and the Neuron compiler and runtime stacks. Tuning machine learning models to ensure highest performance and maximize efficiency on customer AWS...


  • Cupertino, California, United States Amazon Full time

    About the Role: Amazon's Machine Learning Engineering team is seeking a talented Team Lead to join our team. As a key member of our ML Apps team, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch,...


  • Cupertino, California, United States Amazon Full time

    Job Description: This is an exciting opportunity to work on some of the most challenging problems in machine learning and computer science. As a Senior Software Development Lead, you will be responsible for leading a team of software engineers in the development and optimization of large language models for cloud-scale inference solutions. You will design and...


  • Cupertino, California, United States Amazon Full time

    Job Description: As a Software Development Specialist, you will be responsible for developing and optimizing machine learning models for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions. Responsibilities: Develop high-quality software solutions to meet...


  • Cupertino, California, United States Amazon Full time

    About Amazon: Amazon is a leader in cloud computing, artificial intelligence, and related technologies. We are committed to innovation and excellence, with a focus on developing cutting-edge solutions that improve people's lives. This role is part of our Machine Learning Applications (ML Apps) team, which works on developing and optimizing cloud-scale machine...

  • Software Developer

    3 weeks ago


    Cupertino, California, United States Amazon Full time

    About the Job: We're looking for a skilled Software Developer to join our Machine Learning Applications team. As a key member of the team, you'll contribute to the design, development, and deployment of large-scale machine learning systems. Your expertise in distributed training libraries and frameworks will enable us to optimize and scale ML models for...


  • Cupertino, California, United States Amazon Full time

    About the Role: Amazon is looking for a Senior Distributed Training Specialist to join our team. In this role, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch, TensorFlow, and other libraries. Key...


  • Cupertino, California, United States Amazon Full time

    About the Role: We are looking for a talented Distributed Inference Solutions Developer to join our Machine Learning Applications team. As a key contributor, you will be responsible for building and maintaining distributed inference solutions using AWS Neuron, PyTorch, and other machine learning frameworks. Salary: $170,200 - $175,500 per...


  • Cupertino, California, United States Amazon Full time

    An exciting opportunity has arisen for a skilled compiler developer to join Annapurna Labs' team at Amazon. As a key member of the Neuron Compiler team, you will be working on developing a deep learning compiler stack that enables state-of-the-art LLM and Vision models to run performantly on custom Machine Learning accelerators. About the Team: The Neuron...


  • Cupertino, California, United States Amazon Full time

    A Day in the Life: As a member of our team, you'll work closely with experienced engineers and collaborate on projects that impact the development of cutting-edge technologies. You'll be part of a dynamic and supportive environment that encourages knowledge-sharing, mentorship, and career growth. Our team strives to create an environment that values...


  • Cupertino, California, United States Annapurna Labs Full time

    **Job Title:** Software Development Manager, AWS Neuron Machine Learning Distributed Training **Location:** Remote (with occasional travel to Amazon offices) **About Us:** Annapurna Labs is a fast-paced and innovative company at the forefront of cloud computing. We're looking for a highly skilled software development manager to lead our team in designing and...


  • Cupertino, California, United States Amazon Full time

    About the Job: We are seeking a Lead Software Development Specialist for AI who can drive the success of Machine Learning technologies at AWS. You will build automation that supports the success of peer teams, developing tools used to guarantee top performance of AWS ML and High Performance Computing (HPC) technologies. About the Team: You will join a team that...


  • Cupertino, California, United States Amazon Full time

    About the Job: We are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron. Salary: $173,450 - $178,650 per year. Key Responsibilities: Design and implement high-performance...


  • Cupertino, California, United States Amazon Full time

    About the Team: The AWS Neuron team is a fast-paced and intellectually challenging environment where you will work with thought-leaders in multiple technology areas. We value diversity and inclusion, and we are committed to creating a culture that empowers us to be proud of our differences. We are looking for individuals who are ready for this challenge and...


  • Cupertino, California, United States Amazon Full time

    **Role Overview:** We are seeking an experienced Software Development Engineer to join our Machine Learning Applications team for AWS Neuron. This role involves developing, enabling, and tuning machine learning models for cloud-scale applications. The ideal candidate will have strong software development skills in C++/Python and ML knowledge, with experience...


  • Cupertino, California, United States Amazon Full time

    Amazon is seeking a skilled Machine Learning Compiler Developer to join the AWS Neuron team. About the Role: You will be responsible for architecting and implementing business-critical features, publishing cutting-edge research, and contributing to a brilliant team of experienced engineers. As a Machine Learning Compiler Developer, you will leverage your...