AWS Neuron Runtime Optimization Engineer

2 weeks ago


Cupertino, California, United States Amazon Full time
Job Description

We are looking for a talented AWS Neuron Runtime Optimization Engineer to join our team. In this role, you will work closely with the team to design, develop, and deploy high-performance software solutions that optimize the performance of complex neural net models executed on AWS Inferentia.

The ideal candidate will have 5+ years of programming experience in at least one programming language, and a strong background in machine learning and AI accelerators. You will have expertise in leading the design or architecture of new and existing systems, as well as experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.

Salary: $151,300 - $261,500 per year (based on location)

About Our Team

Our team is dedicated to delivering innovative software solutions that make deep learning pervasive for everyday developers and democratize access to cutting-edge infrastructure.



  • Cupertino, California, United States Amazon Full time

    About the Role: We are seeking a skilled software development engineer to join our AWS Neuron team. As a software development lead, you will be responsible for developing, enabling, and performance tuning of a wide variety of machine learning model families. Key responsibilities include: Tuning large language models like Llama2, GPT2, and GPT3 for highest...


  • Cupertino, California, United States Amazon Full time

    About the Role: This role involves developing, enabling, and performance tuning of various machine learning model families. Responsibilities: Developing distributed inference support into PyTorch and TensorFlow using XLA and the Neuron compiler and runtime stacks. Tuning machine learning models to ensure highest performance and maximize efficiency on customer AWS...


  • Cupertino, California, United States Amazon Full time

    **Job Details:** As a Software Development Engineer for Neuron, you will be part of the Machine Learning Applications team at Amazon. This role involves developing and tuning machine learning models for cloud-scale applications, working closely with compiler engineers and runtime engineers to optimize inference performance and develop distributed inference...


  • Cupertino, California, United States Amazon Full time

    About the Job: We are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron. Salary: $173,450 - $178,650 per year. Key Responsibilities: Design and implement high-performance...


  • Cupertino, California, United States Amazon Full time

    **Job Description:** A Software Development Engineer is needed in the Machine Learning Applications team for AWS Neuron. This role is responsible for development, enablement, and performance tuning of various machine learning models. The ideal candidate will have experience with distributed inference libraries such as DeepSpeed and optimizing inference...


  • Cupertino, California, United States Amazon Full time

    We are seeking a highly skilled Machine Learning Compiler Engineer to join our Amazon team in the AWS Neuron division. As a key member of our team, you will be responsible for architecting and implementing business-critical features, publishing cutting-edge research, and collaborating with experienced engineers to develop a compiler that handles the world's...


  • Cupertino, California, United States Amazon Full time

    About the Role: This is an exciting opportunity to join the Annapurna Labs team at Amazon Web Services (AWS) as a Senior Software Engineer. We are seeking a highly skilled engineer with expertise in deep learning and distributed training. As a member of our Machine Learning Applications (ML Apps) team, you will be responsible for developing and maintaining...


  • Cupertino, California, United States Amazon Full time

    Job Summary: The AWS Neuron team is seeking a skilled Software Development Engineer to join our Machine Learning Applications team. As a key member of this team, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families, including large language models and vision transformers. This role requires...


  • Cupertino, California, United States Amazon Full time

    About the Role: Amazon's Machine Learning Engineering team is seeking a talented Team Lead to join our team. As a key member of our ML Apps team, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch,...


  • Cupertino, California, United States Amazon Full time

    Job Overview: A high-level software development engineer position is available in the Machine Learning Applications team for AWS Neuron. About the Role: This role involves developing, enabling, and performance tuning of various machine learning model families, including large language models, stable diffusion, and vision transformers. The successful candidate will...


  • Cupertino, California, United States Amazon Full time

    Team Overview: The AWS Neuron Inference team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1/Inf2. As a member of this team, you will be responsible for developing, enabling, and optimizing machine learning models for cloud-scale inference accelerators. About the Team: We...


  • Cupertino, California, United States Amazon Full time

    About the Role: We are seeking a highly skilled Machine Learning Engineer to join our team in developing and optimizing machine learning models for AWS Neuron. This role will involve working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions. Key Responsibilities: Design and develop high-impact...


  • Cupertino, California, United States Amazon Full time

    About Amazon: Amazon is a leader in cloud computing, artificial intelligence, and related technologies. We are committed to innovation and excellence, with a focus on developing cutting-edge solutions that improve people's lives. This role is part of our Machine Learning Applications (ML Apps) team, which works on developing and optimizing cloud-scale machine...


  • Cupertino, California, United States Amazon Full time

    About the Role: We are seeking a highly skilled Senior Runtime SDE to join our AWS Neuron team. As a key member of our team, you will be responsible for designing and developing innovative software solutions that optimize the performance of complex neural net models executed on AWS Inferentia. Our ideal candidate will have a strong background in machine...


  • Cupertino, California, United States Amazon Full time

    Job Description: This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. The team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1. As a cloud-scale machine learning engineer, you will be responsible for optimizing inference...


  • Cupertino, California, United States Amazon Full time

    **Role Overview:** We are seeking an experienced Software Development Engineer to join our Machine Learning Applications team for AWS Neuron. This role involves developing, enabling, and tuning machine learning models for cloud-scale applications. The ideal candidate will have strong software development skills in C++/Python and ML knowledge, with experience...


  • Cupertino, California, United States Amazon Full time

    Cloud Scale Solutions Developer: We are seeking a talented developer to join our team in creating and optimizing cloud-scale machine learning solutions for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to build and tune distributed inference solutions. Main Responsibilities: Design and develop scalable machine...


  • Cupertino, California, United States Amazon Full time

    Job Description: As a Software Development Specialist, you will be responsible for developing and optimizing machine learning models for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions. Responsibilities: Develop high-quality software solutions to meet...


  • Cupertino, California, United States Amazon Full time

    About the Role: Amazon is looking for a Senior Distributed Training Specialist to join our team. In this role, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch, TensorFlow, and other libraries. Key...

  • Software Developer

    4 days ago


    Cupertino, California, United States Amazon Full time

    About the Job: We're looking for a skilled Software Developer to join our Machine Learning Applications team. As a key member of the team, you'll contribute to the design, development, and deployment of large-scale machine learning systems. Your expertise in distributed training libraries and frameworks will enable us to optimize and scale ML models for...