AWS Neuron Runtime Optimization Engineer

2 weeks ago

Cupertino, California, United States Amazon Full time

Job Description

We are looking for a talented AWS Neuron Runtime Optimization Engineer to join our team. In this role, you will work closely with our team to design, develop, and deploy high-performance software solutions that optimize the performance of complex neural net models executed on AWS Inferentia.

The ideal candidate will have 5+ years of experience in programming with at least one software programming language, and a strong background in machine learning and AI accelerators. You will have expertise in leading design or architecture of new and existing systems, as well as experience in full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.

Salary: $151,300 - $261,500 per year (based on location)

About Our Team

Our team is dedicated to delivering innovative software solutions that make deep learning pervasive for everyday developers and democratize access to cutting-edge infrastructure.

AWS Neuron Software Development Lead

2 weeks ago

Cupertino, California, United States Amazon Full time

About the RoleWe are seeking a skilled software development engineer to join our AWS Neuron team. As a software development lead, you will be responsible for developing, enabling, and performance tuning of a wide variety of machine learning model families.Key responsibilities include:Tuning large language models like Llama2, GPT2, and GPT3 for highest...
AWS Neuron Distributed Inference Expert

2 weeks ago

Cupertino, California, United States Amazon Full time

About the RoleThis role involves developing, enabling, and performance tuning of various machine learning model families.Responsibilities:Developing distributed inference support into PyTorch and Tensorflow using XLA and the Neuron compiler and runtime stacks.Tuning machine learning models to ensure highest performance and maximize efficiency on customer AWS...
Software Development Engineer for Neuron

2 weeks ago

Cupertino, California, United States Amazon Full time

**Job Details:**As a Software Development Engineer for Neuron, you will be part of the Machine Learning Applications team at Amazon. This role involves developing and tuning machine learning models for cloud-scale applications, working closely with compiler engineers and runtime engineers to optimize inference performance and develop distributed inference...
Cloud Scale Machine Learning Engineer

2 weeks ago

Cupertino, California, United States Amazon Full time

About the JobWe are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron.Salary: $173,450 - $178,650 per yearKey ResponsibilitiesDesign and implement high-performance...
Machine Learning Applications Engineer

2 weeks ago

Cupertino, California, United States Amazon Full time

**Job Description:**A Software Development Engineer is needed in the Machine Learning Applications team for AWS Neuron. This role is responsible for development, enablement, and performance tuning of various machine learning models.The ideal candidate will have experience with distributed inference libraries such as Deepspeed and optimizing inference...
Machine Learning Compiler Engineer: Unlocking Performance with AWS Neuron

1 month ago

Cupertino, California, United States Amazon Full time

We are seeking a highly skilled Machine Learning Compiler Engineer to join our Amazon team in the AWS Neuron division. As a key member of our team, you will be responsible for architecting and implementing business-critical features, publishing cutting-edge research, and collaborating with experienced engineers to develop a compiler that handles the world's...
Machine Learning Engineer, Distributed Training Expert

2 weeks ago

Cupertino, California, United States Amazon Full time

About the RoleThis is an exciting opportunity to join the Annapurna Labs team at Amazon Web Services (AWS) as a Senior Software Engineer. We are seeking a highly skilled engineer with expertise in deep learning and distributed training. As a member of our Machine Learning Applications (ML Apps) team, you will be responsible for developing and maintaining...
Machine Learning Applications Engineer

2 weeks ago

Cupertino, California, United States Amazon Full time

Job SummaryThe AWS Neuron team is seeking a skilled Software Development Engineer to join our Machine Learning Applications team. As a key member of this team, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families, including large language models and vision transformers.This role requires...
Machine Learning Engineering Team Lead

2 weeks ago

Cupertino, California, United States Amazon Full time

About the Role: Amazon's Machine Learning Engineering team is seeking a talented Team Lead to join our team. As a key member of our ML Apps team, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch,...
Machine Learning Applications Engineer

2 weeks ago

Cupertino, California, United States Amazon Full time

Job OverviewA high-level software development engineer position is available in the Machine Learning Applications team for AWS Neuron.About the RoleThis role involves developing, enabling, and performance tuning of various machine learning model families, including large language models, stable diffusion, and vision transformers.The successful candidate will...
Cloud AI Engineering Specialist

2 weeks ago

Cupertino, California, United States Amazon Full time

Team Overview:The AWS Neuron Inference team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1/Inf2. As a member of this team, you will be responsible for developing, enabling, and optimizing machine learning models for cloud-scale inference accelerators.About the Team:We...
Machine Learning Engineer

2 weeks ago

Cupertino, California, United States Amazon Full time

About the Role: We are seeking a highly skilled Machine Learning Engineer to join our team in developing and optimizing machine learning models for AWS Neuron. This role will involve working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions.Key Responsibilities:Design and develop high-impact...
Machine Learning Applications Development Lead

4 days ago

Cupertino, California, United States Amazon Full time

About Amazon">Amazon is a leader in cloud computing, artificial intelligence, and related technologies. We are committed to innovation and excellence, with a focus on developing cutting-edge solutions that improve people's lives.This role is part of our Machine Learning Applications (ML Apps) team, which works on developing and optimizing cloud-scale machine...
Cloud Computing Architect

4 days ago

Cupertino, California, United States Amazon Full time

About the RoleWe are seeking a highly skilled Senior Runtime SDE to join our AWS Neuron team. As a key member of our team, you will be responsible for designing and developing innovative software solutions that optimize the performance of complex neural net models executed on AWS Inferentia.Our ideal candidate will have a strong background in machine...
Cloud-Scale Machine Learning Engineer

2 weeks ago

Cupertino, California, United States Amazon Full time

Job DescriptionThis role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. The team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1.As a cloud-scale machine learning engineer, you will be responsible for optimizing inference...
Cloud Scale Machine Learning Developer

2 weeks ago

Cupertino, California, United States Amazon Full time

**Role Overview:**We are seeking an experienced Software Development Engineer to join our Machine Learning Applications team for AWS Neuron. This role involves developing, enabling, and tuning machine learning models for cloud-scale applications.The ideal candidate will have strong software development skills in C++/Python and ML knowledge, with experience...
Cloud Scale Solutions Developer

2 weeks ago

Cupertino, California, United States Amazon Full time

Cloud Scale Solutions Developer: We are seeking a talented developer to join our team in creating and optimizing cloud-scale machine learning solutions for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to build and tune distributed inference solutions.Main Responsibilities:Design and develop scalable machine...
Software Development Specialist

2 weeks ago

Cupertino, California, United States Amazon Full time

Job Description: As a Software Development Specialist, you will be responsible for developing and optimizing machine learning models for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions.Responsibilities:Develop high-quality software solutions to meet...
Senior Distributed Training Specialist

1 week ago

Cupertino, California, United States Amazon Full time

About the Role: Amazon is looking for a Senior Distributed Training Specialist to join our team. In this role, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch, TensorFlow, and other libraries.Key...
Software Developer

4 days ago

Cupertino, California, United States Amazon Full time

About the JobWe're looking for a skilled Software Developer to join our Machine Learning Applications team. As a key member of the team, you'll contribute to the design, development, and deployment of large-scale machine learning systems. Your expertise in distributed training libraries and frameworks will enable us to optimize and scale ML models for...

Americas

Europe

Asia / Oceania

Africa

AWS Neuron Runtime Optimization Engineer