AWS Neuron Runtime Optimization Engineer
2 weeks ago
We are looking for a talented AWS Neuron Runtime Optimization Engineer to join our team. In this role, you will design, develop, and deploy high-performance software solutions that optimize the performance of complex neural net models executed on AWS Inferentia.
The ideal candidate will have 5+ years of experience programming in at least one programming language, and a strong background in machine learning and AI accelerators. You will have expertise in leading the design or architecture of new and existing systems, as well as experience with the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.
Salary: $151,300 - $261,500 per year (based on location)
About Our Team
Our team is dedicated to delivering innovative software solutions that make deep learning pervasive for everyday developers and democratize access to cutting-edge infrastructure.
-
AWS Neuron Software Development Lead
2 weeks ago
Cupertino, California, United States Amazon Full time
About the Role
We are seeking a skilled software development engineer to join our AWS Neuron team. As a software development lead, you will be responsible for developing, enabling, and performance tuning of a wide variety of machine learning model families.
Key responsibilities include:
Tuning large language models like Llama2, GPT2, and GPT3 for highest...
-
AWS Neuron Distributed Inference Expert
2 weeks ago
Cupertino, California, United States Amazon Full time
About the Role
This role involves developing, enabling, and performance tuning of various machine learning model families.
Responsibilities:
Developing distributed inference support into PyTorch and TensorFlow using XLA and the Neuron compiler and runtime stacks.
Tuning machine learning models to ensure highest performance and maximize efficiency on customer AWS...
-
Software Development Engineer for Neuron
2 weeks ago
Cupertino, California, United States Amazon Full time
**Job Details:**
As a Software Development Engineer for Neuron, you will be part of the Machine Learning Applications team at Amazon. This role involves developing and tuning machine learning models for cloud-scale applications, working closely with compiler engineers and runtime engineers to optimize inference performance and develop distributed inference...
-
Cloud Scale Machine Learning Engineer
2 weeks ago
Cupertino, California, United States Amazon Full time
About the Job
We are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron.
Salary: $173,450 - $178,650 per year
Key Responsibilities
Design and implement high-performance...
-
Machine Learning Applications Engineer
2 weeks ago
Cupertino, California, United States Amazon Full time
**Job Description:**
A Software Development Engineer is needed in the Machine Learning Applications team for AWS Neuron. This role is responsible for development, enablement, and performance tuning of various machine learning models.
The ideal candidate will have experience with distributed inference libraries such as DeepSpeed and optimizing inference...
-
Cupertino, California, United States Amazon Full time
We are seeking a highly skilled Machine Learning Compiler Engineer to join our Amazon team in the AWS Neuron division. As a key member of our team, you will be responsible for architecting and implementing business-critical features, publishing cutting-edge research, and collaborating with experienced engineers to develop a compiler that handles the world's...
-
Cupertino, California, United States Amazon Full time
About the Role
This is an exciting opportunity to join the Annapurna Labs team at Amazon Web Services (AWS) as a Senior Software Engineer. We are seeking a highly skilled engineer with expertise in deep learning and distributed training. As a member of our Machine Learning Applications (ML Apps) team, you will be responsible for developing and maintaining...
-
Machine Learning Applications Engineer
2 weeks ago
Cupertino, California, United States Amazon Full time
Job Summary
The AWS Neuron team is seeking a skilled Software Development Engineer to join our Machine Learning Applications team. As a key member of this team, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families, including large language models and vision transformers.
This role requires...
-
Machine Learning Engineering Team Lead
2 weeks ago
Cupertino, California, United States Amazon Full time
About the Role: Amazon's Machine Learning Engineering team is seeking a talented Team Lead to join our team. As a key member of our ML Apps team, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch,...
-
Machine Learning Applications Engineer
2 weeks ago
Cupertino, California, United States Amazon Full time
Job Overview
A high-level software development engineer position is available in the Machine Learning Applications team for AWS Neuron.
About the Role
This role involves developing, enabling, and performance tuning of various machine learning model families, including large language models, stable diffusion, and vision transformers.
The successful candidate will...
-
Cloud AI Engineering Specialist
2 weeks ago
Cupertino, California, United States Amazon Full time
Team Overview:
The AWS Neuron Inference team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1/Inf2. As a member of this team, you will be responsible for developing, enabling, and optimizing machine learning models for cloud-scale inference accelerators.
About the Team:
We...
-
Machine Learning Engineer
2 weeks ago
Cupertino, California, United States Amazon Full time
About the Role: We are seeking a highly skilled Machine Learning Engineer to join our team in developing and optimizing machine learning models for AWS Neuron. This role will involve working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions.
Key Responsibilities:
Design and develop high-impact...
-
Cupertino, California, United States Amazon Full time
Amazon is a leader in cloud computing, artificial intelligence, and related technologies. We are committed to innovation and excellence, with a focus on developing cutting-edge solutions that improve people's lives.
This role is part of our Machine Learning Applications (ML Apps) team, which works on developing and optimizing cloud-scale machine...
-
Cloud Computing Architect
4 days ago
Cupertino, California, United States Amazon Full time
About the Role
We are seeking a highly skilled Senior Runtime SDE to join our AWS Neuron team. As a key member of our team, you will be responsible for designing and developing innovative software solutions that optimize the performance of complex neural net models executed on AWS Inferentia.
Our ideal candidate will have a strong background in machine...
-
Cloud-Scale Machine Learning Engineer
2 weeks ago
Cupertino, California, United States Amazon Full time
Job Description
This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. The team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1.
As a cloud-scale machine learning engineer, you will be responsible for optimizing inference...
-
Cloud Scale Machine Learning Developer
2 weeks ago
Cupertino, California, United States Amazon Full time
**Role Overview:**
We are seeking an experienced Software Development Engineer to join our Machine Learning Applications team for AWS Neuron. This role involves developing, enabling, and tuning machine learning models for cloud-scale applications.
The ideal candidate will have strong software development skills in C++/Python and ML knowledge, with experience...
-
Cloud Scale Solutions Developer
2 weeks ago
Cupertino, California, United States Amazon Full time
Cloud Scale Solutions Developer: We are seeking a talented developer to join our team in creating and optimizing cloud-scale machine learning solutions for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to build and tune distributed inference solutions.
Main Responsibilities:
Design and develop scalable machine...
-
Software Development Specialist
2 weeks ago
Cupertino, California, United States Amazon Full time
Job Description: As a Software Development Specialist, you will be responsible for developing and optimizing machine learning models for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions.
Responsibilities:
Develop high-quality software solutions to meet...
-
Senior Distributed Training Specialist
1 week ago
Cupertino, California, United States Amazon Full time
About the Role: Amazon is looking for a Senior Distributed Training Specialist to join our team. In this role, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch, TensorFlow, and other libraries.
Key...
-
Software Developer
4 days ago
Cupertino, California, United States Amazon Full time
About the Job
We're looking for a skilled Software Developer to join our Machine Learning Applications team. As a key member of the team, you'll contribute to the design, development, and deployment of large-scale machine learning systems. Your expertise in distributed training libraries and frameworks will enable us to optimize and scale ML models for...