Distributed Inference Solutions Developer
4 weeks ago
We are looking for a talented Distributed Inference Solutions Developer to join our Machine Learning Applications team. As a key contributor, you will be responsible for building and maintaining distributed inference solutions using AWS Neuron, PyTorch, and other machine learning frameworks.
Salary: $170,200 - $175,500 per year
Responsibilities
- Develop and deploy high-performance distributed inference solutions using AWS Neuron and other machine learning frameworks.
- Collaborate with cross-functional teams to design and implement scalable and efficient machine learning models.
- Troubleshoot and resolve issues related to distributed inference solutions.
Qualifications
- 3+ years of non-internship professional software development experience.
- 2+ years of experience in designing or architecting new and existing systems.
- Strong programming skills in at least one software programming language.
- 3+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations.
- Bachelor's degree in computer science or equivalent.
-
AWS Neuron Distributed Inference Expert
3 weeks ago
Cupertino, California, United States Amazon Full time
About the Role
This role involves developing, enabling, and performance-tuning various machine learning model families.
Responsibilities:
Building distributed inference support into PyTorch and TensorFlow using XLA and the Neuron compiler and runtime stacks.
Tuning machine learning models to ensure the highest performance and maximize efficiency on customer AWS...
-
Cloud-Scale Inference Solutions Architect
3 weeks ago
Cupertino, California, United States Amazon Full time
About the Team
We are a dynamic team of experts in machine learning and software development, working together to create innovative solutions for cloud-scale inference. Our team is passionate about using technology to make a positive impact on society, and we are committed to fostering a culture of inclusivity, diversity, and respect. If you are a motivated...
-
AI Hardware Engineer
16 hours ago
Cupertino, California, United States Etched Full time
About Etched
We're building AI chips that are hard-coded for individual model architectures. Our first product, Sohu, only supports transformers but has an order of magnitude more throughput and lower latency than a B200. With our ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep &...
-
Cloud Scale Solutions Developer
4 weeks ago
Cupertino, California, United States Amazon Full time
Cloud Scale Solutions Developer: We are seeking a talented developer to join our team in creating and optimizing cloud-scale machine learning solutions for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to build and tune distributed inference solutions.
Main Responsibilities:
Design and develop scalable machine...
-
Cloud Scale Machine Learning Developer
4 weeks ago
Cupertino, California, United States Amazon Full time
**Role Overview:**
We are seeking an experienced Software Development Engineer to join our Machine Learning Applications team for AWS Neuron. This role involves developing, enabling, and tuning machine learning models for cloud-scale applications.
The ideal candidate will have strong software development skills in C++/Python and ML knowledge, with experience...
-
Cupertino, California, United States Amazon Full time
Job Description
This is an exciting opportunity to work on some of the most challenging problems in machine learning and computer science. As a Senior Software Development Lead, you will be responsible for leading a team of software engineers in the development and optimization of large language models for cloud-scale inference solutions. You will design and...
-
Machine Learning Applications Development Lead
3 weeks ago
Cupertino, California, United States Amazon Full time
About Amazon
Amazon is a leader in cloud computing, artificial intelligence, and related technologies. We are committed to innovation and excellence, with a focus on developing cutting-edge solutions that improve people's lives.
This role is part of our Machine Learning Applications (ML Apps) team, which works on developing and optimizing cloud-scale machine...
-
Software Development Engineer for Neuron
3 weeks ago
Cupertino, California, United States Amazon Full time
**Job Details:**
As a Software Development Engineer for Neuron, you will be part of the Machine Learning Applications team at Amazon. This role involves developing and tuning machine learning models for cloud-scale applications, working closely with compiler engineers and runtime engineers to optimize inference performance and develop distributed inference...
-
Cupertino, California, United States Amazon Full time
About the Role
This is a unique opportunity to join Amazon's Machine Learning Applications (ML Apps) team as a software development engineer. You will be responsible for developing, enabling, and performance-tuning a wide range of machine learning models, including large language models like Llama2, GPT2, and GPT3.
Your primary focus will be on optimizing...
-
Software Development Specialist
3 weeks ago
Cupertino, California, United States Amazon Full time
Job Description: As a Software Development Specialist, you will be responsible for developing and optimizing machine learning models for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions.
Responsibilities:
Develop high-quality software solutions to meet...
-
Machine Learning Applications Engineer
3 weeks ago
Cupertino, California, United States Amazon Full time
**Job Description:**
A Software Development Engineer is needed in the Machine Learning Applications team for AWS Neuron. This role is responsible for development, enablement, and performance tuning of various machine learning models.
The ideal candidate will have experience with distributed inference libraries such as DeepSpeed and optimizing inference...
-
Cloud-Scale Software Development Lead
4 weeks ago
Cupertino, California, United States Amazon Full time
About Amazon
Amazon is committed to a diverse and inclusive workplace.
About the Team
The Machine Learning Applications team works closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions.
Team Environment:
Dedicated team members with a broad mix of experience levels and tenures.
Celebrating knowledge-sharing...
-
Machine Learning Applications Engineer
4 weeks ago
Cupertino, California, United States Amazon Full time
Job Summary
The AWS Neuron team is seeking a skilled Software Development Engineer to join our Machine Learning Applications team. As a key member of this team, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families, including large language models and vision transformers.
This role requires...
-
Cupertino, California, United States Apple Full time
**Job Summary**
We are seeking a talented Senior Software Development Engineer, Machine Learning Expert to join our team at Apple. As a key member of our applied ML scientists and engineers team, you will be responsible for enhancing the experience and productivity of software developers at Apple and in the Apple developer ecosystem.
**About the Role**
In this...
-
AWS Neuron Software Development Lead
4 weeks ago
Cupertino, California, United States Amazon Full time
About the Role
We are seeking a skilled software development engineer to join our AWS Neuron team. As a software development lead, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families.
Key responsibilities include:
Tuning large language models like Llama2, GPT2, and GPT3 for highest...
-
Cupertino, California, United States Apple Full time
**Job Description**
We are looking for a highly skilled Machine Learning Engineer, Developer Experience Specialist to join our team at Apple. In this role, you will work closely with our applied ML scientists and engineers to enhance the experience and productivity of software developers at Apple and in the Apple developer ecosystem.
**About the Team**
Our team...
-
Cloud Scale Machine Learning Engineer
3 weeks ago
Cupertino, California, United States Amazon Full time
About the Job
We are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron.
Salary: $173,450 - $178,650 per year
Key Responsibilities
Design and implement high-performance...
-
Large Scale Distributed Systems Developer
18 hours ago
Cupertino, California, United States Amazon Full time
Job Description
We're seeking a talented Software Development Engineer to join our team and contribute to the development of the next generation of the AWS Direct Connect service. As a key member of our software engineering team, you will be responsible for designing, developing, and supporting features within the AWS Direct Connect service.
Responsibilities
*...
-
Cloud AI Engineering Specialist
3 weeks ago
Cupertino, California, United States Amazon Full time
Team Overview:
The AWS Neuron Inference team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1/Inf2. As a member of this team, you will be responsible for developing, enabling, and optimizing machine learning models for cloud-scale inference accelerators.
About the Team:
We...
-
Cloud Networking Solutions Architect
4 weeks ago
Cupertino, California, United States Amazon Full time
**The Elastic Collectives Team**
The Elastic Collectives team builds out the collective operations layer in the Trainium and Nvidia stack for distributed machine learning. On any given day, we are designing new algorithms, hunting for performance bottlenecks, and optimizing a customer's heavy ML/AI workloads. You will be working with principal and senior principal...