Distributed Inference Solutions Developer

4 weeks ago


Cupertino, California, United States Amazon Full time
About the Role

We are looking for a talented Distributed Inference Solutions Developer to join our Machine Learning Applications team. As a key contributor, you will be responsible for building and maintaining distributed inference solutions using AWS Neuron, PyTorch, and other machine learning frameworks.

Salary: $170,200 - $175,500 per year

Responsibilities
  • Develop and deploy high-performance distributed inference solutions using AWS Neuron and other machine learning frameworks.
  • Collaborate with cross-functional teams to design and implement scalable and efficient machine learning models.
  • Troubleshoot and resolve issues related to distributed inference solutions.
Requirements
  • 3+ years of non-internship professional software development experience.
  • 2+ years of experience in designing or architecting new and existing systems.
  • Strong programming skills in at least one software programming language.
Preferred Qualifications
  • 3+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Bachelor's degree in computer science or equivalent.


  • Cupertino, California, United States Amazon Full time

    About the Role: This role involves developing, enabling, and performance-tuning various machine learning model families. Responsibilities: Developing distributed inference support into PyTorch and TensorFlow using XLA and the Neuron compiler and runtime stacks. Tuning machine learning models to ensure highest performance and maximize efficiency on customer AWS...


  • Cupertino, California, United States Amazon Full time

    About the Team: We are a dynamic team of experts in machine learning and software development, working together to create innovative solutions for cloud-scale inference. Our team is passionate about using technology to make a positive impact on society, and we are committed to fostering a culture of inclusivity, diversity, and respect. If you are a motivated...

  • AI Hardware Engineer

    16 hours ago


    Cupertino, California, United States Etched Full time

    About Etched: We're building AI chips that are hard-coded for individual model architectures. Our first product, Sohu, only supports transformers but has an order of magnitude more throughput and lower latency than a B200. With our ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep &...


  • Cupertino, California, United States Amazon Full time

    Cloud Scale Solutions Developer: We are seeking a talented developer to join our team in creating and optimizing cloud-scale machine learning solutions for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to build and tune distributed inference solutions. Main Responsibilities: Design and develop scalable machine...


  • Cupertino, California, United States Amazon Full time

    **Role Overview:** We are seeking an experienced Software Development Engineer to join our Machine Learning Applications team for AWS Neuron. This role involves developing, enabling, and tuning machine learning models for cloud-scale applications. The ideal candidate will have strong software development skills in C++/Python and ML knowledge, with experience...


  • Cupertino, California, United States Amazon Full time

    Job Description: This is an exciting opportunity to work on some of the most challenging problems in machine learning and computer science. As a Senior Software Development Lead, you will be responsible for leading a team of software engineers in the development and optimization of large language models for cloud-scale inference solutions. You will design and...


  • Cupertino, California, United States Amazon Full time

    About Amazon: Amazon is a leader in cloud computing, artificial intelligence, and related technologies. We are committed to innovation and excellence, with a focus on developing cutting-edge solutions that improve people's lives. This role is part of our Machine Learning Applications (ML Apps) team, which works on developing and optimizing cloud-scale machine...


  • Cupertino, California, United States Amazon Full time

    **Job Details:** As a Software Development Engineer for Neuron, you will be part of the Machine Learning Applications team at Amazon. This role involves developing and tuning machine learning models for cloud-scale applications, working closely with compiler engineers and runtime engineers to optimize inference performance and develop distributed inference...


  • Cupertino, California, United States Amazon Full time

    About the Role: This is a unique opportunity to join Amazon's Machine Learning Applications (ML Apps) team as a software development engineer. You will be responsible for developing, enabling, and performance-tuning a wide range of machine learning models, including large language models like Llama2, GPT2, and GPT3. Your primary focus will be on optimizing...


  • Cupertino, California, United States Amazon Full time

    Job Description: As a Software Development Specialist, you will be responsible for developing and optimizing machine learning models for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions. Responsibilities: Develop high-quality software solutions to meet...


  • Cupertino, California, United States Amazon Full time

    **Job Description:** A Software Development Engineer is needed in the Machine Learning Applications team for AWS Neuron. This role is responsible for development, enablement, and performance tuning of various machine learning models. The ideal candidate will have experience with distributed inference libraries such as DeepSpeed and optimizing inference...


  • Cupertino, California, United States Amazon Full time

    About Amazon: Amazon is committed to a diverse and inclusive workplace. About the Team: The Machine Learning Applications team works closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions. Team Environment: Dedicated team members with a broad mix of experience levels and tenures. Celebrating knowledge-sharing...


  • Cupertino, California, United States Amazon Full time

    Job Summary: The AWS Neuron team is seeking a skilled Software Development Engineer to join our Machine Learning Applications team. As a key member of this team, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families, including large language models and vision transformers. This role requires...


  • Cupertino, California, United States Apple Full time

    **Job Summary** We are seeking a talented Senior Software Development Engineer, Machine Learning Expert to join our team at Apple. As a key member of our applied ML scientists and engineers team, you will be responsible for enhancing the experience and productivity of software developers at Apple and in the Apple developer ecosystem. **About the Role** In this...


  • Cupertino, California, United States Amazon Full time

    About the Role: We are seeking a skilled software development engineer to join our AWS Neuron team. As a software development lead, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families. Key responsibilities include: Tuning large language models like Llama2, GPT2, and GPT3 for highest...


  • Cupertino, California, United States Apple Full time

    **Job Description** We are looking for a highly skilled Machine Learning Engineer, Developer Experience Specialist to join our team at Apple. In this role, you will work closely with our applied ML scientists and engineers to enhance the experience and productivity of software developers at Apple and in the Apple developer ecosystem. **About the Team** Our team...


  • Cupertino, California, United States Amazon Full time

    About the Job: We are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron. Salary: $173,450 - $178,650 per year. Key Responsibilities: Design and implement high-performance...


  • Cupertino, California, United States Amazon Full time

    Job Description: We're seeking a talented Software Development Engineer to join our team and contribute to the development of the next generation of the AWS Direct Connect service. As a key member of our software engineering team, you will be responsible for designing, developing, and supporting features within the AWS Direct Connect service. Responsibilities: *...


  • Cupertino, California, United States Amazon Full time

    Team Overview: The AWS Neuron Inference team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1/Inf2. As a member of this team, you will be responsible for developing, enabling, and optimizing machine learning models for cloud-scale inference accelerators. About the Team: We...


  • Cupertino, California, United States Amazon Full time

    **The Elastic Collectives Team** The Elastic Collectives team builds out the collective operations layer in the Trainium and Nvidia stack for distributed machine learning. On any given day, we are designing new algorithms, hunting for performance bottlenecks, and optimizing a customer's heavy ML/AI workloads. You will be working with principal and senior principal...