Machine Learning Engineer, Distributed Training Expert

2 weeks ago


Cupertino, California, United States Amazon Full time
About the Role

This is an exciting opportunity to join the Annapurna Labs team at Amazon Web Services (AWS) as a Senior Software Engineer. We are seeking a highly skilled engineer with expertise in deep learning and distributed training. As a member of our Machine Learning Applications (ML Apps) team, you will be responsible for developing and maintaining large-scale machine learning models, including GPT2, GPT3, and other massive models. Your work will focus on optimizing these models for performance and efficiency on the AWS Trainium and Inferentia silicon, as well as the Trn1 and Inf1 servers.

The ideal candidate will have strong software development skills, particularly in Python, as well as experience with distributed training libraries such as PyTorch, Jax, and TensorFlow. You should also be familiar with XLA and the Neuron compiler and runtime stacks. If you're passionate about solving complex problems and working collaboratively with chip architects, compiler engineers, and runtime engineers, we encourage you to apply.

We strive to create an inclusive environment where everyone feels valued and respected. Our team emphasizes work-life balance, mentorship, and career growth opportunities. At AWS, we believe that diversity drives innovation, and we're committed to furthering our culture of inclusion. Join us in shaping the future of cloud computing and artificial intelligence.

  • Cupertino, California, United States Apple Full time

    **Job Summary**We are seeking a talented Senior Software Development Engineer, Machine Learning Expert to join our team at Apple. As a key member of our applied ML scientists and engineers team, you will be responsible for enhancing the experience and productivity of software developers at Apple and in the Apple developer ecosystem.**About the Role**In this...


  • Cupertino, California, United States Apple Full time

    About the OpportunityWe are seeking an experienced Machine Learning Engineer to join our team at Apple. As a key member of our AI/ML - Machine Translation team, you will play a critical role in shaping the future of human-computer interaction.Job Summary:The successful candidate will have a strong background in machine learning and artificial intelligence,...


  • Cupertino, California, United States Apple Full time

    Optimize Distributed Machine Learning SystemsWe are seeking a highly motivated and experienced Machine Learning Engineer to join our team in Cupertino, California. In this role, you will be working on optimizing end-to-end system performance of distributed machine learning workloads.About the RoleThis is a highly collaborative role where you will be working...


  • Cupertino, California, United States Apple Full time

    Company OverviewCupertino, California, United StatesSoftware and ServicesWe're a team of applied ML scientists and engineers who work to enhance the experience and productivity of software developers at Apple and in the Apple developer ecosystem. Our mission is to solve real-world problems using state-of-the-art ML models.Job DescriptionSalary: $175,800 -...


  • Cupertino, California, United States Amazon Full time

    Are you passionate about designing and developing cutting-edge machine learning solutions? Do you thrive in a fast-paced environment where innovation is key?About the RoleWe are seeking an experienced Machine Learning Architect to join our team at Amazon. As a key member of our ML Applications team, you will be responsible for developing and implementing...


  • Cupertino, California, United States Apple Full time

    Job DescriptionWe are seeking a highly skilled Machine Learning Expert to join our team at Apple. As a Machine Learning Expert, you will be responsible for developing and optimizing machine learning models for Spotlight search and other AI-powered applications.**Responsibilities*** Design and implement machine learning algorithms for text indexing and...


  • Cupertino, California, United States Apple Full time

    We're seeking a highly skilled **Machine Learning Engineer** to develop and deploy scalable machine learning models that drive business value at Apple. As a key member of our team, you will work closely with cross-functional teams to design and implement data-driven solutions. Your expertise in distributed computing, data engineering, and analytics will...


  • Cupertino, California, United States Apple Full time

    Job OverviewWe are seeking a highly skilled Machine Learning Engineer to join our team at Apple. As an expert in deep learning, you will be responsible for developing and implementing innovative machine learning solutions for our products.


  • Cupertino, California, United States Amazon Full time

    **Job Description:**A Software Development Engineer is needed in the Machine Learning Applications team for AWS Neuron. This role is responsible for development, enablement, and performance tuning of various machine learning models.The ideal candidate will have experience with distributed inference libraries such as Deepspeed and optimizing inference...


  • Cupertino, California, United States Amazon Full time

    About the Role: We are seeking a highly skilled Machine Learning Engineer to join our team in developing and optimizing machine learning models for AWS Neuron. This role will involve working closely with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions.Key Responsibilities:Design and develop high-impact...


  • Cupertino, California, United States Amazon Full time

    About the Role: Amazon's Machine Learning Engineering team is seeking a talented Team Lead to join our team. As a key member of our ML Apps team, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch,...


  • Cupertino, California, United States Amazon Full time

    About AmazonAmazon is a total compensation company that values its employees' growth and development. We strive to create an inclusive workplace where everyone feels empowered to take on more complex tasks. Our senior members enjoy one-on-one mentoring and thorough code reviews, ensuring that our team members feel confident in their abilities. We are...


  • Cupertino, California, United States Apple Full time

    **Job Description**We are looking for a highly skilled Machine Learning Engineer, Developer Experience Specialist to join our team at Apple. In this role, you will work closely with our applied ML scientists and engineers to enhance the experience and productivity of software developers at Apple and in the Apple developer ecosystem.**About the Team**Our team...


  • Cupertino, California, United States Apple Full time

    Job Overview">We are seeking an experienced Software Engineer to join our team in Cupertino, California. As a key member of the AIML Platform team, you will be responsible for designing and building services and infrastructure to support features that empower billions of Apple users.">About the Role">This is an exciting opportunity to work on large-scale...


  • Cupertino, California, United States Amazon Full time

    Job OverviewA high-level software development engineer position is available in the Machine Learning Applications team for AWS Neuron.About the RoleThis role involves developing, enabling, and performance tuning of various machine learning model families, including large language models, stable diffusion, and vision transformers.The successful candidate will...


  • Cupertino, California, United States Amazon Full time

    About the JobWe are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron.Salary: $173,450 - $178,650 per yearKey ResponsibilitiesDesign and implement high-performance...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Machine Learning Compiler Engineer to join our team at Amazon. As a member of our Neuron Compiler team, you will play a key role in developing and scaling a compiler to handle the world's largest ML workloads.Your primary responsibility will be to architect and implement business-critical features, publish...


  • Cupertino, California, United States Apple Full time

    **Job Description**As a Software Engineer for Machine Learning and AI, you will play a crucial role in building groundbreaking technology for algorithmic search, machine learning, natural language processing, and artificial intelligence. You will work with one of the most exciting high-performance computing environments, with petabytes of data, millions of...


  • Cupertino, California, United States Amazon Full time

    Job SummaryThe AWS Neuron team is seeking a skilled Software Development Engineer to join our Machine Learning Applications team. As a key member of this team, you will be responsible for developing, enabling, and performance-tuning a wide variety of machine learning model families, including large language models and vision transformers.This role requires...


  • Cupertino, California, United States Apple Inc. Full time

    At Apple Inc., we're looking for a talented Machine Learning Engineer to join our team in Developer Productivity. In this role, you'll be responsible for engineering solutions to support model training, such as building data processing pipelines, data generation engines, model evaluation infrastructure, and model inference systems.This is an exciting...