Current jobs related to Sr. Machine Learning Engineer, Annapurna ML - Cupertino - Annapurna Labs (U.S.) Inc.


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Senior Machine Learning Engineer to join our Annapurna ML pathfinding team. As a key member of this team, you will play a critical role in helping our most strategic customers accelerate their adoption of Annapurna ML products, including AWS Trainium and AWS Inferentia.Key ResponsibilitiesCollaborate with...

  • Sr. Machine Learning

    3 weeks ago


    Cupertino, United States Amazon Web Services (AWS) Full time

    DescriptionAWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium delivers the best-in-class ML training performance with the most teraflops (TFLOPS) of compute power...

  • Sr. Machine Learning

    3 weeks ago


    Cupertino, United States Amazon Web Services (AWS) Full time

    DescriptionAWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium delivers the best-in-class ML training performance with the most teraflops (TFLOPS) of compute power...


  • Cupertino, United States Amazon Development Center U.S., Inc. - B02 Full time

    We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by designing systems that enable scaling network-intensive workloads over thousands of CPUs, GPUs, and TPUs. This role is on the forefront of AI/ML, we spend a good deal of the day optimizing the networking for the...


  • Cupertino, CA, United States Amazon Development Center U.S., Inc. - B02 Full time

    We are seeking an experienced software engineer with low-level latency networking or interconnect expertise to optimize customer experience by designing systems that enable scaling network-intensive workloads over thousands of CPUs, GPUs, and TPUs. This role is on the forefront of AI/ML, we spend a good deal of the day optimizing the networking for the...


  • Cupertino, United States Apple Inc. Full time

    AIML - Sr Machine Learning Engineer, LLM Optimization, Data and ML InnovationAs part of Apple's AI and Machine Learning org, we encourage and create groundbreaking technology for large-scale ML systems, computer vision, natural language processing, and multi-modal understanding. As a Machine Learning Engineer in the LLM Optimization team, you will have the...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled software engineer to join our team at Annapurna Labs, a subsidiary of Amazon Web Services (AWS). As a Device Driver Engineer, you will play a critical role in developing and maintaining the software stack for our custom silicon chips, which power AWS's machine learning servers.Key ResponsibilitiesDesign and...


  • Cupertino, California, United States Annapurna Labs Full time

    About the RoleWe are seeking a highly skilled TPM to lead our AWS AI Chips GTM efforts. As a key member of the Annapurna Labs team, you will be responsible for driving the adoption of our AI Chips across various industries and customer segments.Key ResponsibilitiesLead internal and external cross-team technical projects to accelerate the adoption of AWS AI...


  • Cupertino, California, United States Amazon Development Center U.S., Inc. Full time

    About the RoleWe are seeking a highly skilled Software Engineer to lead the development of machine learning tools at Amazon's Annapurna Labs. As a key member of our team, you will design and implement new toolsets, collaborate with developers, system architects, and hardware engineers to ensure compatibility with existing and next-generation AI...


  • Cupertino, California, United States Apple Full time

    Job SummaryWe are seeking an exceptional software engineer to expand and support Apple's machine learning framework (Core ML). As a key member of our team, you will be responsible for developing, maintaining, and supporting our ML frameworks and tools.Key ResponsibilitiesDevelop and maintain Apple's ML frameworks and toolsWork cross-functionally with...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking an exceptional software engineer to join our team and expand Apple's machine learning framework, Core ML. As a key member of our team, you will be responsible for developing and maintaining highly performant ML frameworks on Apple devices and supporting developer adoption of their capabilities.Key ResponsibilitiesDevelop,...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Device Driver Engineer to join our team at Annapurna Labs, a part of Amazon Web Services (AWS). As a member of our team, you will be responsible for developing and maintaining the software drivers for our custom silicon chips, which power AWS's machine learning servers.Key ResponsibilitiesDesign and develop...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Device Driver Engineer to join our team at Annapurna Labs, a part of Amazon Web Services (AWS). As a member of our team, you will be responsible for developing and maintaining the software drivers for our custom silicon chips, which power AWS's machine learning servers.Key ResponsibilitiesDesign and develop...


  • Cupertino, California, United States Annapurna Labs Full time

    Join Our TeamAre you passionate about shaping the future of AWS AI Chips (AWS Inferentia/Trainium) Go to Market (GTM)? As a key member of the AWS AI Chips Business and GTM team, you will spearhead our most vital customer and industry partnership engagement initiatives.Your RoleIn this position, you will work closely with customers who deploy GenAI...


  • Cupertino, United States Apple, Inc. Full time

    The AIML - On-Device Machine Learning group is responsible for the creation of amazing on-device ML experiences. The team builds foundational machine learning frameworks and tools to optimize large language/vision/multi-modal models that power on-device ML features across Apple products and services. The group is looking for a senior software engineer to...


  • Cupertino, United States Apple Inc. Full time

    AIML - Machine Learning Engineer, Machine Learning Platform & InfrastructureThe AIML - On-Device Machine Learning group is responsible for the creation of amazing on-device ML experiences. The team builds foundational machine learning frameworks and tools to optimize large language/vision/multi-modal models that power on-device ML features across Apple...


  • Cupertino, United States Apple Inc. Full time

    AIML - Machine Learning Engineer, Machine Learning Platform & InfrastructureThe AIML - On-Device Machine Learning group is responsible for the creation of amazing on-device ML experiences. The team builds foundational machine learning frameworks and tools to optimize large language/vision/multi-modal models that power on-device ML features across Apple...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking an exceptional software engineer to join our team at Apple, working on the development and support of our machine learning frameworks and tools. As a Senior Machine Learning Engineer, you will be responsible for building highly performant ML frameworks on Apple devices and supporting developer adoption of their capabilities.Key...


  • Cupertino, United States Apple Inc. Full time

    AIML - Sr Machine Learning Performance Engineer, Siri and Information IntelligenceThe Siri team in the AIML group at Apple is seeking an exceptional Machine Learning Engineer to lead efforts in identifying bottlenecks and optimizing our model inference stack. In this highly collaborative role, you will be at the center of multiple initiatives to accelerate...


  • Cupertino, United States Apple Inc. Full time

    Machine Learning and AIJoin the Siri Team at Apple!Play a part in the next revolution in human-computer interaction. Contribute to the product that is redefining mobile computing through voice interaction. You will help create groundbreaking technology for large scale systems, spoken language, big data, and artificial intelligence. You will be developing and...

Sr. Machine Learning Engineer, Annapurna ML

3 months ago


Cupertino, United States Annapurna Labs (U.S.) Inc. Full time
Annapurna ML pathfinding team is a new function within the Annapurna ML go-to-market org that help customers accelerate their adoption of Annapurna ML products including AWS Trainium and AWS Inferentia. The team offers hands-on data science and coding services to our most strategic customer opportunities to launch their training and inference workloads on AWS purpose built ML silicon offerings.

Key job responsibilities
In this customer-facing role, you will be responsible for helping our most strategic customers port their models to the AWS Trainium & Inferentia platforms by delivering high-quality code and customizations to make the models functional and performant. You will use and provide feedback to the various Neuron SDK libraries and help prototype and develop new features based on the latest research findings and customer requests.

A day in the life
You will be required to assist our most strategic customers in porting their models to AWS Trainium and Inferentia.

You will work directly with customer data scientists and ML engineering teams and write code to have the models be performant on AWS purpose-built silicon solutions. It may require low-level coding in C++ and writing custom kernels to get the best performance possible.

You will also be responsible for porting the latest open-source models to AWS Trainium/Inferentia. You will also contribute to open-source projects to help add support for AWS Trainium/Inferentia in popular projects.

It will require a close collaboration with the Neuron engineering team to help drive the Neuron product roadmap and give feedback on improving product quality.

About the team
Our team's mission is to provide the fastest, cost-effective and user-friendly place to train and deploy Generative AI workloads in the cloud. The team provides white-glove service to our most strategic customers to implement their models for both training and inference using the Neuron SDK associated libraries and APIs.

We are open to hiring candidates to work out of one of the following locations:

Cupertino, CA, USA | New York, NY, USA

BASIC QUALIFICATIONS

- 5+ years of non-internship professional software development experience
- 5+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Experience as a mentor, tech lead or leading an engineering team
- 3+ years of experience writing code to train and/or deploy deep learning models in PyTorch

PREFERRED QUALIFICATIONS

- Bachelor's degree in computer science or equivalent
- Experience deploying Generative AI applications with large language or vision models into production.