Sr. Machine Learning Engineer, Annapurna ML

4 weeks ago


Cupertino, United States Annapurna Labs (U.S.) Inc. Full time
The Annapurna ML pathfinding team is a new function within the Annapurna ML go-to-market org that helps customers accelerate their adoption of Annapurna ML products, including AWS Trainium and AWS Inferentia. The team offers hands-on data science and coding services to our most strategic customer opportunities, helping them launch their training and inference workloads on AWS purpose-built ML silicon.

Key job responsibilities
In this customer-facing role, you will help our most strategic customers port their models to the AWS Trainium and Inferentia platforms by delivering high-quality code and customizations that make the models functional and performant. You will use the various Neuron SDK libraries, provide feedback on them, and help prototype and develop new features based on the latest research findings and customer requests.

A day in the life
You will be required to assist our most strategic customers in porting their models to AWS Trainium and Inferentia.

You will work directly with customer data scientists and ML engineering teams and write code to make their models performant on AWS purpose-built silicon. This may require low-level coding in C++ and writing custom kernels to get the best possible performance.

You will also be responsible for porting the latest open-source models to AWS Trainium and Inferentia, and you will contribute to popular open-source projects to add support for these platforms.

The role requires close collaboration with the Neuron engineering team to help drive the Neuron product roadmap and to give feedback that improves product quality.

About the team
Our team's mission is to provide the fastest, most cost-effective, and most user-friendly place to train and deploy Generative AI workloads in the cloud. The team provides white-glove service to our most strategic customers, implementing their models for both training and inference using the Neuron SDK and its associated libraries and APIs.

We are open to hiring candidates to work out of one of the following locations:

Cupertino, CA, USA

BASIC QUALIFICATIONS

- 5+ years of non-internship professional software development experience
- 5+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience
- 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- 5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience
- Experience as a mentor, tech lead or leading an engineering team
- 3+ years of experience writing code to train and/or deploy deep learning models in PyTorch

PREFERRED QUALIFICATIONS

- Bachelor's degree in computer science or equivalent
- Experience deploying Generative AI applications with large language or vision models into production.



  • Cupertino, California, United States Amazon Full time

    BASIC QUALIFICATIONS: 5+ years of non-internship professional software development experience; 5+ years of programming using a modern programming language such as Java, C++, or C#, including object-oriented design experience; 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience; 5+ years of...


  • Cupertino, California, United States tapwage Full time

    Annapurna Labs is an integral part of AWS and develops hardware and software components that are critical building blocks for EC2 infrastructure. We specialize in designing semiconductors, systems, chips, and software that optimize the AWS customer experience. More and more customers run their HPC and ML workloads on AWS to reap the benefits of elasticity...


  • Cupertino, CA, United States Abs Data Full time

    Job title: 2024 ASIC Physical Design Engineer Intern, Annapurna Labs DESCRIPTION Amazon Web Services (AWS) internships are full-time (40 hours/week) for 12 consecutive weeks during summer. By applying to this position, your application will be considered for all locations we hire for in the United States. In Annapurna Labs we are at the forefront of...


  • Cupertino, United States Synergis Full time

    Title: Machine Learning Engineer Duration: 3 months+ Location: Cupertino, CA or Seattle, WA (Tuesday-Thursday onsite hybrid) NOTE: I am not able to work on a C2C basis or provide a visa transfer/sponsorship at this time! We are seeking a highly motivated and experienced machine learning engineer to join our team. The ideal candidate will have a deep...


  • Cupertino, California, United States Apple Full time

    Summary Posted: May 30, 2024 Weekly Hours: 40 Role Number: We're looking for industry-leading machine learning technologists to help shape the future of AI-driven system experiences across the Apple ecosystem. You'll be joining a small, dynamic team that will develop novel ML techniques and applications, build tools and infrastructure, rapidly iterate on...


  • Cupertino, United States OSI Engineering Full time

    Software Engineer at Global Device Company in Cupertino, CA or Seattle, WA. We are seeking a highly motivated and experienced software engineer to join our team. The ideal candidate will have a deep understanding of machine learning algorithms and cloud computing infrastructure. Responsibilities • Support ML efficiency metrics analysis and...


  • Cupertino, United States OSI Engineering Full time

    Software Engineer at Global Device Company in Cupertino, CA or Seattle, WA. We are seeking a highly motivated and experienced software engineer to join our team. The ideal candidate will have a deep understanding of machine learning algorithms and cloud computing infrastructure. Responsibilities • Support ML efficiency metrics analysis and improvements on...


  • Cupertino, United States CoolSnail Full time

    Job Description Job Title: Machine Learning Engineer/AI Engineer Duration: 5 months Working hours: 8 AM - 5 PM Work address: Cupertino, CA 95014 Summary of the Project: • We will develop an AI/ML Model Inferencing Pipeline that would automate the extraction of all data elements from the Document or from Source Streaming Data, this will leverage...


  • Cupertino, CA, United States CoolSnail Full time

    Job Title: Machine Learning Engineer/AI Engineer Duration- 5 months Working hours: 8 AM -5 PM Work address: Cupertino, CA 95014 Summary of the Project: • We will develop an AI/ML Model Inferencing Pipeline that would automate the extraction of all data elements from the Document or from Source Streaming Data, this will leverage the elastic nature of...

  • ML Engineer with LLMs

    2 weeks ago


    Cupertino, United States Centraprise Full time

    If you're interested, please email me at chandra.ss@centraprise.com Job Title: ML Engineer with LLMs Location: Cupertino, CA Job Type: Full time / Permanent Job Description: 12+ years of experience in ML Engineering with experience in NLP • Experience in deploying ML models • Strong understanding of machine learning principles, especially in the context of LLMs •...
