Senior Software Development Manager, Machine Learning Acceleration, Neuron Inference Apps

2 weeks ago


Cupertino, California, United States Annapurna Labs Full time
About the Role

We are seeking a highly experienced Senior Software Development Manager to lead our Neuron Inference Customer Enablement Team. As a key member of our organization, you will be responsible for optimizing customer or open-source models for inference performance on various frameworks such as PyTorch, JAX, and TensorFlow.

Key Responsibilities
  • Lead a strong team of managers and engineers to improve inference performance and reliability/scalability features in our internal Neuronx_Distributed and Transformers_Neuronx Inference Libraries.
  • Contribute to other popular open inference libraries and strive towards enabling customers to adopt and make Trainium and Inferentia devices as first-class citizens for ML Acceleration workloads.
  • Ensure support for key ML functionality in a combined chip/software platform.
Requirements
  • 10+ years of engineering experience.
  • 5+ years of engineering team management experience.
  • 10+ years of planning, designing, developing, and delivering consumer software experience.
  • Experience partnering with product or program management teams.
  • Experience managing multiple concurrent programs, projects, and development teams in an Agile environment.
About Us

At Amazon, we are committed to a diverse and inclusive workplace. We are an equal opportunity employer and do not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.



  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    About the RoleAs a Software Development Manager for Machine Learning Acceleration, you will be responsible for leading a team of engineers to design and deploy machine learning applications and use cases on various frameworks such as PyTorch, JAX, and TensorFlow. You will be responsible for the full development life cycle of our integrations and extensions...


  • Cupertino, California, United States Annapurna Labs Full time

    About the RoleWe are seeking a highly skilled Software Development Manager to lead our Machine Learning Acceleration team. As a key member of our organization, you will be responsible for designing and deploying ML applications and use cases on various frameworks such as PyTorch, JAX, and TensorFlow.As the Software Development Manager for the ML Applications...


  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    Job SummaryWe are seeking a highly skilled Software Development Manager to lead our Machine Learning Applications Framework team. As a key member of our organization, you will be responsible for designing and deploying ML applications and use cases on various frameworks such as PyTorch, JAX, and TensorFlow.Key Responsibilities:Lead a team of engineers to...


  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    About the JobWe are seeking a highly skilled Software Development Manager to lead our Machine Learning (ML) Applications Framework team. As a key member of our organization, you will be responsible for designing and deploying ML applications and use cases on various frameworks such as PyTorch, JAX, and TensorFlow.Key ResponsibilitiesLead a team of engineers...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Software Development Engineer to join our AWS Neuron Inference team. As a key member of this team, you will be responsible for developing, enabling, and optimizing a wide range of machine learning models, including large language models, vision transformers, and more.Key ResponsibilitiesDesign and develop...


  • Cupertino, California, United States Amazon Full time

    About the RoleThis is a software engineer position in the Machine Learning Applications (ML Apps) team for AWS Neuron. The team works on development, enablement, and performance tuning of machine learning models, including large language models and vision transformers.The ideal candidate will have experience optimizing inference performance for latency and...


  • Cupertino, California, United States Amazon Full time

    About the RoleThis is a unique opportunity to join the Machine Learning Applications team at Amazon, where you will be responsible for developing, enabling, and performance tuning a wide variety of machine learning model families.As a software engineer in this team, you will work closely with compiler engineers and runtime engineers to create, build, and...


  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    Job Title: Software Development Manager for ML AccelerationJoin Amazon Web Services (AWS) as a Software Development Manager for ML Acceleration and lead a team of engineers to design and deploy ML applications on various frameworks such as PyTorch, JAX, and TensorFlow.About the RoleWe are seeking an experienced Software Development Manager to lead our ML...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Senior Software Engineer to join our AWS Neuron Applications team. As a key member of our team, you will be responsible for developing, enabling, and performance tuning of a wide variety of machine learning model families, including large language models, stable diffusion, and vision transformers.Our team works...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly experienced Senior Software Development Manager to lead our team in developing and extending Neuron support for leading ML frameworks, including PyTorch and JAX. As a key member of our AWS Neuron team, you will be responsible for delivering framework plugins and libraries that enable a great user experience for...


  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    About the RoleWe are seeking a highly skilled Software Development Manager to lead our ML Applications - Framework team. As a key member of our organization, you will be responsible for designing and deploying ML applications and use cases on various frameworks such as PyTorch, JAX, and TensorFlow.Key ResponsibilitiesLead a strong team of engineers to design...


  • Cupertino, California, United States Amazon Full time

    About the Role:The AWS Neuron team is seeking a highly skilled Senior Machine Learning Compiler Engineer to join our team. As a key member of our team, you will be responsible for designing and developing a compiler to handle the world's largest ML workloads. You will work closely with our ML services teams to ensure that our compiler meets the needs of our...


  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    About the RoleWe are seeking a highly skilled Software Development Manager to lead our ML Applications - Framework team. As a key member of our organization, you will be responsible for designing and deploying ML applications and use cases on various frameworks such as PyTorch, JAX, and TensorFlow.Key ResponsibilitiesLead a strong team of engineers to design...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Engineering Program Manager to join our Machine Learning Platform and Technologies (MLPT) team in AI/ML. As a key member of our team, you will be responsible for simplifying and accelerating the adoption of machine learning in Apple products and ecosystems.As a technical program manager, you will partner...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Machine Learning Engineer II to join our Annapurna ML pathfinding team. As a key member of this team, you will be responsible for helping our most strategic customers port their models to the AWS Trainium & Inferentia platforms.Key ResponsibilitiesDeliver high-quality code and customizations to make models...


  • Cupertino, California, United States Annapurna Labs Full time

    About the RoleAnnapurna Labs, now fully integrated into AWS, is a leader in infrastructure innovation. Our Machine Learning Applications (ML Apps) team is seeking a skilled Software Engineer to join our AWS Neuron team. As a key member of this team, you will be responsible for developing, enabling, and performance-tuning a wide range of machine learning...


  • Cupertino, California, United States ETCHED LLC Full time

    About EtchedEtched is a pioneering company in the field of AI chips, specializing in building model-specific hardware that accelerates machine learning algorithms. Our mission is to empower innovators with cutting-edge technology that enables the creation of groundbreaking AI products.Our first product, Sohu, is a testament to our innovative approach,...


  • Cupertino, California, United States Apple Full time

    About the RoleWe're seeking a highly skilled Machine Learning Software Engineer to join our Creativity Apps team at Apple. As a key member of our team, you'll work on pioneering technologies to create next-generation creative editing tools for professionals and enthusiasts alike.As a Machine Learning Software Engineer, you'll collaborate with our world-class...


  • Cupertino, California, United States Amazon Full time

    About the Role:The AWS Neuron team is seeking a highly skilled Machine Learning Compiler Engineer to join our team. As a key member of our team, you will be responsible for developing and scaling a compiler to handle the world's largest ML workloads.You will work closely with our ML services teams to architect and implement business-critical features,...


  • Cupertino, California, United States Amazon Full time

    Job DescriptionAnnapurna Labs is a leading innovator in custom Machine Learning accelerators, and we're seeking a talented Software Development Engineer to join our team. As a key member of our Neuron Compiler Engineering team, you will play a critical role in developing the infrastructure of a compiler to enable efficient execution of large-scale ML...