Machine Learning Performance Engineer, Annapurna Labs
2 weeks ago
Machine Learning Performance Engineer, Annapurna Labs Our team is responsible for the AWS Neuron software stack, which powers Generative AI and other advanced ML workloads on AWS's custom-built ML accelerators Inferentia and Trainium. These accelerators deliver best-in-class performance and cost-efficiency for ML inference and training in the cloud. We are building a new core group of engineers in Tel Aviv to drive innovation in ML systems performance and software. As a Machine Learning Performance Engineer, you will help shape the direction of the team from the ground up and work on the following: Optimizing system performance across the entire ML software stack Analyzing high-performance ML workloads running on Annapurna hardware Developing high-performance kernels for critical ML operations Enhancing the Neuron SDK to improve developer experience and system capabilities Collaborating across Compiler, Frameworks, and Hardware teams to maximize end-to-end performance As part of the Performance Engineering Team, you will contribute to projects involving instruction scheduling, memory management, parallelism, kernel optimization, and compiler enhancements to maximize end-to-end performance. This is a unique opportunity to be at the intersection of ML and systems within AWS, helping to build the future of AI infrastructure right here in Tel Aviv. Basic Qualifications B.S. or M.S. in computer science or related field Proficiency with 1 or more of the following programming languages: Python (preferred), C++ Experience with TensorFlow, PyTorch, and/or JAX 3+ years of non-internship professional software development experience 3+ years of performance optimization experience in LLM, Vision or other deep-learning models Preferred Qualifications M.S. in computer science or related field Experience with developing algorithms for simulation tools Experience with VLLM or other inference serving infrastructures Experience developing compiler optimization, kernel writing or hardware-software co-design Experience with LLVM and/or MLIR Experience with TensorFlow, PyTorch, and/or JAX Experience in LLM, Vision or other deep-learning models About the team Diverse Experiences Amazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. Why AWS Amazon Web Services (AWS) is the worlds most comprehensive and broadly adopted cloud platform. Work/Life Balance We value work-life harmony. Inclusive Team Culture AWS values curiosity and connection. Mentorship and Career Growth Were continuously raising our performance bar. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, please visit for more information. Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. #J-18808-Ljbffr
-
Sr. Machine Learning
2 weeks ago
San Francisco, CA, United States Amazon Full timeAWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers bestinclass ML inference performance at the lowest cost in cloud. Trainium will deliver the bestinclass ML training performance with the most teraflops (TFLOPS) of compute power for ML in...
-
San Francisco, CA, United States Altos Labs Full timeMachine Learning Engineer / Machine Learning Scientist, Multi Modality – Altos Labs Join Altos Labs to build computational platforms enabling multi‑modal generative foundation models for biology. Our mission is to restore cell health and resilience through cell rejuvenation to reverse disease, injury, and age‑related disabilities. As part of our...
-
San Francisco, CA, United States Altos Labs Full timeMachine Learning Engineer/Machine Learning Scientist , Multi Modality Our mission is to restore cell health and resilience through cell rejuvenation to reverse disease, injury, and the disabilities that can occur throughout life. Diversity at Altos We believe that diverse perspectives are foundational to scientific innovation and inquiry. As part of...
-
San Francisco, CA, United States Altos Labs Full timeMachine Learning Engineer / Machine Learning Scientist, Multi Modality Altos Labs Join Altos Labs to build computational platforms enabling multimodal generative foundation models for biology. Our mission is to restore cell health and resilience through cell rejuvenation to reverse disease, injury, and agerelated disabilities. As part of our team, you...
-
Sr. Machine Learning
3 weeks ago
San Francisco, United States Amazon Full timeThe Product AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best‑in‑class ML inference performance at the lowest cost in cloud. Trainium will deliver the best‑in‑class ML training performance with the most teraflops (TFLOPS) of...
-
Sr. Machine Learning
3 weeks ago
San Francisco, CA, United States Amazon Full timeThe Product AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers bestinclass ML inference performance at the lowest cost in cloud. Trainium will deliver the bestinclass ML training performance with the most teraflops (TFLOPS) of compute power...
-
San Francisco, United States Amazon Full timeAbout Amazon Annapurna Labs Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware...
-
ML Compiler Engineer
3 weeks ago
San Francisco, United States Amazon Full timeML Kernel Performance Engineer, AWS Neuron, Annapurna Labs The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing...
-
San Francisco, United States Amazon Full timeAbout Amazon Annapurna Labs:Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware...
-
Sr. ML Compiler Engineer, Annapurna Labs
3 weeks ago
San Francisco, United States Amazon Full time"Annapurna Labs builds custom Machine Learning accelerators that are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Neuron Compiler Engineering team is searching for a Senior Software Development Engineer to support the development infrastructure of a compiler to enable the world's largest ML...