Sr. ML Compiler Engineer, Annapurna Labs
3 weeks ago
"Annapurna Labs builds custom Machine Learning accelerators that are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Neuron Compiler Engineering team is searching for a Senior Software Development Engineer to support the development infrastructure of a compiler to enable the world's largest ML workloads to run efficiently in the cloud.Amazon Annapurna Labs organization is responsible for silicon development at AWS. Organization covers multiple disciplines including silicon engineering, hardware design and verification, software and operations. The AWS Neuron team works to optimize the performance of complex neural net models on our custom‑built AWS hardware. More specifically, the AWS Neuron team is developing a deep learning compiler stack that takes neural network descriptions created in frameworks such as TensorFlow, PyTorch, and Jax, and converts them into code suitable for execution.Key job responsibilitiesIdentify and design solutions that enable efficient and reliable build, test, and release mechanisms for the Neuron compiler.Design and implement a solution for distributed execution of the Neuron compiler that will help to run customer workloads more efficiently.Leverage technical communication skills as a hands‑on partner to AWS ML services teams, bringing new products/features to market, and many other exciting projects.Solve challenging technical problems, often ones not solved before, at every layer of the stack.Design, implement, test, deploy and maintain innovative software solutions to transform service performance, durability, cost, and security.Build high‑quality, highly available, always‑on products.Research implementations that deliver the best possible experiences for customers.A day in the lifeBuild high‑impact solutions to deliver to our large customer base.Participate in design discussions, code review, and communicate with internal and external stakeholders.Work cross‑functionally to help drive business decisions with your technical input.Collaborate closely with a cross‑functional team comprised of compiler, hardware, and ML engineers.Work in a startup‑like development environment, where you’re always working on the most important stuff.Basic Qualifications>5+ years of non‑internship professional software development experience.5+ years of programming with at least one software programming language.5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems.5+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience.Experience as a mentor, tech lead or leading an engineering team.Preferred QualificationsMaster’s degree in computer science or equivalent.Experience developing compilers.Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $151,300/year in our lowest geographic market up to $261,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job‑related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign‑on payments, and other forms of compensation may be provided part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits . This position will remain posted until filled. Applicants should apply via our internal or external career site." #J-18808-Ljbffr
-
Sr ML Compiler Engineer
3 weeks ago
San Francisco, United States Amazon Full timeProduct AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best‑in‑class ML inference performance at the lowest cost in cloud. Trainium will deliver the best‑in‑class ML training performance with the most teraflops (TFLOPS) of...
-
Sr ML Compiler Engineer
3 weeks ago
San Francisco, United States Amazon Full timeThe Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best‑in‑class ML inference performance at the lowest cost in cloud. Trainium will deliver the best‑in‑class ML training performance with the most teraflops (TFLOPS) of...
-
ML Compiler Engineer
3 weeks ago
San Francisco, United States Amazon Full timeML Kernel Performance Engineer, AWS Neuron, Annapurna Labs The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Acceleration Kernel Library team is at the forefront of maximizing...
-
Sr. Machine Learning
3 weeks ago
San Francisco, United States Amazon Full timeThe Product AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best‑in‑class ML inference performance at the lowest cost in cloud. Trainium will deliver the best‑in‑class ML training performance with the most teraflops (TFLOPS) of...
-
San Francisco, United States Amazon Full timeAbout Amazon Annapurna Labs:Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware...
-
San Francisco, United States Amazon Full timeSr. Software Development Engineer, Annapurna Labs In this role you will be responsible for leading a technical team that provides profiling and optimization tools for the Neuron ML accelerators fleet. You will work closely with the hardware and software teams to ensure that the right tools are available for performance profiling of large ML workloads,...
-
San Francisco, United States Amazon Full timeAbout Amazon Annapurna Labs Amazon Annapurna Labs team (our organization within AWS UC) is responsible for building innovation in silicon and software for our AWS customers. We are at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware...
-
Sr. Machine Learning
3 weeks ago
San Francisco, CA, United States Amazon Full timeThe Product AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers bestinclass ML inference performance at the lowest cost in cloud. Trainium will deliver the bestinclass ML training performance with the most teraflops (TFLOPS) of compute power...
-
Sr. Machine Learning
1 week ago
San Francisco, CA, United States Amazon Full timeAWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers bestinclass ML inference performance at the lowest cost in cloud. Trainium will deliver the bestinclass ML training performance with the most teraflops (TFLOPS) of compute power for ML in...
-
San Francisco, CA, United States Amazon Full timeMachine Learning Performance Engineer, Annapurna Labs Our team is responsible for the AWS Neuron software stack, which powers Generative AI and other advanced ML workloads on AWS's custom-built ML accelerators Inferentia and Trainium. These accelerators deliver best-in-class performance and cost-efficiency for ML inference and training in the cloud. We are...