Sr. Machine Learning

4 weeks ago


Cupertino, United States Amazon Web Services (AWS) Full time

Join to apply for the Sr. Machine Learning - Compiler Engineer III, AWS Neuron, Annapurna Labs role at Amazon Web Services (AWS). The AWS Neuron software stack includes an ML compiler, runtime and integrations into PyTorch, TensorFlow and JAX. AWS Neuron is used at scale with customers such as Snap, Autodesk, Amazon Alexa and Rekognition, among others. The Neuron Compiler team develops a deep learning compiler stack that converts neural network descriptions created in frameworks into code for execution to push performance. The Amazon Annapurna Labs team is responsible for silicon development at AWS, covering silicon engineering, hardware design and verification, software and operations. As a Machine Learning Compiler Engineer II in the AWS Neuron Compiler team, you will be supporting the ground-up development and scaling of a compiler to handle the world\'s largest ML workloads. Architecting and implementing business-critical features, publish cutting-edge research, and contributing to a brilliant team of engineers excites and challenges you. You will work as a hands-on partner to AWS ML services teams and be involved in pre-silicon design, bringing new products/features to market, and other exciting projects. A background in compiler development is strongly preferred. A background in Machine Learning and AI accelerators is preferred, but not required. In order to be considered for this role, candidates must be currently located or willing to relocate to Cupertino (perferred), Seattle, Austin. Basic Qualifications 5+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience 2+ years of experience in developing compiler features and optimizations Proficiency with 1 or more of the following programming languages: C++ (preferred), C, Python Preferred Qualifications Master or PhD degree in computer science or equivalent Proficiency with resource management, scheduling, code generation, and compute graph optimization Experience optimizing Tensorflow, PyTorch or JAX deep learning models Experience with multiple toolchains and Instruction Set Architectures Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you\'re applying in isn\'t listed, please contact your Recruiting Partner. Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $151,300/year in our lowest geographic market up to $261,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site. Company - Annapurna Labs (U.S.) Inc. #J-18808-Ljbffr



  • Cupertino, California, United States Amazon Web Services (AWS) Full time

    DescriptionThe Product: AWS Machine Learning accelerators are at the forefront of AWS innovation and one of several AWS tools used for building Generative AI on AWS. The Inferentia chip delivers best-in-class ML inference performance at the lowest cost in cloud. Trainium will deliver the best-in-class ML training performance with the most teraflops (TFLOPS)...

  • Machine Learning

    4 days ago


    Cupertino, CA, United States Syntricate Technologies Full time

    Position- Machine Learning Duration-Contract Location- Cupertino, C JD 12+ years of experience in Client Engineering with experience in NLP Experience in deploying Client models Strong understanding of machine learning principles, especially in the context of LLMs. Experience building chatbots using LLM's• Managing Data pipeline/Transforms• ...

  • Machine Learning

    1 week ago


    Cupertino, CA, United States Syntricate Technologies Full time

    Position- Machine Learning Duration-Contract Location- Cupertino, C JD 12+ years of experience in Client Engineering with experience in NLP Experience in deploying Client models Strong understanding of machine learning principles, especially in the context of LLMs. Experience building chatbots using LLM's• Managing Data pipeline/Transforms• ...

  • Machine Learning

    1 week ago


    Cupertino, CA, United States Syntricate Technologies Full time

    Position- Machine Learning Duration-Contract Location- Cupertino, C JD 12+ years of experience in Client Engineering with experience in NLP Experience in deploying Client models Strong understanding of machine learning principles, especially in the context of LLMs. Experience building chatbots using LLM's• Managing Data pipeline/Transforms• ...

  • Machine Learning

    3 days ago


    Cupertino, CA, United States Syntricate Technologies Full time

    Position- Machine Learning Duration-Contract Location- Cupertino, C JD 12+ years of experience in Client Engineering with experience in NLP Experience in deploying Client models Strong understanding of machine learning principles, especially in the context of LLMs. Experience building chatbots using LLM's• Managing Data pipeline/Transforms• ...


  • Cupertino, United States Apple Inc. Full time

    Sr. Machine Learning Engineer, Siri Global Cupertino, California, United States Machine Learning and AI Join the Siri team at Apple! Build and contribute to a product and company that that is building products, personal devices, and software designed to enrich people’s lives. Work on building and advancing the world’s most popular intelligent assistant...


  • Cupertino, United States Apple Inc. Full time

    Sr. Machine Learning Engineer, Siri Speech Cupertino, California, United States Machine Learning and AI The Speech Team within the Siri organization drives major speech recognition, synthesis and speech to speech model changes for various features deeply embedded throughout Apple’s ecosystem. Our mission is to build cutting‑edge infrastructure, datasets,...


  • Cupertino, United States Apple Full time

    AIML - Sr. Machine Learning Engineer, World Knowledge Cupertino, California, United States Machine Learning and AI Are you excited about Generative AI and Large Language Models and eager to apply your expertise in a fast-paced, innovative environment? Join a newly created team in AIML focused on building world-class experiences to bring Apple's deep...


  • Cupertino, United States Apple Full time

    Role Number: 200602335-0836 Summary Join the Siri team at Apple! Build and contribute to a product and company that that is building products, personal devices, and software designed to enrich people’s lives. Work on building and advancing the world’s most popular intelligent assistant that helps millions of people get things done — just by asking....


  • Cupertino, CA, United States Apple Full time

    Role Number: 200602335-0836 Summary Join the Siri team at Apple! Build and contribute to a product and company that that is building products, personal devices, and software designed to enrich people’s lives. Work on building and advancing the world’s most popular intelligent assistant that helps millions of people get things done — just by...