Cloud Scale Machine Learning Engineer

3 weeks ago


Cupertino, California, United States Amazon Full time
About the Job

We are seeking a skilled Cloud Scale Machine Learning Engineer to join our team. As a key member of our Machine Learning Applications team, you will be responsible for developing, enabling, and tuning distributed inference solutions using AWS Neuron.

Salary: $173,450 - $178,650 per year

Key Responsibilities
  • Design and implement high-performance distributed inference solutions using AWS Neuron, PyTorch, and other machine learning frameworks.
  • Tune and optimize large-scale machine learning models for latency and throughput on Trn1 and Inf1 servers.
  • Collaborate with compiler engineers and runtime engineers to develop and maintain the Neuron compiler and runtime stacks.
Requirements
  • 3+ years of non-internship professional software development experience.
  • 2+ years of experience in designing or architecting new and existing systems.
  • Strong programming skills in at least one software programming language.
Preferred Qualifications
  • 3+ years of full software development life cycle experience, including coding standards, code reviews, source control management, build processes, testing, and operations.
  • Bachelor's degree in computer science or equivalent.


  • Cupertino, California, United States Amazon Full time

    Job DescriptionThis role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. The team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trn1.As a cloud-scale machine learning engineer, you will be responsible for optimizing inference...


  • Cupertino, California, United States Amazon Full time

    Job ResponsibilitiesDevelop, enable, and performance tune various machine learning models, including large language models.Optimize inference performance for latency and throughput using Python, PyTorch, or JAX.Work with our Machine Learning Applications team to create high-impact solutions that deliver exceptional results for our customers.Participate in...


  • Cupertino, California, United States Amazon Full time

    **Role Overview:**We are seeking an experienced Software Development Engineer to join our Machine Learning Applications team for AWS Neuron. This role involves developing, enabling, and tuning machine learning models for cloud-scale applications.The ideal candidate will have strong software development skills in C++/Python and ML knowledge, with experience...


  • Cupertino, California, United States Annapurna Labs Full time

    A comprehensive software stack for AWS's cloud-scale machine learning accelerators, designed to unlock the full potential of PyTorch and JAX frameworks. As a key member of the Annapurna Labs team, you'll be responsible for developing and enhancing support for these leading ML frameworks on the Trainium and Inferentia accelerators.Key...


  • Cupertino, California, United States Apple Full time

    **Job Description:**We are seeking a highly experienced Senior Software Engineer to join our Machine Learning Platform Team. As a key member of this team, you will be responsible for designing and building cloud-native infrastructure platforms at Apple scale to support the deployment and operation of our AI/ML services and applications.Key...


  • Cupertino, California, United States Annapurna Labs Full time

    **Job Title:** Software Development Manager, AWS Neuron Machine Learning Distributed Training**Location:** Remote (with occasional travel to Amazon offices)**About Us:Annapurna Labs is a fast-paced and innovative company at the forefront of cloud computing. We're looking for a highly skilled software development manager to lead our team in designing and...


  • Cupertino, California, United States Amazon Full time

    Job OverviewAmazon Web Services (AWS) is a leading provider of cloud-based services that power the world's most innovative businesses. We are seeking an experienced ASIC Design Engineer to join our Cloud-Scale Machine Learning Acceleration team, responsible for designing and optimizing hardware in our data centers.The successful candidate will have a deep...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Software Engineer to lead the development of machine learning tools for our cloud-scale accelerators. As a member of our team, you will design and implement new tools, pipelines, and automation to optimize system performance and ensure high availability and scalability.The ideal candidate will have experience...


  • Cupertino, California, United States Apple Full time

    We are seeking a highly skilled Senior AI Engineer to join our team at Apple, working on large-scale machine learning infrastructure.As a key member of our Foundation Model Infrastructure team within the Machine Learning Platform Technologies organization, you will play a critical role in building frameworks, services, and tools that power Apple's largest...


  • Cupertino, California, United States Amazon Full time

    We are seeking an experienced ASIC Design Engineer to join our Cloud-Scale Machine Learning Acceleration team at Amazon. This is a challenging and rewarding opportunity for a skilled engineer to design and optimize hardware in our data centers, including AWS Inferentia, our custom designed machine learning inference datacenter server.About the RoleAs an ASIC...


  • Cupertino, California, United States Amazon Full time

    ResponsibilitiesThe Cloud-Scale Machine Learning Acceleration team at AWS designs and optimizes custom chips and software stacks to accelerate innovation in the cloud. As an ASIC Design Engineer, your key responsibilities will include:Designing and optimizing hardware components for our machine learning serversCollaborating with software engineers to ensure...


  • Cupertino, California, United States Amazon Full time

    Cloud Scale Solutions Developer: We are seeking a talented developer to join our team in creating and optimizing cloud-scale machine learning solutions for AWS Neuron. This role involves working closely with compiler engineers and runtime engineers to build and tune distributed inference solutions.Main Responsibilities:Design and develop scalable machine...


  • Cupertino, California, United States Amazon Full time

    About the TeamWe are a dynamic team of experts in machine learning and software development, working together to create innovative solutions for cloud-scale inference. Our team is passionate about using technology to make a positive impact on society, and we are committed to fostering a culture of inclusivity, diversity, and respect. If you are a motivated...


  • Cupertino, California, United States Apple Full time

    **Job Description**We are looking for a highly skilled Machine Learning Engineer, Developer Experience Specialist to join our team at Apple. In this role, you will work closely with our applied ML scientists and engineers to enhance the experience and productivity of software developers at Apple and in the Apple developer ecosystem.**About the Team**Our team...


  • Cupertino, California, United States Apple Full time

    **Job Summary**We are seeking a talented Senior Software Development Engineer, Machine Learning Expert to join our team at Apple. As a key member of our applied ML scientists and engineers team, you will be responsible for enhancing the experience and productivity of software developers at Apple and in the Apple developer ecosystem.**About the Role**In this...


  • Cupertino, California, United States Amazon Full time

    About the Role: Amazon's Machine Learning Engineering team is seeking a talented Team Lead to join our team. As a key member of our ML Apps team, you will be responsible for leading the development and deployment of large-scale machine learning models on AWS Neuron. This includes designing and implementing distributed training solutions using PyTorch,...


  • Cupertino, California, United States Apple Full time

    Company OverviewCupertino, California, United StatesSoftware and ServicesWe're a team of applied ML scientists and engineers who work to enhance the experience and productivity of software developers at Apple and in the Apple developer ecosystem. Our mission is to solve real-world problems using state-of-the-art ML models.Job DescriptionSalary: $175,800 -...


  • Cupertino, California, United States Amazon Full time

    About the RoleThis is an exciting opportunity to join the Annapurna Labs team at Amazon Web Services (AWS) as a Senior Software Engineer. We are seeking a highly skilled engineer with expertise in deep learning and distributed training. As a member of our Machine Learning Applications (ML Apps) team, you will be responsible for developing and maintaining...


  • Cupertino, California, United States Apple Full time

    Optimize Distributed Machine Learning SystemsWe are seeking a highly motivated and experienced Machine Learning Engineer to join our team in Cupertino, California. In this role, you will be working on optimizing end-to-end system performance of distributed machine learning workloads.About the RoleThis is a highly collaborative role where you will be working...


  • Cupertino, California, United States Centraprise Full time

    Job Title: Data Engineer with Machine Learning Expertise at CupertinoWe are seeking a highly skilled Data Engineer with expertise in Machine Learning to join our team at Centraprise in Cupertino, CA. As a key member of our engineering team, you will be responsible for designing, developing, and deploying scalable data pipelines and machine learning...