ML Compute Acceleration Engineer

7 days ago


Cupertino, California, United States Apple Full time $147,400 - $272,100 per year

Apple's Compute Frameworks team in GPU, Graphics and Displays org provides a suite of high-performance data parallel algorithms for developers inside and outside of Apple for iOS, macOS and Apple TV. Our efforts are currently focused in the key areas of linear algebra, image processing, machine learning, along with other projects of key interest to Apple. We are always looking for exceptionally dedicated individuals to grow our outstanding team to lay the foundation of technologies like Apple Intelligence.

Description

Our team is seeking extraordinary machine learning and GPU programming engineers who are passionate about providing robust compute solutions for accelerating machine learning networks on Apple Silicon using GPU and Neural Engine. Role has the opportunity to influence the design of compute and programming models in next generation GPU and Neural Engine architectures. Responsibilities: * Adding optimizations in machine learning computation graph. * Defining and implementing APIs in Metal Performance Shaders Graph, investigating new algorithms. * Developing and maintaining MLIR dialect in Apple and open source with upgrades using latest LLVM. * Performing in-depth analysis, compiler and kernel level optimizations to ensure the best possible performance across hardware families. * Tune GPU and Neural Engine accelerated compute across products. * Tuning the cost model and optimizing runtime dispatch to multiple IPs to get best performance on Apple Silicon. Intended deliverables: * GPU Compute acceleration technology. * Apple Intelligence implementation and acceleration. * Optimized compute graphs across products. If this sounds of interest, we would love to hear from you

Minimum Qualifications

  • Proven programming and problem-solving skills.
  • Good understanding of machine learning fundamentals.
  • GPU compute programming models & optimization techniques.
  • GPU compute framework development, maintenance, and optimization.
  • Experience with system level programming and computer architecture.
  • Experience with high performance parallel programming, GPU programming or LLVM/MLIR compiler infrastructure is a plus.

Preferred Qualifications

  • Background in mathematics, including linear algebra and numerical methods.
  • Strong communication and collaboration skills.
  • Strong background of building high performance, production quality software on schedule.
  • Experience with compiler technologies.
  • Experience with adding computational graph support, runtime or device backend to Machine learning libraries (TensorFlow, PyTorch or JAX) support is a plus.

Pay & Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .

Submit Resume



  • Cupertino, California, United States Amazon Full time $129,300 - $223,600

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads...


  • Cupertino, California, United States Amazon Web Services (AWS) Full time

    DescriptionThe Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium.The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI...

  • AIML - ML Engineer

    13 hours ago


    Cupertino, California, United States Apple Full time

    As part of Apple's Machine Learning Research organization, we do world-class scientific research and build the technologies that will power future products at Apple. The ML Research Team does world-class research and development across a wide range of domains including understanding and improving ML, addressing bias and fairness in algorithms, privacy and...


  • Cupertino, California, United States Apple Full time $150,000 - $250,000 per year

    We're building the foundation for intelligent, adaptive AI systems from multi-agent platforms and RAG pipelines to advanced evaluation and reasoning frameworks. We're looking for a Senior Applied ML Engineer to design, build, and scale machine learning systems that power next-generation AI applications. In this role, you'll work at the intersection of...


  • Cupertino, California, United States Amazon Full time $151,300 - $261,500

    AWS Utility Computing (UC) provides product Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical...

  • ML Engineer

    18 hours ago


    Cupertino, California, United States Apple Full time

    At Apple, we believe in the power of technology to enrich people's lives. Everything we build is designed to empower people, including our advertising platform. We deliver ads in a way that benefits both customers and advertisers - helping people discover content, supporting creators, and protecting and respecting everyone's privacy. Our technology makes...


  • Cupertino, California, United States Apple Full time $150,000 - $250,000 per year

    Apple's Graphics, Games, and Machine Learning team provides the compute and graphics software foundation across all of our innovative products including iPhone, iPad, Apple TV, Mac, Vision Pro, and Apple Watch. The Pre-silicon Compute Frameworks team is seeking a senior engineer to influence and lead new features in Metal and Metal compute frameworks...


  • Cupertino, California, United States Amazon Full time $129,300 - $223,600

    AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machinelearning accelerators and the Trn1 and Inf1 servers that use them. This role is for a software engineer in the Machine Learning Applications (ML Apps) team for AWS Neuron. This role is responsible for development, enablement and performance tuning of a wide...


  • Cupertino, California, United States myGwork - LGBTQ+ Business Community Full time

    This job is with Amazon, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.DescriptionAWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machinelearning accelerators and the Trn1 and Inf1 servers that use them....


  • Cupertino, California, United States Apple Full time $120,000 - $180,000 per year

    Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the...