Software Development Manager

3 days ago


Cupertino, California, United States Haystack Full time

Software Development Manager, LLM Inference Model Enablement, Neuron SDK | Cupertino, California | Remote-Friendly | $166,400 - $287,700
We're working with Annapurna Labs (U.S.) Inc. on this exciting opportunity.

This role offers a unique opportunity to lead a team of expert AI/ML engineers in optimizing and enabling state-of-the-art open-source and customer LLMs on custom AWS machine learning accelerators. You'll drive innovation in model enablement speed and inference usability, working across a vertically integrated system stack that includes PyTorch, Neuron compiler, and runtime.

Key Responsibilities

  • Lead a team of expert AI/ML engineers to onboard and optimize open-source and customer LLMs for inference on Neuron, Trainium, and Inferentia accelerators.
  • Drive improvements in model enablement speed and overall experience.
  • Advance inference usability and quality through new features, infrastructure optimization, tools, and automation.
  • Define and deliver model enablement and performance optimization for the latest state-of-the-art LLMs in collaboration with senior management.

What You'll Need

  • 3+ years of engineering team management experience.
  • Strong background in LLM model architectures, performance optimizations, and inference techniques using distributed inference libraries.
  • Ability to manage demanding, fast-changing priorities in a dynamic environment.
  • Strong technical ability to understand and deliver as part of a vertically integrated system stack including PyTorch inference library, Neuron compiler, runtime, and collectives.

What's On Offer

  • Opportunities for mentorship and career growth within AWS.
  • A focus on work-life harmony and flexibility.

Apply via Haystack today



  • Cupertino, California, United States Amazon Full time $166,400 - $287,700

    "AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on....


  • Cupertino, California, United States Amazon Data Services, Inc. Full time

    DESCRIPTION"AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we're the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation...


  • Cupertino, California, United States Apple Full time

    The Apple Services Engineering (ASE) Commerce group is looking for an extraordinary back-end Java software engineer to join our Account Services software engineering team. The Commerce team provides the transactional engine for App Store, Apple Music, Apple TV+ and more. Our platform is the highest volume digital content store in the world, serving billions...


  • Cupertino, California, United States Apple Full time

    SummaryImagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products very quickly. Bring passion and dedication to your job, and there's no telling what we can accomplish together. We're looking for a hardworking and passionate person to join this amazing team, and if you feel this is you, we'd love to hear from younnThe...


  • Cupertino, California, United States Apple Full time

    Are you a big-picture problem solver who loves setting daring goals? Do you have a passion for understanding how each line of code affects all the others? In the Core Operating Systems group we ensure the OS is inseparable from each device's identity as a whole. That's because this group is committed to building fully integrated operating systems that...

  • Software Developer

    1 week ago


    Cupertino, California, United States Apple Full time

    Do you love creating elegant solutions to complex challenges? Are you committed to user experience? Do you thrive in a fast-paced, collaborative environment? If so, we would love to hear from youAt Apple, we constantly look to improve energy efficiency and are always finding ways to enrich our customer's battery life and charging experience. In this role,...


  • Cupertino, California, United States Amazon Full time $168,100 - $261,500

    Our Machine Learning Acceleration (MLA) team develops the SOCs that are used to power today's AI workloads in datacenters all around the world. As a DevOps Software Engineer, you'll contribute to the project at the ground level by automating workflows and developing large-scale solutions that accelerate silicon development - it's still Day One here at...


  • Cupertino, California, United States Amazon Full time

    Come change the way the world sees the CloudWhat do we do?We build platforms and tools that ensure the health of AWS hardware by testing every new and rebuilt system across all AWS data centers. Our platform enables service owners such as EC2, EBS, S3, and other to deliver healthy servers for their service to the production. Our team leads a large-scale...


  • Cupertino, California, United States Amazon Full time $129,300 - $223,600

    Come change the way the world sees the CloudWhat do we do?We build platforms and tools that ensure the health of AWS hardware by testing every new and rebuilt system across all AWS data centers. Our platform enables service owners such as EC2, EBS, S3, and other to deliver healthy servers for their service to the production. Our team leads a large-scale...


  • Cupertino, California, United States Apple Full time

    Apple is where talented individuals gather together, committing to the values that lead to great work. It's the diversity of our people and their thinking that inspires the innovation that runs through everything we do. In the DevPubs content engineering team, we tell the world about innovative new features and capabilities available to developers. We are...