Senior Software Development Engineer, Machine Learning Infrastructure Team

4 days ago


Cupertino, California, United States Amazon Full time
About the Role

We are seeking a highly skilled Senior Software Development Engineer to join our Machine Learning (ML) Infrastructure team. As a key member of this team, you will be responsible for designing and developing the tools and infrastructure that support the success of our ML and High Performance Computing (HPC) technologies.

As a Senior Software Development Engineer, you will lead the development of our CI/CD pipelines, ensuring that our software is delivered to customers efficiently and effectively. You will also be responsible for developing and maintaining our infrastructure, including our Linux-based systems and public AWS products.

We are looking for a candidate with exceptional knowledge of CI/CD automation and of ML and HPC benchmarks and applications. You should have experience with Python, TypeScript, and CDK, as well as proficiency in Linux and containers. Experience with clustered ML or HPC applications or benchmarks, especially using SLURM or in AWS, is also highly desirable.

Key Responsibilities
  • Lead the development of our CI/CD pipelines, ensuring that our software is delivered to customers efficiently and effectively.
  • Develop and maintain our infrastructure, including our Linux-based systems and public AWS products.
  • Design and develop tools and infrastructure that support the success of our ML and HPC technologies.
  • Collaborate with cross-functional teams to ensure that our software meets the needs of our customers.
About the Team

We are part of Annapurna Labs, a subsidiary within AWS that builds the software and hardware that make ML on EC2 work. Our organization is a dedicated group of innovators who have invented new networks, new silicon, and new software suites, and combined them to entice customers to move immense ML and HPC workloads to the cloud.

The ML Infrastructure team is laser-focused on making AWS the best and most cost-effective place for customers to do AI at scale.



  • Cupertino, California, United States Apple Full time

    About the Role: We are seeking an experienced Software Development Engineer to join our iCloud Services team. As a key member of our team, you will be responsible for designing and building scalable machine learning infrastructure to support our customers. Responsibilities: Design and implement scalable machine learning infrastructure to support our...


  • Cupertino, California, United States Apple Full time

    Job Description: Cupertino, California, United States. Software and Services. Our team is dedicated to bringing the latest machine learning techniques to the field of software development. We aim to make it easier for our developers to write, build, and test their code. We believe that providing better, more intelligent tools and infrastructure to our developers...


  • Cupertino, California, United States Apple Full time

    Job Description: We're seeking a highly skilled Machine Learning Engineer to join our team at Apple. As a key member of our AI Systems Development team, you will be responsible for designing and developing high-performance machine learning models and infrastructure in concert with designers, software/hardware engineers, and applied R&D teams. Key...


  • Cupertino, California, United States Bayside Solutions Full time

    Job Title: Software Engineer, Machine Learning. We are seeking a highly skilled Software Engineer to join our team in Cupertino, CA. As a Software Engineer, Machine Learning, you will be responsible for migrating customers from our previous service to our new one, working with multiple partner teams and customers to deploy our services in their accounts. Key...


  • Cupertino, California, United States Apple Full time

    Job Summary: We are seeking a highly skilled Senior Software Engineer to join our dynamic team in Apple's ML Platform & Technology group. This role will focus on designing, building, and maintaining large-scale distributed systems to support the end-to-end machine learning (ML) lifecycle. Key Responsibilities: Design and develop large-scale distributed systems...


  • Cupertino, California, United States Apple Full time

    Job Summary: We are seeking a highly skilled Senior Software Engineer to join our Apple Maps Data Infrastructure team. As a key contributor, you will be responsible for building capabilities across a spectrum of technologies in a hybrid-cloud environment. Key Responsibilities: Design and develop innovative solutions for large-scale data processing and machine...


  • Cupertino, California, United States Bayside Solutions Full time

    Job Title: Software Engineer, Machine Learning. Job Summary: Bayside Solutions, Inc. is seeking a skilled Software Engineer, Machine Learning to join our engineering team. As a key member of our team, you will be responsible for migrating customers from our previous service to our new one, working with multiple partner teams and customers to deploy our services...


  • Cupertino, California, United States Apple Full time

    About the Role: We are seeking a highly skilled Senior Engineering Program Manager to join our Machine Learning Platform & Technology (MLPT) team at Apple. As a key member of our team, you will be responsible for establishing cross-functional partnerships with all of Apple's ML partners to understand their use cases and improve the ease of use of our compute...


  • Cupertino, California, United States Apple Full time

    About the Role: We are seeking a highly skilled Senior Engineering Program Manager to join our Machine Learning Platform & Technology (MLPT) team at Apple. As a key member of our team, you will be responsible for establishing cross-functional partnerships with all of Apple's ML partners to improve the ease of use of our compute services. You will collaborate...


  • Cupertino, California, United States Apple Full time

    Job Summary: We are seeking an exceptional Senior Machine Learning Engineer to join our OS Intelligence team at Apple. As a key member of our team, you will design and implement innovative Machine Learning solutions that enhance the user experience of our operating systems. Key Responsibilities: Design and develop Deep Learning architectures to improve the...


  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    About the Role: We are seeking a highly skilled Senior Runtime Software Development Engineer to join our team at Annapurna Labs (U.S.) Inc. in the AWS AI organization. As a key member of our team, you will be responsible for designing and developing innovative software solutions for our machine learning accelerators, including the AWS Inferentia and Trainium...


  • Cupertino, California, United States Apple Full time

    Job Summary: We are seeking a highly skilled Senior Machine Learning Engineer to join our Advertising Platforms team at Apple. As a key member of our team, you will develop and apply advanced techniques and algorithms to improve our ad network. Your expertise in machine learning, statistics, and optimization will enable you to design and implement new...


  • Cupertino, California, United States Apple Full time

    About the Role: We are seeking a highly skilled Senior Engineering Program Manager to join our Machine Learning Platform and Technologies (MLPT) team in AI/ML. As a key member of our team, you will be responsible for simplifying and accelerating the adoption of machine learning in Apple products and ecosystems. As a technical program manager, you will partner...


  • Cupertino, California, United States Apple Full time

    Job Summary: We are seeking an experienced technical leader to join our Apple Data group as a Senior Manager. The successful candidate will be responsible for leading the planning, execution, and delivery of technical projects that improve the user experience throughout their data and machine learning workflow. About the Role: As a Senior Manager in the Apple...


  • Cupertino, California, United States Amazon Full time

    About the Role: This is a unique opportunity to join the Machine Learning Applications team at Amazon, where you will be responsible for developing, enabling, and performance tuning a wide variety of machine learning model families. As a software engineer in this team, you will work closely with compiler engineers and runtime engineers to create, build, and...


  • Cupertino, California, United States Apple Full time

    Job Title: Senior Machine Learning Engineer. At Apple, we're pushing the boundaries of what's possible with technology. As a Senior Machine Learning Engineer, you'll play a key role in developing and applying advanced techniques and algorithms to improve our ad network. Key Responsibilities: Design, develop, and implement new optimization and advertising...


  • Cupertino, California, United States Annapurna Labs (U.S.) Inc. Full time

    About the Role: As a Software Development Manager for Machine Learning Acceleration, you will be responsible for leading a team of engineers to design and deploy machine learning applications and use cases on various frameworks such as PyTorch, JAX, and TensorFlow. You will be responsible for the full development life cycle of our integrations and extensions...


  • Cupertino, California, United States Apple Full time

    About the Role: We are seeking a highly skilled Senior Machine Learning Engineer to join our team at Apple. As a key member of our team, you will be responsible for designing, implementing, and training machine learning models to advance Siri's capabilities. Our team is responsible for developing and shipping innovative technology to advance Siri and Apple...


  • Cupertino, California, United States Apple Full time

    About the Role: We are seeking a highly skilled Senior Machine Learning Engineer to join our team at Apple. As a key member of our team, you will play a critical role in developing and shipping innovative technology to advance Siri and Apple Intelligence. As a Senior Machine Learning Engineer, you will be responsible for designing, implementing, and training...


  • Cupertino, California, United States Amazon Full time

    About the Role: This is a software engineer position in the Machine Learning Applications (ML Apps) team for AWS Neuron. The team works on development, enablement, and performance tuning of machine learning models, including large language models and vision transformers. The ideal candidate will have experience optimizing inference performance for latency and...