Distributed Training Solutions Developer

2 weeks ago


Mountain View, California, United States Nuro Full time

**Overview**


Nuro is a robotics company that's changing the game with its autonomous driving technology. As a leader in the industry, we're always pushing the boundaries of innovation. Our team is passionate about developing solutions that make a real difference in people's lives.

We're currently looking for a skilled Machine Learning Infrastructure Engineer to join our team. In this role, you'll have the opportunity to work on exciting projects that involve building scalable machine learning infrastructure and distributed training solutions. Your expertise will help drive the success of our business, and you'll be part of a collaborative environment that encourages creativity and growth.

About the Job

Your responsibilities as a Machine Learning Infrastructure Engineer will include:

  • Developing and implementing new distributed training frameworks and strategies to support large-scale deep learning model training.
  • Optimizing model training speed by refining TensorFlow, Keras, PyTorch, and CUDA kernel implementations.
  • Designing and building advanced tools to monitor model training performance and detect/triage training issues.

About You

To excel in this position, you should possess:

  • At least 2 years of relevant work experience or an equivalent background in PhD research.
  • In-depth knowledge of machine learning models and the ML development lifecycle.
  • Hands-on experience with cloud-based distributed training platforms that support data and model parallelism.
  • Strong analytical skills to investigate and optimize training performance bottlenecks for deep learning models.

We offer a competitive salary range of $167,200-$250,800, based on your experience and qualifications. You'll also be eligible for an annual performance bonus, equity, and a comprehensive benefits package. Our inclusive culture values diversity and welcomes employees from all backgrounds.



  • Mountain View, California, United States Waymo Full time

    About the Job: The Waymo ML Infrastructure team is seeking an experienced Senior Machine Learning Engineer, Training to work on developing infrastructure components for distributed training and implementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure. This Hybrid role requires: Developing the...


  • Mountain View, California, United States Waymo Full time

    Job Summary: Waymo is looking for a skilled Senior Machine Learning Engineer, Training to join our Hybrid team. In this role, you will develop the infrastructure components necessary for distributed training, implement automation solutions, and monitor system health. If you have experience building distributed systems and working with Machine Learning...


  • Mountain View, California, United States Waymo Full time

    Job Description: This Hybrid role reports to our TLM of Machine Learning Training and involves: developing the infrastructure components necessary for distributed training; implementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure; monitoring system health and performing routine maintenance tasks...


  • Mountain View, California, United States Waymo Full time

    About the Company: Waymo is a leader in autonomous driving technology, working to improve access to mobility while saving thousands of lives. Since 2009, we've focused on building the Waymo Driver—the world's most experienced driver—using cutting-edge artificial intelligence and machine learning algorithms. Job Summary: We're seeking an experienced...


  • Mountain View, California, United States Waymo Full time

    About Waymo: Waymo is an innovative autonomous driving technology company with a mission to provide the most trusted driver. Our team has been focused on building the Waymo Driver, the world's most experienced driver, to improve access to mobility and save thousands of lives lost to traffic crashes. The Waymo Driver powers our fully autonomous ride-hailing...


  • Mountain View, California, United States Waymo Full time

    Overview: At Waymo, we're working towards a future where everyone can get where they need to go without needing a car. We're looking for a skilled Machine Learning Engineer, Training to help us achieve this goal. Key Responsibilities: In this hybrid role, you will report to the Technical Lead Manager of Machine Learning Training. Your primary responsibilities...


  • Mountain View, California, United States Waymo Full time

    Company Overview: Waymo is a pioneering autonomous driving technology company dedicated to creating the world's most trusted driver. With its roots in the Google Self-Driving Car Project, Waymo has been working tirelessly since 2009 to build the Waymo Driver, an AI system designed to improve access to mobility while saving countless lives lost to traffic...


  • Mountain View, California, United States Waymo Full time

    Taking Autonomous Driving to the Next Level: At Waymo, we're pushing the boundaries of what's possible with autonomous driving technology. As a Senior Distributed Systems Developer, you'll have the chance to work on high-impact projects that drive innovation and growth. About the Position: Design and develop scalable distributed training infrastructure...


  • Mountain View, California, United States Waymo Full time

    Job Description: Waymo is an autonomous driving technology company with the mission to become the most trusted driver. We are seeking a skilled Machine Learning Distributed Systems Developer to join our Hybrid team. In this role, you will report to our TLM of Machine Learning Training and work closely with Research and Production teams to develop models in...


  • Mountain View, California, United States Intrinsic Full time

    At Intrinsic, we believe that advances in AI, perception, and simulation will transform industrial robotics. Our team of experts is passionate about unlocking the creative and economic potential of industrial robotics. About the Role: We are seeking a highly skilled Distributed Automation Systems Developer to join our team. As a key contributor, you will be...


  • Mountain View, California, United States Intrinsic Full time

    Job Summary: We're seeking an exceptional Distributed Systems Engineer to join our team. As a key contributor, you will play a critical role in designing and implementing a distributed cloud and on-premises system that enables users worldwide to develop and deploy automation solutions. Your expertise in distributed systems, cloud computing, and robotics will...


  • Mountain View, California, United States Turnblock Full time

    At Turnblock, we're at the forefront of crypto's cutting-edge technology, and we're seeking a talented Blockchain Developer to join our team. This is a remote position for any US candidate. We're developing a Blockchain Distribution Network (BDN) that empowers DeFi traders to make better trades by connecting them with everyone in the decentralized world. The...


  • Mountain View, California, United States Waymo Full time

    Company Overview: Waymo is an autonomous driving technology company with the mission to be the most trusted driver. Our team works on developing models in Perception and Planning that are core to our autonomous driving software. We collaborate closely with teams at Google to offer the best solutions for the entire model development lifecycle, ensuring...


  • Mountain View, California, United States LinkedIn Full time

    Job Description: We're looking for a seasoned Distributed Systems Architect to join our world-class software engineering team at LinkedIn. As a critical member of our infrastructure team, you'll play a pivotal role in shaping the next-generation infrastructure and platforms that power our platform. With a focus on building scalable, secure, and reliable...


  • Mountain View, California, United States Waymo Full time

    About the Role: We're looking for a highly skilled Senior Machine Learning Engineer, Training to join our Waymo ML Infrastructure team. In this role, you'll develop infrastructure components for distributed training and implement automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure. This Hybrid role...


  • Mountain View, California, United States Waymo Full time

    About the Job: We are seeking a highly skilled Distributed Machine Learning Architect to join our team at Waymo. In this role, you will be responsible for designing and implementing a scalable and reliable distributed training infrastructure that can handle large-scale machine learning workloads. Our ideal candidate will have experience with distributed systems...


  • Mountain View, California, United States MatX Full time

    At MatX, we're revolutionizing AI with vertically integrated solutions that unlock the full potential of silicon and systems. We're driven to create cutting-edge technology for efficient ML workloads. Key Responsibilities: We're seeking a skilled professional to design and implement performance models and tooling to inform scheduling decisions for current and...


  • Mountain View, California, United States Waymo Full time

    About the Role: At Waymo, we are dedicated to creating the world's most trusted driver. As a member of our Hybrid team, you will play a critical role in developing the infrastructure components necessary for distributed training, implementing automation solutions, and identifying performance bottlenecks and optimization opportunities. If you have experience...


  • Mountain View, California, United States Waymo Full time

    Developing Autonomous Driving Technology: Waymo is a leading autonomous driving technology company, dedicated to making transportation safer and more accessible. Our mission is to be the most trusted driver, and we're achieving this through cutting-edge innovations in machine learning and software development. We're seeking an experienced Senior Machine...


  • Mountain View, California, United States Waymo Full time

    About Us: Waymo is an autonomous driving technology company dedicated to improving access to mobility and saving lives. Our mission is to be the most trusted driver, building on our experience autonomously driving millions of miles on public roads and tens of billions in simulation across 13+ U.S. states. Our Team: The Waymo ML Infrastructure team collaborates...