Distributed Training Solutions Developer
2 weeks ago
**Overview**
">Nuro is a cutting-edge robotics company that's changing the game with its autonomous driving technology. As a leader in the industry, we're always pushing the boundaries of innovation. Our team is passionate about developing cutting-edge solutions that make a real difference in people's lives.
We're currently looking for a skilled Machine Learning Infrastructure Engineer to join our team. In this role, you'll have the opportunity to work on exciting projects that involve building scalable machine learning infrastructure and distributed training solutions. Your expertise will help drive the success of our business, and you'll be part of a collaborative environment that encourages creativity and growth.
About the Job
Your responsibilities as a Machine Learning Infrastructure Engineer will include:
- Developing and implementing new distributed training frameworks and strategies to support large-scale deep learning model training.
- Optimizing model training speed by refining Tensorflow, Keras, Pytorch, and Cuda kernel implementation.
- Designing and building advanced tools to monitor model training performance and detect/triage training issues.
About You
To excel in this position, you should possess:
- At least 2 years of relevant work experience or an equivalent background in PhD research.
- In-depth knowledge of machine learning models and the ML development lifecycle.
- Hands-on experience with cloud-based distributed training platforms that support data and model parallelism.
- Strong analytical skills to investigate and optimize training performance bottlenecks for deep learning models.
We offer a competitive salary range of $167,200-$250,800, based on your experience and qualifications. You'll also be eligible for an annual performance bonus, equity, and a comprehensive benefits package. Our inclusive culture values diversity and welcomes employees from all backgrounds.
-
Distributed Training Specialist
6 days ago
Mountain View, California, United States Waymo Full timeAbout the JobThe Waymo ML Infrastructure team is seeking an experienced Senior Machine Learning Engineer, Training to work on developing infrastructure components for distributed training and implementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure.This Hybrid role requires:Developing the...
-
Senior Distributed Training Specialist
2 weeks ago
Mountain View, California, United States Waymo Full timeJob SummaryWaymo is looking for a skilled Senior Machine Learning Engineer, Training to join our Hybrid team. In this role, you will develop the infrastructure components necessary for distributed training, implement automation solutions, and monitor system health. If you have experience building distributed systems and working with Machine Learning...
-
Distributed Training Systems Specialist
6 days ago
Mountain View, California, United States Waymo Full timeJob DescriptionThis Hybrid role reports to our TLM of Machine Learning Training and involves:Developing the infrastructure components necessary for distributed trainingImplementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructureMonitoring system health and performing routine maintenance tasks...
-
Distributed Training Systems Engineer
3 weeks ago
Mountain View, California, United States Waymo Full timeAbout the CompanyWaymo is a leader in autonomous driving technology, working to improve access to mobility while saving thousands of lives. Since 2009, we've focused on building the Waymo Driver—the world's most experienced driver—using cutting-edge artificial intelligence and machine learning algorithms.Job SummaryWe're seeking an experienced...
-
Distributed Training Infrastructure Engineer
4 weeks ago
Mountain View, California, United States Waymo Full timeAbout WaymoWaymo is an innovative autonomous driving technology company with a mission to provide the most trusted driver. Our team has been focused on building the Waymo Driver, the world's most experienced driver, to improve access to mobility and save thousands of lives lost to traffic crashes.The Waymo Driver powers our fully autonomous ride-hailing...
-
Distributed Training Systems Engineer
5 days ago
Mountain View, California, United States Waymo Full timeOverview: At Waymo, we're working towards a future where everyone can get where they need to go without needing a car. We're looking for a skilled Machine Learning Engineer, Training to help us achieve this goal.Key Responsibilities: In this hybrid role, you will report to the Technical Lead Manager of Machine Learning Training. Your primary responsibilities...
-
Distributed Training Infrastructure Specialist
4 weeks ago
Mountain View, California, United States Waymo Full timeCompany OverviewWaymo is a pioneering autonomous driving technology company dedicated to creating the world's most trusted driver. With its roots in the Google Self-Driving Car Project, Waymo has been working tirelessly since 2009 to build the Waymo Driver, an AI system designed to improve access to mobility while saving countless lives lost to traffic...
-
Senior Distributed Systems Developer
5 days ago
Mountain View, California, United States Waymo Full timeTaking Autonomous Driving to the Next LevelAt Waymo, we're pushing the boundaries of what's possible with autonomous driving technology. As a Senior Distributed Systems Developer, you'll have the chance to work on high-impact projects that drive innovation and growth.About the Position:Design and develop scalable distributed training infrastructure...
-
Mountain View, California, United States Waymo Full timeJob DescriptionWaymo is an autonomous driving technology company with the mission to become the most trusted driver. We are seeking a skilled Machine Learning Distributed Systems Developer to join our Hybrid team.In this role, you will report to our TLM of Machine Learning Training and work closely with Research and Production teams to develop models in...
-
Distributed Automation Systems Developer
6 days ago
Mountain View, California, United States Intrinsic Full timeAt Intrinsic, we believe that advances in AI, perception, and simulation will transform industrial robotics. Our team of experts is passionate about unlocking the creative and economic potential of industrial robotics.About the Role:We are seeking a highly skilled Distributed Automation Systems Developer to join our team. As a key contributor, you will be...
-
Distributed Systems Engineer
5 days ago
Mountain View, California, United States Intrinsic Full timeJob Summary: We're seeking an exceptional Distributed Systems Engineer to join our team. As a key contributor, you will play a critical role in designing and implementing a distributed cloud and on-premises system that enables users worldwide to develop and deploy automation solutions. Your expertise in distributed systems, cloud computing, and robotics will...
-
Mountain View, California, United States Turnblock Full timeAt Turnblock, we're at the forefront of crypto's cutting-edge technology, and we're seeking a talented Blockchain Developer to join our team. This is a remote position for any US candidate.We're developing a Blockchain Distribution Network (BDN) that empowers DeFi traders to make better trades by connecting them with everyone in the decentralized world.The...
-
Autonomous Driving Systems Developer
2 weeks ago
Mountain View, California, United States Waymo Full timeCompany OverviewWaymo is an autonomous driving technology company with the mission to be the most trusted driver. Our team works on developing models in Perception and Planning that are core to our autonomous driving software. We collaborate closely with teams at Google to offer the best solutions for the entire model development lifecycle, ensuring...
-
Distributed Systems Architect
5 days ago
Mountain View, California, United States LinkedIn Full timeJob DescriptionWe're looking for a seasoned Distributed Systems Architect to join our world-class software engineering team at LinkedIn. As a critical member of our infrastructure team, you'll play a pivotal role in shaping the next-generation infrastructure and platforms that power our platform. With a focus on building scalable, secure, and reliable...
-
Machine Learning Infrastructure Developer
6 days ago
Mountain View, California, United States Waymo Full timeAbout the RoleWe're looking for a highly skilled Senior Machine Learning Engineer, Training to join our Waymo ML Infrastructure team. In this role, you'll develop infrastructure components for distributed training and implement automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure.This Hybrid role...
-
Scalable AI Framework Developer
5 days ago
Mountain View, California, United States Waymo Full timeAbout the JobWe are seeking a highly skilled Distributed Machine Learning Architect to join our team at Waymo. In this role, you will be responsible for designing and implementing a scalable and reliable distributed training infrastructure that can handle large-scale machine learning workloads.Our ideal candidate will have experience with distributed systems...
-
AI Solutions Architect
6 days ago
Mountain View, California, United States MatX Full timeAt MatX, we're revolutionizing AI with vertically integrated solutions that unlock the full potential of silicon and systems. We're driven to create cutting-edge technology for efficient ML workloads.Key ResponsibilitiesWe're seeking a skilled professional to design and implement performance models and tooling to inform scheduling decisions for current and...
-
Autonomous Driving Systems Developer
2 weeks ago
Mountain View, California, United States Waymo Full timeAbout the RoleAt Waymo, we are dedicated to creating the world's most trusted driver. As a member of our Hybrid team, you will play a critical role in developing the infrastructure components necessary for distributed training, implementing automation solutions, and identifying performance bottlenecks and optimization opportunities. If you have experience...
-
Autonomous Driving Software Developer
5 days ago
Mountain View, California, United States Waymo Full timeDeveloping Autonomous Driving TechnologyWaymo is a leading autonomous driving technology company, dedicated to making transportation safer and more accessible. Our mission is to be the most trusted driver, and we're achieving this through cutting-edge innovations in machine learning and software development.We're seeking an experienced Senior Machine...
-
Autonomous Driving Software Developer
5 days ago
Mountain View, California, United States Waymo Full timeAbout UsWaymo is an autonomous driving technology company dedicated to improving access to mobility and saving lives. Our mission is to be the most trusted driver, building on our experience autonomously driving millions of miles on public roads and tens of billions in simulation across 13+ U.S. states.Our TeamThe Waymo ML Infrastructure team collaborates...