Senior Distributed Training Specialist

3 weeks ago


Mountain View, California, United States Waymo Full time
Job Summary

Waymo is looking for a skilled Senior Machine Learning Engineer, Training to join our Hybrid team. In this role, you will develop the infrastructure components necessary for distributed training, implement automation solutions, and monitor system health. If you have experience building distributed systems and working with Machine Learning frameworks, we encourage you to apply.

Key Responsibilities

• Develop scalable ML frameworks to enhance the developer experience and performance
• Collaborate with Research and Production teams to develop models in Perception and Planning
• Design and implement automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure
• Identify performance bottlenecks and optimization opportunities

Requirements

• Bachelor's degree in Computer Science, Engineering, or related field, or 4+ years equivalent experience
• Experience building distributed systems for production environments
• Solid Python or C++ skills
• Prior experience with Machine Learning frameworks (e.g., TensorFlow, PyTorch) and distributed training algorithms

Salary Range

$192,000-$243,000 USD per year. Eligible for our discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

  • Mountain View, California, United States Waymo Full time

    About the JobThe Waymo ML Infrastructure team is seeking an experienced Senior Machine Learning Engineer, Training to work on developing infrastructure components for distributed training and implementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure.This Hybrid role requires:Developing the...


  • Mountain View, California, United States Waymo Full time

    Job DescriptionThis is a unique opportunity to join our team as a Machine Learning Infrastructure Engineer. You will be responsible for developing infrastructure components necessary for distributed training, including job scheduling, resource management, data distribution, and model synchronization.About The RoleYou will be working closely with the ML...


  • Mountain View, California, United States Waymo Full time

    Job DescriptionThis Hybrid role reports to our TLM of Machine Learning Training and involves:Developing the infrastructure components necessary for distributed trainingImplementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructureMonitoring system health and performing routine maintenance tasks...


  • Mountain View, California, United States Waymo Full time

    Taking Autonomous Driving to the Next LevelAt Waymo, we're pushing the boundaries of what's possible with autonomous driving technology. As a Senior Distributed Systems Developer, you'll have the chance to work on high-impact projects that drive innovation and growth.About the Position:Design and develop scalable distributed training infrastructure...


  • Mountain View, California, United States Waymo Full time

    Overview: At Waymo, we're working towards a future where everyone can get where they need to go without needing a car. We're looking for a skilled Machine Learning Engineer, Training to help us achieve this goal.Key Responsibilities: In this hybrid role, you will report to the Technical Lead Manager of Machine Learning Training. Your primary responsibilities...


  • Mountain View, California, United States Nuro Full time

    **Overview**">Nuro is a cutting-edge robotics company that's changing the game with its autonomous driving technology. As a leader in the industry, we're always pushing the boundaries of innovation. Our team is passionate about developing cutting-edge solutions that make a real difference in people's lives.We're currently looking for a skilled Machine...


  • Mountain View, California, United States Moveworks Full time

    What You Will DoYou will be responsible for architecting the next generation of Moveworks' AI infrastructure, ensuring reliability, resilience, and scalability. As a senior member of the Core Infrastructure team, you will work closely with machine learning, search, product, data, and frontend teams to understand their infrastructure needs and influence the...


  • Mountain View, California, United States Databricks Full time

    About the JobWe're looking for a talented Distributed Systems Optimization Specialist to join our team at Databricks. In this role, you'll be responsible for optimizing the performance of our data and AI platform, ensuring it meets the needs of our customers.The Impact You'll HaveIdentify performance limitations of our entire stack based on telemetry,...


  • Mountain View, California, United States Waymo Full time

    **Role Summary**We're looking for a highly skilled Senior Machine Learning Engineer, Training to join our team at Waymo. As a senior engineer, you will be responsible for developing the infrastructure necessary for distributed training, implementing automation solutions, and monitoring system health.You will work closely with our ML Infrastructure team to...


  • Mountain View, California, United States Moveworks Full time

    Job DescriptionWe are seeking a highly skilled Senior Machine Learning Infrastructure Specialist to join our team at Moveworks. As a critical member of our AI infrastructure team, you will play a key role in building and optimizing cutting-edge machine learning systems for large language models.In this position, you will work closely with our...


  • Mountain View, California, United States Senior Helpers - Sunbury, PA Full time

    Job Title: Caregiver Professional">About Senior Helpers - Sunbury, PA:We are a trusted provider of home care services dedicated to improving the quality of life for seniors and their families. Our team of experienced caregivers deliver personalized care and support to meet the unique needs of each client.Estimated Salary Range: $13-15 per hour (dependent on...


  • Mountain View, California, United States Intuit Full time

    Job Summary:Intuit is looking for a seasoned IT Support Specialist to join our Executive Support team in Mountain View. This role requires a minimum of 5-7 years of experience in desktop support, with at least 2-3 years of direct senior-level executive support experience. The successful candidate will have advanced knowledge of Exchange Mail, Active...


  • Mountain View, California, United States DataBricks Full time

    Job OverviewThe R&D Operations Organization at DataBricks is seeking a Senior Technical Program Manager (TPM) with expertise in distributed AI, resource management, forecasting, and strategic planning.This role involves supporting platform operations, handling customer escalations, and monitoring cluster health to ensure optimal compute resource allocation...


  • Mountain View, California, United States Waymo Full time

    Job DescriptionWaymo is an autonomous driving technology company with the mission to become the most trusted driver. We are seeking a skilled Machine Learning Distributed Systems Developer to join our Hybrid team.In this role, you will report to our TLM of Machine Learning Training and work closely with Research and Production teams to develop models in...

  • Training Specialist

    1 week ago


    Mountain View, California, United States RODGERS CONSULTING SERVICE INC Full time

    Rodgers Consulting Services is a state-licensed organization offering support to individuals with developmental disabilities, helping them achieve independence and excel in life. We are seeking a Training Specialist to join our team.Job Summary:To provide independent living skills training for adult consumers with developmental disabilities.Implement...


  • Mountain View, California, United States Waymo Full time

    Company OverviewWaymo is a leader in autonomous driving technology with a mission to improve access to mobility while saving thousands of lives. Our innovative Waymo Driver has provided over one million rider-only trips, enabling its experience autonomously driving tens of millions of miles on public roads and simulation.Job DescriptionWe are seeking an...


  • Mountain View, California, United States Waymo Full time

    **About Us**Waymo is a leading autonomous driving technology company dedicated to improving access to mobility while saving lives. Our mission is to be the most trusted driver, and we're committed to developing the world's most experienced driver - The Waymo DriverTM.We're seeking an exceptional Senior Machine Learning Engineer, Training to join our Hybrid...


  • Mountain View, California, United States Databricks Full time

    This role requires an experienced software engineer who can develop and maintain complex software systems. The ideal candidate will have a strong background in computer science, experience with distributed systems, and a passion for innovation. In addition to a competitive salary, Databricks offers a comprehensive benefits package, including health...


  • Mountain View, California, United States Qualified Technical Services Full time

    Job Summary:">We are seeking a highly skilled Software Systems Engineer to join our team at NASA Ames Research Center in Mountain View, CA. This role involves developing the core infrastructure for autonomous coordination between spacecraft and ground applications.">About the Project:">The Distributed Spacecraft Autonomy project is centered out of NASA Ames...


  • Mountain View, California, United States Waymo Full time

    Job OverviewWaymo is a leading autonomous driving technology company with the mission to become the most trusted driver. With its roots as the Google Self-Driving Car Project in 2009, Waymo has focused on developing the Waymo Driver, the world's most experienced driver, to improve access to mobility while saving thousands of lives lost to traffic crashes.The...