Machine Learning Infrastructure Developer

3 days ago


Mountain View, California, United States Waymo Full time
About the Role

We're looking for a highly skilled Senior Machine Learning Engineer, Training to join our Waymo ML Infrastructure team. In this role, you'll develop infrastructure components for distributed training and implement automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure.

This Hybrid role reports to our TLM of Machine Learning Training and requires:

  • Developing the necessary infrastructure components for distributed training
  • Implementing automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure
  • Monitoring system health, diagnosing, and performing routine maintenance tasks to ensure the reliability of the distributed training infrastructure
  • Identifying performance bottlenecks and optimization opportunities
  • Improving the developer experience and performance of our scalable ML framework

Your Background

  • Bachelor's degree in Computer Science, Engineering, or related field, or 4+ years equivalent experience
  • Experience building distributed systems for production environments
  • Solid Python or C++ skills
  • Prior experience with Machine Learning frameworks (e.g., TensorFlow, PyTorch) and distributed training algorithms

Why Join Us?

  • Competitive salary range: $192,000 - $243,000 USD
  • Discretionary annual bonus program
  • Equity incentive plan
  • Generous Company benefits program


  • Mountain View, California, United States NewsBreak Full time

    About NewsBreakNewsBreak is revolutionizing the way users interact with local news and their communities by bridging local users, content creators, and businesses.We foster safer, more vibrant, and authentically connected lives through robust collaborations with thousands of local publishers and businesses across the nation.Our MissionWe are redefining the...


  • Mountain View, California, United States NewsBreak Full time

    NewsBreak is redefining the way users interact with local news and their communities. Our mission is to foster safer, more vibrant, and authentically connected lives by bridging local users, content creators, and businesses.We are looking for a talented Machine Learning Infrastructure Developer to join our team. As a key member of our infrastructure team,...


  • Mountain View, California, United States Waymo Full time

    About UsAt Waymo, we're dedicated to improving access to mobility while saving thousands of lives. Our innovative Waymo Driver has provided over one million rider-only trips, driven tens of millions of miles on public roads and simulated tens of billions of miles in simulation.Role SummaryThis Hybrid role involves developing infrastructure components for...


  • Mountain View, California, United States NewsBreak Full time

    Company Overview:NewsBreak is a pioneering local news app that has revolutionized the way users interact with their communities. Founded in 2015, the company has established itself as the nation's premier local news provider, bridging local users, content creators, and businesses across the nation.The company's headquarters is located in the tech hub of...


  • Mountain View, California, United States Waymo Full time

    **About Us**Waymo is a leading autonomous driving technology company dedicated to improving access to mobility while saving lives. Our mission is to be the most trusted driver, and we're committed to developing the world's most experienced driver - The Waymo DriverTM.We're seeking an exceptional Senior Machine Learning Engineer, Training to join our Hybrid...


  • Mountain View, California, United States Waymo Full time

    Job TitleSenior Machine Learning Engineer, TrainingAbout the RoleWe are seeking a Senior Machine Learning Engineer to join our Hybrid team at Waymo. As a key member of our ML Infrastructure team, you will be responsible for developing the infrastructure components necessary for distributed training, implementing automation solutions for provisioning,...


  • Mountain View, California, United States Tik Tok Full time

    Job SummaryThe Machine Learning Infrastructure Specialist will be responsible for designing and implementing the infrastructure for TikTok's machine learning models. This role requires expertise in distributed systems, data engineering, and cloud computing.Key ResponsibilitiesDesign and develop scalable data pipelines for machine learning model training and...


  • Mountain View, California, United States Nuro Full time

    **About Us**">Nuro is a robotics company that aims to improve everyday life through innovative technologies. Founded in 2016, we have spent years developing autonomous driving (AD) technology and commercializing AD applications. Our world-class autonomous driving system, the Nuro DriverTM, combines AD hardware with our AI-first self-driving software.We've...


  • Mountain View, California, United States Tik Tok Full time

    About Our Team We are a dynamic team of engineers working on building a scalable and secure machine learning infrastructure for TikTok. Our team is passionate about pushing the boundaries of AI innovation and is committed to delivering high-quality solutions that meet the needs of our users.Job Description We are seeking a talented Machine Learning Engineer...


  • Mountain View, California, United States NewsBreak Full time

    About UsAt NewsBreak, we are revolutionizing the way users interact with local news and their communities. Our mission is to foster safer, more vibrant, and authentically connected lives through robust collaborations with thousands of local publishers and businesses across the nation.We proudly stand as the nation's premier local news app, with our...


  • Mountain View, California, United States Nuro Full time

    Nuro is a pioneering robotics company dedicated to enhancing everyday life through innovative technology. Founded in 2016, we have spent years developing autonomous driving (AD) solutions and commercializing AD applications. Our world-class Nuro DriverTM combines AD hardware with AI-first self-driving software, built to learn and improve through data.We've...


  • Mountain View, California, United States Waymo Full time

    **Job Description**In this position, you will be responsible for designing, implementing, and optimizing the distributed training infrastructure for our machine learning models. You will work closely with cross-functional teams to develop solutions that improve the scalability, reliability, and performance of our ML frameworks.


  • Mountain View, California, United States Moveworks Full time

    At Moveworks, we're revolutionizing the way businesses interact with AI. As a Senior Machine Learning Infrastructure Specialist, you'll play a critical role in building and scaling our cutting-edge ML infrastructure.">We're looking for an expert in machine learning who can design, build, and optimize scalable ML infrastructure to support training,...


  • Mountain View, California, United States Moveworks Full time

    Job DescriptionWe are seeking a highly skilled Senior Machine Learning Infrastructure Specialist to join our team at Moveworks. As a critical member of our AI infrastructure team, you will play a key role in building and optimizing cutting-edge machine learning systems for large language models.In this position, you will work closely with our...


  • Mountain View, California, United States CV Library Full time

    Job OverviewWe are seeking a highly skilled Cloud Infrastructure Specialist to join our team at CV Library. This is a 12+ month contract opportunity that requires expertise in machine learning infrastructure, cloud platforms, and containerization technologies.Key Responsibilities:Design and implement scalable machine learning infrastructure on Google Cloud...


  • Mountain View, California, United States iSoftTek Solutions Inc Full time

    Job SummaryWe are seeking a highly skilled Senior Cloud Machine Learning Infrastructure Specialist to join our team at iSoftTek Solutions Inc in Mountain View, CA.This long-term W2 opportunity requires strong experience in machine learning infrastructure and cloud platforms such as GCP. Proficiency in programming languages like Python and Java is essential....


  • Mountain View, California, United States Waymo Full time

    About WaymoWaymo is an autonomous driving technology company dedicated to developing the world's most advanced driver.The Waymo Driver, our self-driving system, has been autonomously driving tens of millions of miles on public roads and simulating billions of miles in virtual environments across 13+ U.S. states.Job DescriptionWe are seeking a skilled...


  • Mountain View, California, United States Syntiant Full time

    We are seeking a talented Machine Learning Software Developer to join our team at Syntiant Corp. The ideal candidate will have a strong background in C++ and experience working with machine learning algorithms.The successful candidate will be responsible for developing and maintaining large-scale deployable software projects, including infrastructure code,...


  • Mountain View, California, United States Tik Tok Full time

    About the Role">TikTok is seeking a talented Machine Learning Engineer - Model Training Infrastructure to join our AML team. As a key member of this team, you will be responsible for designing and implementing a global-scale machine learning system for feeds, ads, and search ranking models.">Key Responsibilities">">Design and implement a global-scale machine...


  • Mountain View, California, United States Tik Tok Full time

    Job SummaryTikTok is seeking a skilled Machine Learning Engineer - Machine Learning Infrastructure to join our AML team in the United States. As a key member of our global organization, you will be responsible for designing and implementing a next-generation AI infrastructure and recommendation platform for ads ranking, search ranking, and live & ecom...