Machine Learning Engineer, ML Runtime

2 weeks ago


Fremont, CA, United States Pony.ai Inc. Full time

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services across a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai's leading position has been recognized, with CNBC ranking Pony.ai #10 on its CNBC Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural XB100 2023 list of the world's top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public on NASDAQ in November 2024.

Responsibilities

The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring.

As a Machine Learning Engineer in ML Runtime & Optimization, you will develop technologies to accelerate the training and inference of the AI models in autonomous driving systems. This includes:

  • Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures.
  • Collaborating closely with diverse groups at Pony.ai, including both hardware and software teams, to optimize and craft core parallel algorithms and to influence the next-generation compute platform architecture design and software infrastructure.
  • Applying model optimization and efficient deep learning techniques to models and optimized ML operator libraries.
  • Working across the entire ML framework/compiler stack (e.g., Torch, CUDA, and TensorRT) and on system-efficient deep learning models.

Minimum Requirements

  • BS, MS, or Ph.D. in computer science, electrical engineering, or a related discipline.
  • Strong programming skills in C/C++ or Python.
  • Experience in model optimization, quantization or other efficient deep learning techniques.
  • Good understanding of hardware performance, including CPU or GPU execution models, threads, registers, caches, and cost/performance trade-offs.
  • Experience with profiling, benchmarking and validating performance for complex computing architectures.
  • Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks.
  • Strong communication skills and ability to work cross-functionally between software and hardware teams.

Preferred Qualifications

Experience in one or more of the following areas is preferred:

  • Experience with parallel programming, ideally CUDA, OpenCL or OpenACC.
  • Experience in computer vision, machine learning and deep learning.
  • Strong knowledge of software design, programming techniques and algorithms.
  • Good knowledge of common deep learning frameworks and libraries.
  • Deep knowledge of system performance, GPU optimization, or ML compilers.

Compensation and Benefits

Base Salary Range: $140,000 - $250,000 annually. Compensation may vary outside of this range depending on many factors, including the candidate's qualifications, skills, competencies, experience, and location. Base pay is one part of total compensation, and this role may be eligible for bonuses/incentives and restricted stock units.

We also provide the following benefits to eligible employees:

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (Traditional and Roth 401k)
  • Life Insurance (Basic, Voluntary & AD&D)
  • Paid Time Off (Vacation & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Free Food & Snacks
