Lead AI Systems Engineer

2 weeks ago


Santa Clara, California, United States NVIDIA Full time

NVIDIA is on the lookout for a skilled Lead AI Systems Engineer to become a vital part of our Autonomous Vehicles division. In this position, you will leverage artificial intelligence to enhance Autonomous Vehicle perception, contributing to the development of our cutting-edge autonomous driving technology. We seek an innovative and inquisitive engineer who is both proactive and meticulous, with a passion for uncovering solutions to complex challenges. NVIDIA boasts some of the most visionary and dedicated professionals globally. If this resonates with you, we would love to connect.

Key Responsibilities:

  • Optimizing our deep neural network (DNN) training framework: enhancing speed, scalability, and resource efficiency.
  • Architecting our training infrastructure to support simultaneous usage by multiple engineers across various tasks.
  • Curating datasets for training: constructing a horizontally scalable data preparation pipeline that is user-friendly and minimizes training delays.
  • Developing a high-throughput cloud inference pipeline for evaluation and key performance indicator (KPI) assessment.
  • Streamlining processes to facilitate the creation of verified, deployable artifacts from annotated datasets.
  • Creating tools for analysis and visualization to gain insights into performance and areas for improvement.
  • Driving efficiency improvements across training, data preparation, and cloud inference.
  • Collaborating closely with platform and perception DNN engineers, merging expertise in large-scale machine learning systems with in-depth knowledge of perception DNNs.

Qualifications:

  • Master’s or Doctorate in computer science or a related field, or equivalent experience.
  • Minimum of 3 years of relevant industry experience.
  • Proficiency in contemporary machine learning frameworks such as PyTorch.
  • Strong programming skills in C++, Python, and/or CUDA.
  • A commitment to software development excellence: following current standards and practices, writing unit tests and benchmarks, and continuously improving code quality.
  • A passion for optimization: skilled in writing efficient code, from high-level machine learning algorithms to low-level hardware utilization.
  • Strong communication and collaboration skills: ability to work effectively within a large ecosystem, serving both as a client and provider to other teams.

Preferred Qualifications:

  • Publications in the field of efficient machine learning (accelerating training and inference).
  • Experience in developing large-scale machine learning pipelines, particularly for autonomous vehicles.
  • Significant contributions to leading open-source projects in related areas.

Perception for autonomous technologies is one of the most exhilarating and demanding challenges today. Machine learning plays a crucial role, but to excel in this domain, we must master the fundamentals. Join the Perception ML Foundation team, where we combine expertise in machine learning, high-performance computing, and cloud computing to build the perception ML 'factory.' This factory makes the generation of perception ML models efficient, scalable, and user-friendly. It covers the essential ML workflows: data preparation, DNN training, production optimization, and large-scale cloud inference and evaluation.

We believe that creating this high-throughput factory necessitates diverse, interdisciplinary thinking and expertise: understanding modern perception architectures, machine learning optimization techniques, large-scale software systems, high-performance computing, and the hardware driving accelerated cloud computing, as well as MLOps and microservices. We refer to all these skills as ML foundation engineering. If this aligns with your expertise and passion, you may be the perfect fit for our team at NVIDIA.


