Senior Software Engineer for Deep Learning Inference Workflows

3 weeks ago


Santa Clara, California, United States NVIDIA Full time

We are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.

Key Responsibilities:

  • Develop graph parsers, optimizers, and tools for effective deployment of trained deep learning models using C++ and Python.
  • Collaborate with teams of deep learning experts, GPU architects, and DevOps engineers across diverse teams.

Requirements:

  • Bachelor's, Master's, PhD, or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, or related field.
  • 6+ years of software development experience.
  • Strong experience with C++11/C++14/C++17.
  • Strong grasp of Machine Learning concepts, especially Natural Language Processing.
  • Excellent communication skills and an aptitude for collaboration and teamwork.

Preferred Qualifications:

  • Experience developing System Software.
  • Proficiency in Python and background in GPU kernel programming using CUDA or OpenCL.
  • Experience in software performance benchmarking, profiling, and optimizations.
  • Background in compiler development.
  • Experience working with TensorRT, PyTorch, TensorFlow, ONNX Runtime, or other ML frameworks.

NVIDIA is a leader in the technology industry and offers a dynamic work environment with opportunities for growth and development. We are committed to fostering a diverse and inclusive workplace and are proud to be an equal opportunity employer.

The base salary range for this position is $180,000 - $339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA accepts applications on an ongoing basis and is committed to fostering a diverse work environment.



  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Engineer to join our Deep Learning Inference Workflows team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities: Develop graph parsers, optimizers, and tools for effective deployment of...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities:Develop graph parsers, optimizers, and tools for effective deployment of trained deep...


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryNVIDIA is seeking a highly skilled Senior Software Engineer to join our TensorRT team in developing industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer, you will be responsible for designing and implementing inference optimizations to enable real-time AI applications on personal computing...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies.We conduct applied research...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to develop and productize NVIDIA's deep learning solutions in autonomous driving vehicles.The ideal candidate will have a strong background in deep learning and experience with NVIDIA's software libraries, including CUDA and TensorRT.As a member of our Solution Engineering-Automotive Deep...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution.As part of the team, you will play a pivotal role in architecting and designing a modular and scalable software platform to provide an excellent user experience with broad model support and optimization...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the field of deep learning, and we're seeking a talented Senior System Software Engineer to join our team. As a key member of our deep learning software engineering team, you will be responsible for optimizing deep learning algorithms for training and inference, as well as integrating them into existing systems and tools.Key...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Deep Learning Software Engineer to join our team at NVIDIA. As a member of our Solution Engineering-Automotive Deep Learning team, you will apply groundbreaking NVIDIA deep learning model training/inference software libraries for deployment on NVIDIA's hardware architecture.Key Responsibilities:Develop new deep learning...


  • Santa Clara, California, United States NVIDIA Full time

    Senior Performance Software Engineer for Deep Learning LibrariesWe are seeking a highly skilled Senior Performance Software Engineer to join our team in developing optimized code for NVIDIA GPUs. As a key member of our Deep Learning Library team, you will be responsible for writing highly tuned compute kernels in C++ CUDA to perform core deep learning...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Performance Software Engineer to join our Deep Learning Library team at NVIDIA.This role involves developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA GPUs. The team delivers high-performance code to NVIDIA's cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models.Key...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Deep Learning Algorithm Engineer to join our team at NVIDIA. As a key member of our team, you will be responsible for optimizing deep learning training workloads on state-of-the-art hardware and software platforms.Key Responsibilities:Understand, analyze, profile, and optimize deep learning training workloads on state-of-the-art...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to join our Solution Engineering-Automotive Deep Learning team. As a key member of our team, you will be responsible for developing and productizing NVIDIA's deep learning solutions in autonomous driving vehicles.Key responsibilities include:Developing new deep learning architectures and...


  • Santa Clara, California, United States NVIDIA Full time

    Join Our Team of InnovatorsWe're seeking a highly skilled Senior Deep Learning Software Engineer to join our cuDNN team at NVIDIA. As a key member of our team, you'll play a crucial role in designing, building, and shipping our GPU-accelerated library of primitives for deep neural networks.About the RoleAs a Senior Deep Learning Software Engineer, you'll be...


  • Santa Clara, California, United States Nvidia Full time

    Job DescriptionNVIDIA's Deep Learning Architecture and Libraries Group is seeking a software engineer to contribute to the development of our performance analysis capabilities. As a member of our team, you will collaborate with GPU architects, CUDA developers, and deep learning performance engineers to design and implement innovative approaches to hardware...


  • Santa Clara, California, United States XPENG Motors Full time

    Job Title: Senior Deep Learning EngineerAbout the Role:We are seeking a highly skilled Senior Deep Learning Engineer to join our team at XPeng Motors. As a key member of our autonomous driving team, you will be responsible for researching, implementing, and evaluating deep-learning-based methods for prediction and planning problems.Responsibilities:*...


  • Santa Clara, California, United States NVIDIA Full time

    About the RoleNVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.The right person for this role brings technical drive and creativity to change the way...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.The ideal candidate will bring technical drive and creativity to change the way NVIDIA provides high-performance inferencing for every AI model.NIM offerings are easy to use,...


  • Santa Clara, California, United States NVIDIA Full time

    We're seeking a highly skilled Senior Deep Learning Software Engineer to join our cuDNN team at NVIDIA. As a key member of our team, you will be responsible for designing, building, and shipping cuDNN, our GPU-accelerated library of primitives for deep neural networks.Key Responsibilities:Develop production-quality software that ships as part of NVIDIA's AI...


  • Santa Clara, California, United States NVIDIA Full time

    We're seeking a Senior Deep Learning Software Engineer to join our cuDNN teamDo you enjoy crafting software systems to solve complex problems and writing fast code? We're looking for skilled software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. The era of AI has begun, and we're powering...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Performance Software Engineer to join our team responsible for developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA GPUs.The ideal candidate will have a strong background in computer science, computer engineering, or a related field, and experience with performance-oriented...