Senior Software Engineer, Deep Learning Inference Specialist

3 weeks ago


Santa Clara, California, United States NVIDIA Full time

Job Summary

NVIDIA is seeking a highly skilled Senior Software Engineer to join our TensorRT team in developing industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer, you will be responsible for designing and implementing inference optimizations to enable real-time AI applications on personal computing devices with NVIDIA AI accelerators (GPU, DLA).

Key Responsibilities

  • Design, implement, and optimize TensorRT components to achieve tightly coordinated and responsive Generative AI inference applications for PCs and workstations.
  • Develop software in C++, Python, CUDA, and DirectML to accelerate systems that enable seamless and efficient deployment of next-gen AI models.
  • Collaborate with deep learning experts and GPU architects throughout the company.

Requirements

  • BS, MS, PhD, or equivalent experience in Computer Science, Computer Engineering, or a related field.
  • 5+ years of software development experience on a large codebase or project.
  • Strong proficiency in C++ and Python programming languages.
  • Experience with development of Deep Learning Frameworks, Compilers, or System Software.
  • Foundational knowledge of Machine Learning techniques or GPU optimizations.
  • Excellent problem-solving skills and the ability to learn and work effectively in a fast-paced, collaborative environment.
  • Strong communication skills and the ability to articulate complex technical concepts.

Preferred Qualifications

  • Experience developing a DirectML backend for GPUs or NPUs.
  • Experience with Windows application and middleware development using the DirectX or DirectML APIs.
  • Knowledge of GPU programming using CUDA or OpenCL.
  • Experience with deploying AI models in production environments.
  • Knowledge of additional performance optimization tools and techniques, plus contributions to open-source projects or publications in relevant areas.

About NVIDIA

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.



  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Engineer to join our Deep Learning Inference Workflows team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference. Key Responsibilities: Develop graph parsers, optimizers, and tools for effective deployment of...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference. Key Responsibilities: Develop graph parsers, optimizers, and tools for effective deployment of trained deep...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies. We conduct applied research...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to develop and productize NVIDIA's deep learning solutions in autonomous driving vehicles. The ideal candidate will have a strong background in deep learning and experience with NVIDIA's software libraries, including CUDA and TensorRT. As a member of our Solution Engineering-Automotive Deep...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution. As part of the team, you will play a pivotal role in architecting and designing a modular and scalable software platform to provide an excellent user experience with broad model support and optimization...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the field of deep learning, and we're seeking a talented Senior System Software Engineer to join our team. As a key member of our deep learning software engineering team, you will be responsible for optimizing deep learning algorithms for training and inference, as well as integrating them into existing systems and tools. Key...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Deep Learning Software Engineer to join our team at NVIDIA. As a member of our Solution Engineering-Automotive Deep Learning team, you will apply groundbreaking NVIDIA deep learning model training/inference software libraries for deployment on NVIDIA's hardware architecture. Key Responsibilities: Develop new deep learning...


  • Santa Clara, California, United States NVIDIA Full time

    Senior Performance Software Engineer for Deep Learning Libraries. We are seeking a highly skilled Senior Performance Software Engineer to join our team in developing optimized code for NVIDIA GPUs. As a key member of our Deep Learning Library team, you will be responsible for writing highly tuned compute kernels in C++ CUDA to perform core deep learning...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Performance Software Engineer to join our Deep Learning Library team at NVIDIA. This role involves developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA GPUs. The team delivers high-performance code to NVIDIA's cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models. Key...


  • Santa Clara, California, United States XPENG Motors Full time

    Job Title: Senior Deep Learning Engineer. About the Role: We are seeking a highly skilled Senior Deep Learning Engineer to join our team at XPeng Motors. As a key member of our autonomous driving team, you will be responsible for researching, implementing, and evaluating deep-learning-based methods for prediction and planning problems. Responsibilities: *...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to join our Solution Engineering-Automotive Deep Learning team. As a key member of our team, you will be responsible for developing and productizing NVIDIA's deep learning solutions in autonomous driving vehicles. Key responsibilities include: Developing new deep learning architectures and...


  • Santa Clara, California, United States NVIDIA Full time

    Join Our Team of Innovators. We're seeking a highly skilled Senior Deep Learning Software Engineer to join our cuDNN team at NVIDIA. As a key member of our team, you'll play a crucial role in designing, building, and shipping our GPU-accelerated library of primitives for deep neural networks. About the Role: As a Senior Deep Learning Software Engineer, you'll be...


  • Santa Clara, California, United States NVIDIA Full time

    Job Description: NVIDIA's Deep Learning Architecture and Libraries Group is seeking a software engineer to contribute to the development of our performance analysis capabilities. As a member of our team, you will collaborate with GPU architects, CUDA developers, and deep learning performance engineers to design and implement innovative approaches to hardware...


  • Santa Clara, California, United States NVIDIA Full time

    About the Role: NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services. The right person for this role brings technical drive and creativity to change the way...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services. The ideal candidate will bring technical drive and creativity to change the way NVIDIA provides high-performance inferencing for every AI model. NIM offerings are easy to use,...


  • Santa Clara, California, United States NVIDIA Full time

    We're seeking a highly skilled Senior Deep Learning Software Engineer to join our cuDNN team at NVIDIA. As a key member of our team, you will be responsible for designing, building, and shipping cuDNN, our GPU-accelerated library of primitives for deep neural networks. Key Responsibilities: Develop production-quality software that ships as part of NVIDIA's AI...


  • Santa Clara, California, United States NVIDIA Full time

    We're seeking a Senior Deep Learning Software Engineer to join our cuDNN team. Do you enjoy crafting software systems to solve complex problems and writing fast code? We're looking for skilled software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. The era of AI has begun, and we're powering...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Deep Learning Algorithm Engineer to join our team at NVIDIA. As a key member of our team, you will be responsible for optimizing deep learning training workloads on state-of-the-art hardware and software platforms. Key Responsibilities: Understand, analyze, profile, and optimize deep learning training workloads on state-of-the-art...


  • Santa Clara, California, United States NVIDIA Full time

    Job Description: We are seeking a highly skilled Senior System Software Engineer to join our team at NVIDIA. As a key member of our GPU-accelerated deep learning software team, you will be responsible for designing and implementing infrastructure solutions for our Triton Inference Server. Key Responsibilities: Design and implement continuous integration,...