Senior Software Engineer for Deep Learning Inference Workflows

4 weeks ago


Santa Clara, California, United States NVIDIA Full time

We are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.

Key Responsibilities:

  • Develop graph parsers, optimizers, and tools for effective deployment of trained deep learning models using C++ and Python.
  • Collaborate with teams of deep learning experts, GPU architects, and DevOps engineers across diverse teams.

Requirements:

  • Bachelor's, Master's, PhD, or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering, or related field.
  • 6+ years of software development experience.
  • Strong experience with C++11/C++14/C++17.
  • Strong grasp of Machine Learning concepts, especially Natural Language Processing.
  • Excellent communication skills, and an aptitude for collaboration and teamwork.

Preferred Qualifications:

  • Experience developing System Software.
  • Proficiency in Python, as well as background in GPU kernel programming using CUDA or OpenCL.
  • Experience in software performance benchmarking, profiling, and optimizations.
  • Background in compiler development.
  • Experience in working with TensorRT, PyTorch, TensorFlow, ONNX Runtime, or other ML frameworks.

NVIDIA is a leader in the technology industry, and we are committed to fostering a diverse and inclusive work environment. We are an equal opportunity employer and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

The base salary range for this position is $180,000 - $339,250 USD, and you will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.



  • Santa Clara, California, United States NVIDIA Full time

    We are looking for a Senior Software Engineer to build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs. The ideal candidate will have strong experience with C++11/C++14/C++17 and a strong grasp of Machine Learning concepts, especially Natural Language Processing.Key...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities:Develop graph parsers, optimizers, and tools for effective deployment of trained deep...


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryNVIDIA is seeking a highly skilled Senior Software Engineer to join our TensorRT team in developing industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer, you will be responsible for designing and implementing inference optimizations to enable real-time AI applications on personal computing...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies.We conduct applied research...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to develop and productize NVIDIA's deep learning solutions in autonomous driving vehicles.The ideal candidate will have a strong background in deep learning and experience with NVIDIA's software libraries, including CUDA and TensorRT.As a member of our Solution Engineering-Automotive Deep...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Deep Learning Software Engineer to join our team at NVIDIA. As a member of our Solution Engineering-Automotive Deep Learning team, you will apply groundbreaking NVIDIA deep learning model training/inference software libraries for deployment on NVIDIA's hardware architecture.Key Responsibilities:Develop new deep learning...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Performance Software Engineer to join our Deep Learning Library team at NVIDIA.This role involves developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA GPUs. The team delivers high-performance code to NVIDIA's cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models.Key...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Deep Learning Algorithm Engineer to join our team at NVIDIA. As a key member of our team, you will be responsible for optimizing deep learning training workloads on state-of-the-art hardware and software platforms.Key Responsibilities:Understand, analyze, profile, and optimize deep learning training workloads on state-of-the-art...


  • Santa Clara, California, United States Nvidia Full time

    Job DescriptionNVIDIA's Deep Learning Architecture and Libraries Group is seeking a software engineer to contribute to the development of our performance analysis capabilities. As a member of our team, you will collaborate with GPU architects, CUDA developers, and deep learning performance engineers to design and implement innovative approaches to hardware...


  • Santa Clara, California, United States XPENG Motors Full time

    Job Title: Senior Deep Learning EngineerAbout the Role:We are seeking a highly skilled Senior Deep Learning Engineer to join our team at XPeng Motors. As a key member of our autonomous driving team, you will be responsible for researching, implementing, and evaluating deep-learning-based methods for prediction and planning problems.Responsibilities:*...


  • Santa Clara, California, United States NVIDIA Full time

    About the RoleNVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.The right person for this role brings technical drive and creativity to change the way...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.The ideal candidate will bring technical drive and creativity to change the way NVIDIA provides high-performance inferencing for every AI model.NIM offerings are easy to use,...


  • Santa Clara, California, United States NVIDIA Full time

    We're seeking a Senior Deep Learning Software Engineer to join our cuDNN teamDo you enjoy crafting software systems to solve complex problems and writing fast code? We're looking for skilled software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. The era of AI has begun, and we're powering...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Deep Learning Software Engineer to join our team and contribute to the development of cutting-edge model optimization algorithms.As a Senior Deep Learning Software Engineer, you will be responsible for researching and developing highly efficient search algorithms, defining public APIs, and implementing various software...


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryWe're seeking a highly skilled Senior Deep Learning Software Engineer to join our cuDNN team. As a key member of our team, you will be responsible for designing, building, and shipping cuDNN, a GPU-accelerated library of primitives for deep neural networks.Key Responsibilities- Develop production-quality software that ships as part of NVIDIA's AI...


  • Santa Clara, California, United States NVIDIA Full time

    Job DescriptionNVIDIA is seeking a highly skilled Senior Systems Software Engineer to join our TAO Toolkit Deep Learning Architectures team. As a key member of our software team, you will be responsible for developing and implementing cutting-edge deep learning algorithms and solutions.Key Responsibilities:Architect, analyze, develop, and prototype key deep...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a senior engineer to design and build a factory automation pipeline for NVIDIA Inference Microservices (NIMs). The right person for this role brings technical drive and creativity to change the way NVIDIA optimizes and serves performant inferencing for every AI model.The NIM offerings are easy to use, highly performant, and tested in all...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Architect to lead the development and deployment of NVIDIA Inference Microservices (NIM) blueprints. NIM Agent Blueprints are reference workflows for canonical generative AI use cases. Enterprises can build and operationalize custom AI applications using NIM Agent Blueprints along with NVIDIA NIM microservices...


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryWe are seeking a skilled engineer to join our team and help shape the future of agentic inference systems. As a Senior LLM Research Engineer, you will play a critical role in improving the algorithmic performance and efficiency of large language models.Responsibilities:Research and development of contemporary research on generative AI, agents, and...


  • Santa Clara, California, United States NVIDIA Full time

    At NVIDIA, we're building software that will be used by the entire world. As a Senior MLOps Engineer, Deep Learning Algorithms, you'll work with high-class software engineers to implement a large-scale toolset that tests deep learning models and frameworks on the most powerful computers.The ability to work in a multifaceted, fast-paced environment is...