Senior Software Engineer, Deep Learning Inference

3 weeks ago


Santa Clara, United States NVIDIA Full time

Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA’s TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling support in TensorRT for an evolving landscape of ground-breaking hardware capabilities. Your expertise will help shape the performance and functionality of our products, ensuring NVIDIA remains synonymous with innovation. If you're ready to tackle challenging projects, push the boundaries of AI, and make a significant impact in a company that values creativity, excellence, and teamwork, we want to hear from you

What you'll be doing:

  1. Orchestrate the integration of new hardware functionalities into TensorRT's compiler and runtime.
  2. Work closely with teams and stakeholders across the whole hardware and software stack to understand and leverage new features to improve TensorRT’s functionality and performance.
  3. Guide the design and implementation of robust, high-quality C++ code in alignment with Modern C++ standards.
  4. Contribute to the continuous improvement of software practices and processes within the team.

What we need to see:

  1. Masters, PhD, or equivalent experience in relevant fields (Computer Engineering, Computer Science, Electrical Engineering, AI).
  2. At least 12 years of relevant software development experience.
  3. Strong C++ skills, including knowledge of and application of best practices with C++11 and C++14.
  4. Familiarity with deep learning concepts and frameworks.
  5. A track record of taking initiative and driving projects to completion.
  6. Excellent interpersonal skills and a collaborative, pragmatic approach to solving problems.

Ways to stand out from the crowd:

  1. Proficiency with Python and/or CUDA, ideally with experience in a professional environment.
  2. Background with systems programming, embedded systems, and/or compiler development.
  3. Experience in software performance benchmarking, profiling, and optimizations.
  4. Experience with state-of-the-art deep learning models (such as Large Language Models) & frameworks for inference.
  5. Background with C++17.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, come join our team

#LI-Hybrid

The base salary range is 220,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr

  • Santa Clara, United States NVIDIA Full time

    We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering...


  • Santa Clara, California, United States NVIDIA Full time

    We are looking for a Senior Software Engineer to build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs. The ideal candidate will have strong experience with C++11/C++14/C++17 and a strong grasp of Machine Learning concepts, especially Natural Language Processing.Key...


  • Santa Clara, United States NVIDIA Full time

    At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world’s most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the...


  • Santa Clara, United States NVIDIA Full time

    We are now looking for a Senior Performance Software Engineer for Deep Learning Libraries! Do you enjoy tuning parallel algorithms and analyzing their performance? If so, we want to hear from you! As a deep learning library performance software engineer, you will be developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA...


  • Santa Clara, United States NVIDIA Full time

    We are now looking for a Senior Deep Learning Software Development Engineer! Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to speech recognition and natural language processing. By tapping into the unlimited potential of AI to define the next...


  • Santa Clara, United States NVIDIA Corporation Full time

    Senior Software Test Development Engineer - Deep Learning page is loaded Senior Software Test Development Engineer - Deep Learning Apply locations US, CA, Santa Clara time type Full time posted on Posted 30+ Days Ago job requisition id JR1987150 We are looking for a Software Test development engineer in NVIDIA’s Deep...


  • Santa Clara, United States NVIDIA Full time

    We're now looking for a Senior Deep Learning Software Engineer for our cuDNN team!Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI...


  • Santa Clara, CA, United States NVIDIA Full time

    We are now looking for a Senior Performance Software Engineer for Deep Learning Libraries! Do you enjoy tuning parallel algorithms and analyzing their performance? If so, we want to hear from you! As a deep learning library performance software engineer, you will be developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA is hiring senior software engineers to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. We are an ambitious, forward-thinking and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks - PyTorch, JAX and...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA is hiring senior software engineers to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. We are an ambitious, forward-thinking and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks - PyTorch, JAX and...


  • US, CA, Santa Clara NVIDIA Full time

    We're now looking for a Senior Deep Learning Software Engineer for our cuDNN team!Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company...


  • Santa Clara, CA, United States NVIDIA Full time

    We're now looking for a Senior Deep Learning Software Engineer for our cuDNN team!Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI...

  • Performance Engineer

    3 weeks ago


    Santa Clara, United States NVIDIA Full time

    NVIDIA is hiring software engineers at all experience levels to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. This position will embed you in an ambitious and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks...

  • Performance Engineer

    3 weeks ago


    Santa Clara, United States NVIDIA Full time

    NVIDIA is hiring software engineers at all experience levels to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. This position will embed you in an ambitious and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks...

  • Performance Engineer

    3 weeks ago


    Santa Clara, United States NVIDIA Full time

    NVIDIA is hiring software engineers at all experience levels to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. This position will embed you in an ambitious and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a senior engineer to design and build a factory automation pipeline for NVIDIA Inference Microservices (NIMs). The right person for this role brings technical drive and creativity to change the way NVIDIA optimizes and serves performant inferencing for every AI model.The NIM offerings are easy to use, highly performant, and tested in all...


  • Santa Clara, CA, United States NVIDIA Full time

    NVIDIA is hiring senior software engineers to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. We are an ambitious, forward-thinking and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks - PyTorch, JAX and...


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryWe are seeking a skilled engineer to join our team and help shape the future of agentic inference systems. As a Senior LLM Research Engineer, you will play a critical role in improving the algorithmic performance and efficiency of large language models.Responsibilities:Research and development of contemporary research on generative AI, agents, and...


  • Santa Clara, United States NVIDIA Full time

    Are you ready to usher in the new world of Artificial Intelligence? Do you want to build the rockets launching the AI revolution? We are seeking a Director of Software Engineering for building a GPU accelerated software platform for inference applications. The right candidate for this role brings a mix of humanity and technical talent to provide the drive...