Senior Software Engineer, Deep Learning Inference
3 weeks ago
At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world’s most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators.
As a Senior Software Engineer in the TensorRT team, you will be responsible for designing and implementing inference optimizations to enable real-time AI applications on personal computing devices with NVIDIA AI accelerators (GPU, DLA). You will work closely with cross-functional teams to integrate and deploy AI solutions in production environments, and your expertise will help shape the performance, functionality, and efficiency of our AI models and systems. If you're ready to take on challenging projects and make a significant impact in a company that values creativity, excellence, and collaboration, we want to hear from you
What you’ll be doing:
- Design, implement and optimize TensorRT components to achieve tightly coordinated and responsive Generative AI inference applications for PCs and workstations.
- Develop software in C++, Python, CUDA, and DirectML to accelerate systems that enable seamless and efficient deployment of next-gen AI models.
- Collaborate with deep learning experts and GPU architects throughout the company.
What we need to see:
- BS, MS, PhD or equivalent experience in Computer Science, Computer Engineering or a related field.
- 5+ years of software development experience on a large codebase or project.
- Strong proficiency in C++ and Python programming languages.
- Experience with development of: Deep Learning Frameworks, Compilers, or System Software.
- Foundational knowledge of Machine Learning techniques or GPU optimizations.
- Excellent problem-solving skills and the ability to learn and work effectively in a fast-paced, collaborative environment.
- Strong communication skills and the ability to articulate complex technical concepts.
Ways to stand out from the crowd:
- Experience in developing DirectML backend for GPU or NPU.
- Windows application and middleware development using DirectX or DirectML API.
- Knowledge of GPU programming using CUDA or OpenCL.
- Experience with deploying AI models in production environments.
- Knowledge of additional performance optimization tools and techniques as well as contributions to open-source projects or publications in relevant areas.
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#J-18808-Ljbffr-
Santa Clara, United States NVIDIA Full timeWe are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering...
-
Santa Clara, California, United States NVIDIA Full timeWe are looking for a Senior Software Engineer to build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs. The ideal candidate will have strong experience with C++11/C++14/C++17 and a strong grasp of Machine Learning concepts, especially Natural Language Processing.Key...
-
Santa Clara, United States NVIDIA Full timeAre you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA’s TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling support in TensorRT for an evolving landscape of ground-breaking hardware capabilities. Your expertise will help shape the performance and...
-
Santa Clara, United States NVIDIA Full timeWe are now looking for a Senior Performance Software Engineer for Deep Learning Libraries! Do you enjoy tuning parallel algorithms and analyzing their performance? If so, we want to hear from you! As a deep learning library performance software engineer, you will be developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA...
-
Senior Deep Learning Software Engineer
5 days ago
Santa Clara, United States NVIDIA Full timeWe are now looking for a Senior Deep Learning Software Development Engineer! Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to speech recognition and natural language processing. By tapping into the unlimited potential of AI to define the next...
-
Senior Software Test Development Engineer
3 weeks ago
Santa Clara, United States NVIDIA Corporation Full timeSenior Software Test Development Engineer - Deep Learning page is loaded Senior Software Test Development Engineer - Deep Learning Apply locations US, CA, Santa Clara time type Full time posted on Posted 30+ Days Ago job requisition id JR1987150 We are looking for a Software Test development engineer in NVIDIA’s Deep...
-
Senior Deep Learning Software Engineer, cuDNN
4 weeks ago
Santa Clara, United States NVIDIA Full timeWe're now looking for a Senior Deep Learning Software Engineer for our cuDNN team!Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI...
-
Santa Clara, CA, United States NVIDIA Full timeWe are now looking for a Senior Performance Software Engineer for Deep Learning Libraries! Do you enjoy tuning parallel algorithms and analyzing their performance? If so, we want to hear from you! As a deep learning library performance software engineer, you will be developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA...
-
Senior Performance Engineer
4 weeks ago
Santa Clara, United States NVIDIA Full timeNVIDIA is hiring senior software engineers to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. We are an ambitious, forward-thinking and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks - PyTorch, JAX and...
-
Senior Performance Engineer
3 weeks ago
Santa Clara, United States NVIDIA Full timeNVIDIA is hiring senior software engineers to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. We are an ambitious, forward-thinking and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks - PyTorch, JAX and...
-
Senior Deep Learning Software Engineer, cuDNN
2 months ago
US, CA, Santa Clara NVIDIA Full timeWe're now looking for a Senior Deep Learning Software Engineer for our cuDNN team!Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI...
-
Solutions Architect, Generative AI
1 week ago
Santa Clara, United States NVIDIA Full timeNVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company...
-
Senior Deep Learning Software Engineer, cuDNN
3 weeks ago
Santa Clara, CA, United States NVIDIA Full timeWe're now looking for a Senior Deep Learning Software Engineer for our cuDNN team!Do you love writing fast code and crafting software systems to solve complex problems? We are looking for hardworking software engineers to help design, build, and ship cuDNN: our GPU-accelerated library of primitives for deep neural networks. Intelligent machines powered by AI...
-
Performance Engineer
3 weeks ago
Santa Clara, United States NVIDIA Full timeNVIDIA is hiring software engineers at all experience levels to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. This position will embed you in an ambitious and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks...
-
Performance Engineer
3 weeks ago
Santa Clara, United States NVIDIA Full timeNVIDIA is hiring software engineers at all experience levels to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. This position will embed you in an ambitious and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks...
-
Performance Engineer
3 weeks ago
Santa Clara, United States NVIDIA Full timeNVIDIA is hiring software engineers at all experience levels to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. This position will embed you in an ambitious and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks...
-
Senior Systems Software Engineer
1 month ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking a senior engineer to design and build a factory automation pipeline for NVIDIA Inference Microservices (NIMs). The right person for this role brings technical drive and creativity to change the way NVIDIA optimizes and serves performant inferencing for every AI model.The NIM offerings are easy to use, highly performant, and tested in all...
-
Senior Performance Engineer
3 weeks ago
Santa Clara, CA, United States NVIDIA Full timeNVIDIA is hiring senior software engineers to build and optimize the tools Deep Learning engineers use across the world to design, develop, and deploy AI applications. We are an ambitious, forward-thinking and diverse team that influences all areas of NVIDIA's AI platform and directly contributes to premiere Deep Learning frameworks - PyTorch, JAX and...
-
Senior LLM Research Engineer
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeJob SummaryWe are seeking a skilled engineer to join our team and help shape the future of agentic inference systems. As a Senior LLM Research Engineer, you will play a critical role in improving the algorithmic performance and efficiency of large language models.Responsibilities:Research and development of contemporary research on generative AI, agents, and...
-
Director, Software Engineering
4 weeks ago
Santa Clara, United States NVIDIA Full timeAre you ready to usher in the new world of Artificial Intelligence? Do you want to build the rockets launching the AI revolution? We are seeking a Director of Software Engineering for building a GPU accelerated software platform for inference applications. The right candidate for this role brings a mix of humanity and technical talent to provide the drive...