Senior Software Engineer, NVIDIA Inference Microservices
3 weeks ago
About the Role
NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.
The right person for this role brings technical drive and creativity to change the way NVIDIA provides high-performance inferencing for every AI model.
Our NIM offerings are easy to use, optimized for performance, and developed using a highly automated software factory. We create containers available for download and hosted services.
Key Responsibilities
- Design, build, and optimize containerized inference execution for a variety of applications, ensuring efficiency and scalability.
- Drive the adoption of a large number of optimized models through microservices developed in Python or Go.
- Ensure the performance and scalability of NIMs through comprehensive performance measurement and optimization.
- Collaborate, brainstorm, and improve the designs of inference solutions with a broad team of software engineers, researchers, SREs, and product management.
Requirements
- A history of using advanced programming skills to build distributed compute systems, backend services, microservices, and cloud technologies.
- Effective experience working with multi-functional teams, principals, and architects across organizational boundaries.
- Mentorship and the ability to grow teams and team members.
- Deep technical expertise in distributed containerized applications using Docker, Kubernetes, Cloud Endpoints, Helm, and Prometheus.
- Passion for building scalable and performant microservice applications.
- Excellent interpersonal skills and the flexibility to lead multi-functional efforts.
- Proven experience debugging and analyzing the performance of distributed microservices or cloud systems.
About NVIDIA
We are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and creative people in the world working for us. If you're creative and autonomous with a real passion for technology, we want to hear from you.
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits.
NVIDIA accepts applications on an ongoing basis.
We are committed to fostering a diverse work environment and proud to be an equal opportunity employer.
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.The ideal candidate will bring technical drive and creativity to change the way NVIDIA provides high-performance inferencing for every AI model.NIM offerings are easy to use,...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Architect to lead the development and deployment of NVIDIA Inference Microservices (NIM) blueprints. NIM Agent Blueprints are reference workflows for canonical generative AI use cases. Enterprises can build and operationalize custom AI applications using NIM Agent Blueprints along with NVIDIA NIM microservices...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Engineer to join our Deep Learning Inference Workflows team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities: Develop graph parsers, optimizers, and tools for effective deployment of...
-
Santa Clara, California, United States NVIDIA Full timeJob SummaryNVIDIA is seeking a highly skilled Senior Software Engineer to join our TensorRT team in developing industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer, you will be responsible for designing and implementing inference optimizations to enable real-time AI applications on personal computing...
-
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies.We conduct applied research...
-
Senior Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeJob SummaryNVIDIA is seeking a senior software engineer to design and build factory automation for NVIDIA Inference Microservices (NIMs). The ideal candidate will have a strong background in system software and platform layers, including kernel, device driver, memory, storage, networking, and PCIe devices. They will apply their technical expertise to design...
-
Senior Systems Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking a senior engineer to design and build a factory automation pipeline for NVIDIA Inference Microservices (NIMs). The right person for this role brings technical drive and creativity to change the way NVIDIA optimizes and serves performant inferencing for every AI model.The NIM offerings are easy to use, highly performant, and tested in all...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities:Develop graph parsers, optimizers, and tools for effective deployment of trained deep...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities:Develop graph parsers, optimizers, and tools for effective deployment of trained deep...
-
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a world-leader in high-speed computer vision, artificial intelligence, and deep learning. Our team builds the accelerated software ecosystem that enables visual AI developers to innovate swiftly and efficiently at scale.We are seeking an outstanding individual to help us build highly optimized microservice products and NVIDIA NIMs that bring visual...
-
Senior Deep Learning Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Deep Learning Software Engineer to develop and productize NVIDIA's deep learning solutions in autonomous driving vehicles.The ideal candidate will have a strong background in deep learning and experience with NVIDIA's software libraries, including CUDA and TensorRT.As a member of our Solution Engineering-Automotive Deep...
-
Senior Deep Learning Software Engineer
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution.As part of the team, you will play a pivotal role in architecting and designing a modular and scalable software platform to provide an excellent user experience with broad model support and optimization...
-
Santa Clara, California, United States Nvidia Full timeJob DescriptionWe are seeking a highly skilled Senior System Software Engineer to join our team at NVIDIA. As a key member of our GPU-accelerated deep learning software team, you will be responsible for designing and implementing infrastructure solutions for our Triton Inference Server.Key Responsibilities:Design and implement continuous integration,...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a Senior Performance Software Engineer to join our Deep Learning Library team at NVIDIA.This role involves developing optimized code to accelerate linear algebra and deep learning operations on NVIDIA GPUs. The team delivers high-performance code to NVIDIA's cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models.Key...
-
Santa Clara, California, United States NVIDIA Full timeSenior Performance Software Engineer for Deep Learning LibrariesWe are seeking a highly skilled Senior Performance Software Engineer to join our team in developing optimized code for NVIDIA GPUs. As a key member of our Deep Learning Library team, you will be responsible for writing highly tuned compute kernels in C++ CUDA to perform core deep learning...
-
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a leader in the field of deep learning, and we're seeking a talented Senior System Software Engineer to join our team. As a key member of our deep learning software engineering team, you will be responsible for optimizing deep learning algorithms for training and inference, as well as integrating them into existing systems and tools.Key...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Engineer to join our Generalist Embodied Agent Research (GEAR) team at NVIDIA. As a key member of our team, you will be responsible for developing robust AI solutions for general-purpose humanoid robots and embodied agents.Key Responsibilities:Work with world-class researchers to develop large-scale AI training...
-
Santa Clara, California, United States NVIDIA Full timeJob SummaryAs a Senior Software Product Manager for Nemo Microservices at NVIDIA, you will lead key pieces of our portfolio of software that enable state-of-the-art accuracy and performance for neural networks across data curation, model customization, and guard rails. You will engage with customers, define products and roadmaps, develop business strategy,...
-
Deep Learning Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Deep Learning Software Engineer to join our team at NVIDIA. As a member of our Solution Engineering-Automotive Deep Learning team, you will apply groundbreaking NVIDIA deep learning model training/inference software libraries for deployment on NVIDIA's hardware architecture.Key Responsibilities:Develop new deep learning...
-
Senior Full-Stack Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a leader in High-Performance Computing, Artificial Intelligence, and Visualization.The company is looking for a highly motivated Full-Stack Web Applications Engineer to join its dynamic and innovative Hardware Infrastructure team.The team is building new infrastructure to deliver highly available services to run scalable web services, development...