Senior Software Architect, NVIDIA Inference Microservices
3 weeks ago
We are seeking a highly skilled Senior Software Architect to lead the development and deployment of NVIDIA Inference Microservices (NIM) blueprints. NIM Agent Blueprints are reference workflows for canonical generative AI use cases. Enterprises can build and operationalize custom AI applications using NIM Agent Blueprints along with NVIDIA NIM microservices and NVIDIA NeMo framework, all part of the NVIDIA AI Enterprise Platform.
This role offers an outstanding opportunity to craft the future of AI at a fast-growing company at the forefront of the AI revolution. You'll work on the most powerful, enterprise-grade GPU clusters capable of hundreds of PetaFLOPS and gain early access to unreleased hardware, making a direct impact on NVIDIA's roadmap and the broader AI landscape.
Key Responsibilities:
- Design, build, and deploy NIM blueprints using NVIDIA Nemo and NVIDIA NIMs in a cloud native environment.
- Drive adoption and scale blueprint development by building reusable foundation blocks.
- Apply cloud native development and deployment expertise to create optimized patterns for NIM blueprints.
- Collaborate, brainstorm, and improve the designs of NIM blueprints with stakeholders from across the organization.
- Mentor and collaborate with team members and other teams to foster growth and development.
Requirements:
- AI applications and services experience.
- Cloud native software development and deployment experience.
- A degree in Computer Science, Computer Engineering, or a related field (BS or MS) or equivalent experience.
- 15+ years of relevant proven experience.
- Strong background in design and implementation.
- Passion for building scalable and performant inference applications.
- Hands-on development and deployment of high-quality, highly distributed cloud-based RESTful web services.
- Passion for extending your technical knowledge into new areas.
- Strong analytical skills and proven success in problem-solving and achieving performance objectives.
- Mentorship and the ability to grow teams and team members.
Preferred Qualifications:
- MS or PhD in Computer Science or an equivalent technical field.
- 10+ years of experience building end-to-end AI services and deploying them into production.
- 5+ years of experience with cloud native technologies such as Kubernetes, etc.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We highly value diversity in our current and future employees and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.
-
Santa Clara, California, United States NVIDIA Full timeAbout the RoleNVIDIA is the platform upon which every new AI-powered application is built. We are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.The right person for this role brings technical drive and creativity to change the way...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a Senior Software Engineer to develop components that are used by the software factory automation for NVIDIA Inference Microservices (NIMs) and its deployed services.The ideal candidate will bring technical drive and creativity to change the way NVIDIA provides high-performance inferencing for every AI model.NIM offerings are easy to use,...
-
SRE Manager, NVIDIA Inference Microservices
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeAbout NVIDIANVIDIA is the driving force behind the innovation revolution in AI, computing, and graphics. We are a leader in the development of technologies that power the world's most advanced computing systems.Job Title: SRE Manager, NIM FactoryWe are seeking a highly skilled SRE Manager to join our NIM Factory team. As an SRE Manager, you will be...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Engineer to join our Deep Learning Inference Workflows team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities: Develop graph parsers, optimizers, and tools for effective deployment of...
-
Santa Clara, California, United States NVIDIA Full timeJob SummaryNVIDIA is seeking a highly skilled Senior Software Engineer to join our TensorRT team in developing industry-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer, you will be responsible for designing and implementing inference optimizations to enable real-time AI applications on personal computing...
-
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies.We conduct applied research...
-
Senior Solutions Architect, NVIDIA
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeAre you a seasoned expert in designing, building, and maintaining large-scale HPC and AI hybrid computing solutions? We are seeking a highly skilled Senior Solutions Architect to join our team at NVIDIA.As a key member of our team, you will work closely with customers and partners to address unsolved problems in the industry and help deploy and...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities:Develop graph parsers, optimizers, and tools for effective deployment of trained deep...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Engineer to join our Deep Learning software team. As a key member of our team, you will be responsible for developing components of TensorRT, NVIDIA's SDK for high-performance deep learning inference.Key Responsibilities:Develop graph parsers, optimizers, and tools for effective deployment of trained deep...
-
Senior Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeJob SummaryNVIDIA is seeking a senior software engineer to design and build factory automation for NVIDIA Inference Microservices (NIMs). The ideal candidate will have a strong background in system software and platform layers, including kernel, device driver, memory, storage, networking, and PCIe devices. They will apply their technical expertise to design...
-
Santa Clara, California, United States NVIDIA Full timeAbout NVIDIANVIDIA is a leader in the technology industry, renowned for its innovative products and services. With a legacy of 30 years, we've been redefining computer graphics, PC gaming, and accelerated computing. Our mission is to harness the power of AI to drive the next era of computing, where our GPUs serve as the brains of computers, robots, and...
-
Senior Deep Learning Software Engineer
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Deep Learning Software Engineer to design and build our automated inference and deployment solution.As part of the team, you will play a pivotal role in architecting and designing a modular and scalable software platform to provide an excellent user experience with broad model support and optimization...
-
Senior Systems Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking a senior engineer to design and build a factory automation pipeline for NVIDIA Inference Microservices (NIMs). The right person for this role brings technical drive and creativity to change the way NVIDIA optimizes and serves performant inferencing for every AI model.The NIM offerings are easy to use, highly performant, and tested in all...
-
Senior Solutions Architect
4 weeks ago
Santa Clara, California, United States Nvidia Full timeNVIDIA Job DescriptionWe are seeking a highly skilled Solutions Architect to join our team at NVIDIA. As a key member of our AI Solutions team, you will play a critical role in helping our customers build innovative solutions using our latest AI technology.Key Responsibilities:Partner with cross-functional teams to understand customer needs and develop...
-
Santa Clara, California, United States Nvidia Full timeJob DescriptionWe are seeking a highly skilled Senior System Software Engineer to join our team at NVIDIA. As a key member of our GPU-accelerated deep learning software team, you will be responsible for designing and implementing infrastructure solutions for our Triton Inference Server.Key Responsibilities:Design and implement continuous integration,...
-
Solutions Architect for AI Enterprise
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeAre you passionate about emerging technologies and AI? We are seeking a skilled Solutions Architect to join our NVIDIA AI Enterprise team.The mission of our team is to guide and enable the successful adoption of DGX Cloud and NVIDIA AI Enterprise Software in production environments.DGX Cloud is an AI platform for enterprise developers, optimized for the...
-
AI Solutions Architect
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking highly skilled AI Solutions Architects to collaborate with customers on cutting-edge Generative AI projects.As a Senior AI Solutions Architect, you will work closely with customers to understand their technical needs and develop high-value solutions using NVIDIA's latest AI technology.You will partner with cross-functional teams to define...
-
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a world-leader in high-speed computer vision, artificial intelligence, and deep learning. Our team builds the accelerated software ecosystem that enables visual AI developers to innovate swiftly and efficiently at scale.We are seeking an outstanding individual to help us build highly optimized microservice products and NVIDIA NIMs that bring visual...
-
Senior Software Architect
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior Software Architect to join our system software engineering team at NVIDIA. As a key member of our team, you will be responsible for architecting, evaluating, and integrating proximity sensing and positioning solutions to our automotive platforms and products.You will collaborate with our global engineering teams to...
-
AI Solutions Architect
4 weeks ago
Santa Clara, California, United States NVIDIA Full timeAI Solutions ArchitectWe are seeking a highly skilled AI Solutions Architect to join our team at NVIDIA. As a key member of our Solution Architect organization, you will work closely with our customers to develop and deploy innovative AI solutions using NVIDIA's cutting-edge technologies.Key Responsibilities:Lead software customer technical engagements with...