Senior Solution Architect, HPC and AI

2 weeks ago


Santa Clara, United States NVIDIA Full time

Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for a hardworking Solution Architect (SA) to join the NVIDIA AI Enterprise (NVAIE) SA Segment Team. The mission of the NVAIE Segment team is to guide and enable the successful adoption at scale of NVIDIA AI Enterprise Software in production.

In our Solutions Architecture team, we work with NVIDIA's pioneering hardware and software, driving the latest breakthroughs in artificial intelligence. We need people who enable customer adoption of NVIDIA technology and develop lasting relationships with our technology partners, making NVIDIA a key design choice for end-user solutions. On this team, you will support full stack deployment including architectural designs, workload orchestration and application optimization. At NVIDIA, you will be immersed in a diverse, encouraging environment where everyone is inspired to do their life's work. Come join the team and see how you can make a lasting impact on the world

What You’ll Be Doing:

  • Primary responsibilities will include building and enabling robust AI/HPC infrastructure for customers.
  • Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, training stability, real-time monitoring, logging, and alerting.
  • Engage in and improve services from inception and design through deployment, operation, and optimization.
  • Co-design telemetry of AI workloads to help engineering build solutions for more robust workloads at scale.
  • Communicate across internal teams to support the continuous improvement of NVIDIA's offerings and software designs.

What We Need to See:

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).
  • 8+ years of experience and knowledge of neural networks including good understanding of transformer architectures. Experience designing large scale AI workloads with SLURM and/or Kubernetes.
  • Proficiency with Python / C++ / Rust or other popular software languages.
  • Excellent verbal, written communication, and technical presentation skills in English.
  • You are motivated to work with multiple levels and teams across organizations.
  • Strong analytical and problem-solving skills.
  • Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very sophisticated projects.
  • You are a curious self-starter with a desire for continuous learning and sharing knowledge across the team.

Ways to Stand Out from The Crowd:

  • Experience orchestrating distributed Deep Learning training with SLURM.
  • Proficiency in DevOps, including hands-on experience with Ansible, Terraform or similar tools. Equivalent experience will be accepted as well.
  • 8+ years designing solutions with one or more Tier-1 Clouds (AWS, Azure, GCP or OCI) and cloud-native architectures and software.
  • Technical leadership with a strong understanding of NVIDIA technologies, and success in working with customers.
  • Expertise with parallel file systems (e.g. Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects (InfiniBand, Omni Path, and Gig-E).
  • Experience with integration and deployment of software products in production enterprise environments, and microservices software architecture.

The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr

  • Santa Clara, California, United States NVIDIA Full time

    As a Solutions Architect at NVIDIA, you will be part of a team that brings innovative AI solutions to our largest customers. We are looking for an experienced professional to assist customers in building AI/ML and HPC software solutions at scale.You will be driving end-to-end technology solutions with some of NVIDIA's most strategic customers, leveraging our...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers,...


  • Santa Clara, United States Advanced Micro Devices Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA is leading groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU -- our invention -- serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables groundbreaking creativity and discovery, and powers inventions...


  • Santa Clara, United States NVIDIA Full time

    Want to be part of a team that's revolutionizing the field of AI with data center scale solutions? We are looking for a hardworking Solution Architect with experience in designing, building, and maintaining large scale HPC and AI hybrid computing solutions to join our team at NVIDIA. As Solution Architects on the NVIDIA Partner Network team, we are actively...


  • Santa Clara, United States NVIDIA Corporation Full time

    Solutions Architect, Cloud Providers and Hyperscale Apply locations US, CA, Santa Clara US, WA, Remote US, CA, Remote Time type: Full time Posted on: Posted 30+ Days Ago Job requisition id: JR1980504 NVIDIA is looking for an experienced Solutions Architect to assist customers building infrastructure for AI and HPC. Do you want to be part of a team that...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company,...


  • Santa Clara, United States Advanced Micro Devices, Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • Santa Clara, United States E-Solutions Full time

    Title: Principal AI ArchitectSeniority: Director/HeadLocation: Santa, Clara, CA, and New Jersey-Onsitecandidates need to work onsite in the Client's AI Lab AWS or GCP or Azure background(Preferred)Employment:- Full-Time(Long Term) Position SummaryThe Principal AI Architect is responsible for leading the design and implementation of advanced AI solutions and...


  • santa clara, United States E-Solutions Full time

    Title: Principal AI ArchitectSeniority: Director/HeadLocation: Santa, Clara, CA, and New Jersey-Onsitecandidates need to work onsite in the Client's AI Lab AWS or GCP or Azure background(Preferred)Employment:- Full-Time(Long Term) Position SummaryThe Principal AI Architect is responsible for leading the design and implementation of advanced AI solutions and...


  • santa clara, United States E-Solutions Full time

    Title: Principal AI ArchitectSeniority: Director/HeadLocation: Santa, Clara, CA, and New Jersey-Onsitecandidates need to work onsite in the Client's AI Lab AWS or GCP or Azure background(Preferred)Employment:- Full-Time(Long Term) Position SummaryThe Principal AI Architect is responsible for leading the design and implementation of advanced AI solutions and...


  • Santa Clara, United States NVIDIA Full time

    We are looking for a AI Solution Architect Engineer with experience in Generative AI software development and deployment. As part of the Solution Architect organization, we work with the most exciting computing hardware and software, driving the latest breakthroughs in deep learning and AI with NVIDIA’s key customers. This role offers an excellent...


  • Santa Clara, United States Promote Project Full time

    We are looking for an AI Solution Architect Engineer with experience in Generative AI software development and deployment. As part of the Solution Architect organization, we work with the most exciting computing hardware and software, driving the latest breakthroughs in deep learning and AI with NVIDIA’s key customers. This role offers an excellent...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI and enabled the next era of computing. NVIDIA is a “learning machine” that constantly evolves...


  • Santa Clara, United States NVIDIA Corporation Full time

    Senior Software Engineer - HPC Locations: US, CA, Santa Clara; US, MA, Westford; US, TX, Austin; US, NC, Durham Time Type: Full time Posted on: Posted 17 Days Ago Job Requisition ID: JR1979406 NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer...


  • Santa Clara, United States Promote Project Full time

    NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI and enabled the next era of computing. NVIDIA is a “learning machine” that constantly evolves...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company...

  • Principal AI Architect

    3 months ago


    Santa Clara, United States HCLTech Full time

    Position Summary The Principal AI Architect is responsible for leading the design and implementation of advanced AI solutions and strategic architecture. Working closely with technology leaders from across our global client community, you will be their senior Trusted Advisor for their AI-enabled transformation journey This role demands deep understanding of...


  • Santa Clara, United States Advanced Micro Devices Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled AI Solution Architect Engineer to join our team at NVIDIA. As a key member of our Solution Architect organization, you will work with the latest breakthroughs in deep learning and AI, driving innovation and excellence in AI software development and deployment.This role offers an excellent opportunity to build your career in...