Senior AI-HPC Storage Solutions Architect

4 weeks ago


Santa Clara, California, United States NVIDIA Full time
Job Title: Senior AI-HPC Storage Solutions Architect

NVIDIA is a leader in the field of artificial intelligence and high-performance computing, and we are seeking a highly skilled Senior AI-HPC Storage Solutions Architect to join our team.

About the Role:

We are looking for an expert in designing and implementing high-performance storage solutions for our AI and HPC workloads. The ideal candidate will have a strong background in computer science, electrical engineering, or a related field, and will have experience in designing and operating large-scale storage infrastructure.

Key Responsibilities:
  • Design and implement high-performance storage solutions for AI and HPC workloads
  • Research and implement distributed storage services
  • Design and implement scalable and efficient next-gen storage solutions tailored for data-intensive applications
  • Develop tooling to automate management of large-scale infrastructure environments
  • Collaborate across teams to better understand developers' workflows and gather their infrastructure requirements
  • Influence and guide methodologies for building, testing, and deploying applications to ensure optimal performance and resource utilization
Requirements:
  • Bachelor's degree in Computer Science, Electrical Engineering, or related field
  • 8+ years of experience designing and operating large-scale storage infrastructure
  • Experience analyzing and tuning performance for a variety of AI/HPC workloads
  • Experience with one or more parallel or distributed filesystems such as Lustre, GPFS
  • Proficient in Centos/RHEL and/or Ubuntu Linux distros, including Python programming and bash scripting
  • Strong experience operating services in leading cloud environments (AWS, Azure, GCP)
  • Experience with AI/HPC cluster job schedulers such as SLURM, LSF
  • In-depth understanding of container technologies like Docker, Enroot
  • Experience with AI/HPC workflows that use MPI
Preferred Qualifications:
  • Experience with NVIDIA GPUs, Cuda Programming, NCCL, and MLPerf benchmarking
  • Experience with Machine Learning and Deep Learning concepts, algorithms, and models
  • Familiarity with InfiniBand with IBOP and RDMA
  • Background with Software Defined Networking and AI/HPC cluster networking
  • Familiarity with deep learning frameworks like PyTorch and TensorFlow
What We Offer:

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most resourceful and talented people in the world working for us, and our extraordinary engineering teams are growing fast.

If you're a creative and autonomous engineer with real passion for technology, we want to hear from you.

The base salary range is $180,000 - $339,250. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer.



  • Santa Clara, California, United States NVIDIA Full time

    Job DescriptionNVIDIA is a leader in the field of artificial intelligence and high-performance computing, and we are seeking a highly skilled AI-HPC Storage Engineer to join our team.The successful candidate will be responsible for designing and implementing cutting-edge storage solutions for our AI and HPC infrastructure, ensuring optimal performance and...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Product Architect, HPC and AIJob Summary: We are seeking a visionary Product Architect to join our team at NVIDIA. As a key member of our team, you will harness your infrastructure expertise to create reference designs for the world's most powerful AI clusters.Responsibilities:* Design the next-gen datacenter-scale AI infrastructure,...


  • Santa Clara, California, United States NVIDIA Full time

    Job Description:NVIDIA is the world leader in computer graphics, artificial intelligence, and accelerated computing. For over 25 years, we have been at the forefront of research and engineering around the greatest advances in technology. Our history of innovation drives us to solve the world's hardest problems.We are looking for a Senior HPC and AI Solutions...


  • Santa Clara, California, United States Amazon Full time

    Job DescriptionWe are seeking a highly skilled Sr. Worldwide Specialist Solutions Architect to join our team at Amazon Web Services (AWS). As a key member of our sales organization, you will work with customers to design and implement cloud-based solutions for High Performance Computing (HPC) workloads.Key Responsibilities:Design and architect HPC solutions...


  • Santa Clara, California, United States Intel Full time

    Job SummaryWe are seeking an experienced AI and HPC Scale-out Systems architect to join our team at Intel. As a key member of our Data Center and Artificial Intelligence group, you will be responsible for architecting large-scale systems that support breakthrough performance on HPC and AI workloads.Key ResponsibilitiesArchitecting large-scale systems that...


  • Santa Clara, California, United States NVIDIA Full time

    Are you a seasoned expert in designing, building, and maintaining large-scale HPC and AI hybrid computing solutions? We are seeking a highly skilled Senior Solutions Architect to join our team at NVIDIA.As a key member of our team, you will work closely with customers and partners to address unsolved problems in the industry and help deploy and...


  • Santa Clara, California, United States NVIDIA Full time

    As a Solutions Architect at NVIDIA, you will be part of a team that brings innovative AI solutions to our largest customers. We are looking for an experienced professional to assist customers in building AI/ML and HPC software solutions at scale.You will be driving end-to-end technology solutions with some of NVIDIA's most strategic customers, leveraging our...


  • Santa Clara, California, United States Intel Full time

    Job DescriptionThe rapid growth of Artificial Intelligence across various industries presents a significant opportunity for Intel, with data center AI systems and technologies being a key area for expansion.To support a wide range of AI, HPC, enterprise, and cloud workloads, Intel's systems and technologies must be optimized for best-in-class performance,...


  • Santa Clara, California, United States NVIDIA Full time

    Are you passionate about bringing Artificial Intelligence (AI) solutions to NVIDIA's largest customers? As a Solutions Architect, you will play a key role in assisting customers in building AI/ML and HPC software solutions at scale.As part of the NVIDIA Solutions Architecture team, you will drive end-to-end technology solutions with some of NVIDIA's most...


  • Santa Clara, California, United States Advanced Micro Devices , Inc. Full time

    Transforming Lives with AMD TechnologyWe are committed to enriching our industry, communities, and the world with AMD technology. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded systems.The Team:Our Data Center GPU...


  • Santa Clara, California, United States HPE Full time

    Job Description:Hewlett Packard Enterprise is seeking a highly skilled Software Engineer to join our HPC and AI organization. As a key member of the Slingshot Ethernet Fabric team, you will play a critical role in expanding HPE's High Performance Ethernet Fabric product growth through Commercial HPC use cases, AI use cases networking, systems, and...


  • Santa Clara, California, United States AMD Full time

    Job SummaryWe are seeking a highly skilled AI - GPU System Architect to join our Data Center GPU organization at AMD. As a technical leader, you will be responsible for creating AMD's future accelerated computing platforms, interacting with key engineering teams, and assessing strategic technologies to integrate into our accelerated processing platforms.Key...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a highly skilled Storage Systems Architect to join our team. As a key member of our Enterprise Platform team, you will be responsible for designing and implementing cutting-edge storage solutions that enable the development of AI and Generative AI technologies.As a Storage Systems Architect, you will work closely with our engineering teams...


  • Santa Clara, California, United States NVIDIA Full time

    AI Solutions ArchitectWe are seeking a highly skilled AI Solutions Architect to join our team at NVIDIA. As a key member of our Solution Architect organization, you will work closely with our customers to develop and deploy innovative AI solutions using NVIDIA's cutting-edge technologies.Key Responsibilities:Lead software customer technical engagements with...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking highly skilled AI Solutions Architects to collaborate with customers on cutting-edge Generative AI projects.As a Senior AI Solutions Architect, you will work closely with customers to understand their technical needs and develop high-value solutions using NVIDIA's latest AI technology.You will partner with cross-functional teams to define...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Site Reliability Engineer - HPC StorageNVIDIA is a leader in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. Our work opens up new universes to explore, enables unique creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to...

  • Solutions Architect

    4 weeks ago


    Santa Clara, California, United States NVIDIA Full time

    Job DescriptionNVIDIA is seeking a highly skilled Solutions Architect to join our Data Center Solutions Architecture team. As a key member of our team, you will be responsible for driving end-to-end technology solutions with our most strategic customers, including web2.0, cloud, HPC AI, and enterprise datacenter customers.You will work closely with our...


  • Santa Clara, California, United States Nvidia Full time

    AI Solution Architect EngineerWe are seeking an experienced AI Solution Architect Engineer to join our team at NVIDIA. As a key member of our Solution Architect organization, you will work with our customers to develop and deploy innovative AI solutions using NVIDIA's cutting-edge technologies.Key Responsibilities:Develop and demonstrate software solutions...


  • Santa Clara, California, United States HPE Full time

    About the Role:Hewlett Packard Enterprise (HPE) is seeking an experienced Software Engineer to join the Slingshot Ecosystem Development Team. This role will focus on expanding HPE's High Performance Ethernet Fabric product growth through Commercial HPC use cases, AI use cases networking, systems, and application and open-source...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in computer graphics, PC gaming, and accelerated computing, with a legacy of innovation driven by technology and people. We are seeking a senior architect to join our Advanced Development team and shape the future of our company.As a senior architect, you will be responsible for crafting architectural solutions and participating in...