Senior HPC Systems Engineer

2 weeks ago


Santa Clara, California, United States NVIDIA Full time
About NVIDIA

NVIDIA is a leader in the technology world, renowned for its innovative products and services. As a pioneer in the field of accelerated computing, NVIDIA has been transforming computer graphics, PC gaming, and AI for over 25 years.

Job Summary

We are seeking an exceptional Senior HPC Systems Engineer to join our team. As a key player in our datacenter performance team, you will be responsible for designing and implementing high-performance computing systems that drive innovation and excellence in AI and deep learning.

Key Responsibilities
  • Lead the implementation of performance practices in large-scale infrastructure
  • Develop powerful tools, methodologies, and flows to validate and improve datacenter products
  • Accelerate strategic customer deployments and ensure speed-of-light bringup and deployment of AI infrastructure
  • Provide engineering solutions to enable large-scale performance strategies for datacenter GPU computing products and software stacks
Requirements
  • 5+ years of experience in using accelerated computing for datacenter container computing solutions
  • BS in Engineering, Mathematics, Physics, or Computer Science, MS or PhD desirable (or equivalent experience)
  • Solid understanding of accelerated parallel computing models (MPI, NCCL)
  • Experience using and handling modern Cloud and container-based Enterprise computing architectures
What We Offer

NVIDIA offers highly competitive salaries and a comprehensive benefits package. As a member of our team, you will have the opportunity to work with cutting-edge technology and collaborate with talented individuals who share your passion for innovation.



  • Santa Clara, California, United States NVIDIA Full time

    About NVIDIANVIDIA is a leader in the field of computer graphics, PC gaming, and accelerated computing. With a legacy of innovation spanning over 25 years, we're committed to pushing the boundaries of what's possible with AI and GPU computing.Job SummaryWe're seeking an exceptional Senior HPC Systems Engineer to join our team. As a key player in our AI...


  • Santa Clara, California, United States NVIDIA Full time

    About NVIDIANVIDIA has been a pioneer in computer graphics, PC gaming, and accelerated computing for over 25 years. Our legacy of innovation is fueled by great technology and amazing people. Today, we're pushing the boundaries of AI to define the next era of computing.Job SummaryWe're seeking an exceptional Senior HPC Systems Engineer to join our team. As a...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior AI-HPC Storage EngineerNVIDIA is seeking a highly skilled Senior AI-HPC Storage Engineer to join our GPU AI/HPC Infrastructure team. As a member of this team, you will provide leadership in the design and implementation of groundbreaking fast storage solutions to enable runs of demanding deep learning, high performance computing, and...


  • Santa Clara, California, United States NVIDIA Full time

    About NVIDIANVIDIA has been a pioneer in computer graphics, PC gaming, and accelerated computing for over 25 years. Our legacy of innovation is fueled by great technology and amazing people. Today, we're pushing the boundaries of AI to define the next era of computing.Job SummaryWe're seeking an exceptional Senior HPC Systems Engineer to join our team. As a...


  • Santa Clara, California, United States NVIDIA Full time

    Senior Software Engineer - HPC Infrastructure SpecialistNVIDIA is a pioneer in the field of high-performance computing, and we're seeking a talented Senior Software Engineer to join our team. As a key member of our HPC infrastructure team, you will be responsible for designing and implementing scalable systems to meet the demands of our high-performance...


  • Santa Clara, California, United States Nvidia Full time

    Job Title: Senior Site Reliability Engineer - HPC StorageNVIDIA is a leader in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. We are seeking a phenomenal Senior Site Reliability Engineer to join our team and play a crucial role in designing, implementing, and optimizing on-prem High-Performance...

  • HPC Cluster Engineer

    3 weeks ago


    Santa Clara, California, United States Sustainable Talent Full time

    Unlock the Power of HPCSustainable Talent is seeking a seasoned HPC Cluster Engineer to join our team in shaping the future of AI, deep learning, and machine learning initiatives. As a key player in our Nvidia-powered HPC environment, you'll leverage cutting-edge GPU technology to drive groundbreaking discoveries and revolutionize industries.With over 25...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior AI-HPC Storage Solutions ArchitectNVIDIA is a leader in the field of artificial intelligence and high-performance computing, and we are seeking a highly skilled Senior AI-HPC Storage Solutions Architect to join our team.About the Role:We are looking for an expert in designing and implementing high-performance storage solutions for our AI...


  • Santa Clara, California, United States NVIDIA Full time

    About NVIDIANVIDIA is a leader in the field of artificial intelligence, machine learning, and datacenter acceleration. With a rich history of innovation, we have continuously pushed the boundaries of what is possible in the world of computing.Job SummaryWe are seeking an experienced Site Reliability Engineer to join our GPU AI/HPC Infrastructure team. As a...


  • Santa Clara, California, United States Nvidia Full time

    NVIDIA Job DescriptionWe are seeking a highly skilled Senior HPC Cluster Administrator to lead our GPU-accelerated systems and provide architectural mentorship to product teams in the deep learning and scientific computing domains.Key Responsibilities:Administer Linux systems, including powerful DGX servers and embedded systems, and bring up hardware to...


  • Santa Clara, California, United States Intel Full time

    Job SummaryWe are seeking an experienced AI and HPC Scale-out Systems architect to join our team at Intel. As a key member of our Data Center and Artificial Intelligence group, you will be responsible for architecting large-scale systems that support breakthrough performance on HPC and AI workloads.Key ResponsibilitiesArchitecting large-scale systems that...

  • HPC Cluster Engineer

    3 weeks ago


    Santa Clara, California, United States Sustainable Talent Full time

    Unlock the Power of HPCSustainable Talent is seeking a seasoned HPC Cluster Engineer to join our team in shaping the future of AI, deep learning, and machine learning initiatives. As a key player in our Nvidia-powered HPC environment, you'll leverage cutting-edge GPU technology to drive groundbreaking discoveries and revolutionize industries.As a trusted...


  • Santa Clara, California, United States NVIDIA Full time

    About NVIDIANVIDIA is a leader in AI, machine learning, and datacenter acceleration. Our company has continuously reinvented itself over two decades, with a strong focus on innovation and growth.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our GPU AI/HPC Infrastructure team. As a key member of our team, you will be responsible...


  • Santa Clara, California, United States NVIDIA Full time

    About NVIDIANVIDIA is a leader in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. Our products and services rely heavily on NVIDIA GPUs, which serve as the visual cortex of modern computers. Our work enables new universes to explore, facilitates amazing creativity and discovery, and powers innovations...


  • Santa Clara, California, United States NVIDIA Full time

    Unlock the Power of High-Performance ComputingNVIDIA is a pioneer in the field of high-performance computing, and we're seeking a talented Senior Software Engineer to join our team. As a leader in the industry, we've continuously pushed the boundaries of what's possible with our innovative solutions.As a Senior Software Engineer at NVIDIA, you'll be...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Site Reliability EngineerNVIDIA is a leader in AI, machine learning, and datacenter acceleration. Our company is expanding its leadership into datacenter networking with ethernet switches, NICs, and DPUs. We have continuously reinvented ourselves over two decades.Our invention of the GPU in 1999 sparked the growth of the PC gaming market,...


  • Santa Clara, California, United States Skilltorch Full time

    Job OverviewSkilltorch is seeking a Senior Director of Solutions Engineering to lead our team of technical experts in delivering innovative solutions for AI and high-performance computing (HPC) applications. As a key member of our leadership team, you will be responsible for shaping the development and deployment of enterprise solutions that meet complex...


  • Santa Clara, California, United States NVIDIA Full time

    Job DescriptionNVIDIA is a leader in the field of artificial intelligence and high-performance computing, and we are seeking a highly skilled AI-HPC Storage Engineer to join our team.The successful candidate will be responsible for designing and implementing cutting-edge storage solutions for our AI and HPC infrastructure, ensuring optimal performance and...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in groundbreaking developments in Artificial Intelligence, High Performance Computing, and Visualization. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars.We are the GPU Communications Libraries and...


  • Santa Clara, California, United States NVIDIA Full time

    About NVIDIANVIDIA is a leader in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. Our GPUs serve as the visual cortex of modern computers and are at the heart of our products and services.We are pushing the boundaries of innovation, enabling amazing creativity and discovery, and powering innovations such...