Senior Software Engineer HPC

3 weeks ago


Santa Clara, United States Promote Project Full time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI and enabled the next era of computing. NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to address, that matter to the world, and that only we can address. This is our life’s work: to amplify human imagination and intelligence, and to expand what is possible.

We’re seeking strategic, bold, hard-working, and creative individuals who are passionate about helping us tackle challenges no one else can solve. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC infrastructure. Our team builds and operates sophisticated infrastructure to enable business-critical services and AI applications. You will work with a team of passionate and skilled engineers who continuously strive to provide better tools for building and managing this infrastructure.

What you’ll be doing:
  1. Design highly available and scalable systems to meet the demands of our HPC clusters
  2. Evaluate new and innovative technologies as the landscape evolves
  3. Continuously improve infrastructure provisioning and management using automation
  4. Support a globally distributed, multi-cloud hybrid environment: AWS, GCP, and on-premises
  5. Build strong cross-functional relationships and align with partners across various business units
  6. Ensure the highest level of uptime and Quality of Service (QoS) for our users through operational excellence
  7. Participate in the team's on-call rotation and be a contact for service incidents

What we need to see:
  1. 10+ years of experience in design, implementation, and delivery of large engineering projects
  2. Comfortable with at least two of the following programming languages: Golang, Java, C/C++, Scala, Python, Elixir
  3. Understanding of the scalability challenges and performance of server-side code
  4. Able to craft and develop horizontally scalable, resilient systems that perform well under load
  5. Versatile technologist with experience in the full software development lifecycle, from inception and design to deployment, operation, and iterative development
  6. Proficient in cloud computing, with hands-on experience in at least one cloud platform: GCP, AWS, or Azure
  7. Proficient in modern CI/CD techniques, GitOps, and Infrastructure as Code (IaC)
  8. Strong work ethic and a passion for problem solving
  9. B.S. degree in Computer Science or related technical field (or equivalent experience)
  10. Detail-oriented with great communication and collaboration skills

Ways to stand out from the crowd:
  1. Prior experience building solutions for HPC clusters based on Slurm or Kubernetes
  2. Strong understanding of the Linux operating system and TCP/IP fundamentals

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA accepts applications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA is the world leader in accelerated computing. NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and digital twins is transforming the world's largest industries and profoundly impacting society. Learn more about NVIDIA.

