Software Engineering Manager

2 weeks ago


Santa Clara, United States NVIDIA Full time

We are the GPU Communications Libraries and Networking team at NVIDIA. We deliver communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. DL and HPC applications have a huge compute demand already and run on scales which go up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (eg. NVLink, PCIe) within a node and with high-speed networking (eg. Infiniband, Ethernet) across the nodes.

Communication performance between the GPUs has a direct impact on the end-to-end application performance; and the stakes are even higher at huge scales We are looking for a technical leader to manage our NVSHMEM and UCX libraries. This is an outstanding opportunity to push the limits on the state-of-the-art and deliver platforms the world has never seen before. Are you ready to contribute to the development of innovative technologies and help realize NVIDIA's vision?

What you will be doing:

  • Lead, mentor, and grow your library engineering team and be responsible for the planning and execution of projects as well as the quality and performance of your libraries.
  • This is a technical leadership role so you will participate in feature design and implementation.
  • Interact with internal and external partners and researchers to understand their use cases and requirements. Collaborate with engineering teams, program and product management, and partners to define the product roadmap.
  • Continuously review and identify improvement opportunities in established processes, infrastructure, and practices to ensure the teams are executing in the most efficient and transparent manner.

What we need to see:

  • 10+ overall years of experience in the software industry with specialization in HPC networking or system software.
  • 4+ years of management experience.
  • BS, MS, or Ph.D. in CS, CE, EE (related technical field) or equivalent experience.
  • Prior systems software or communication runtime or high performance networking software development experience with a successful track record of taking several complex software features or products through the full product life cycle.
  • Strong understanding of computer system architecture, operating systems principles (aka systems software fundamentals), HW-SW interactions and performance analysis/optimizations.
  • Excellent C/C++ programming and debugging skills in Linux.
  • Experience balancing multiple projects with competing priorities.
  • Flexibility to work and communicate effectively across different teams and timezones.

Ways to stand out from the crowd:

  • Experience with parallel programming models (MPI, SHMEM) and at least one communication runtime (MPI, NCCL, NVSHMEM, OpenSHMEM, UCX, UCC). Experience with programming using CUDA, MPI, OpenMP, OpenACC, pthreads.
  • Background with RDMA, high-performance networking technologies (InfiniBand, RoCE, Ethernet, EFA), network architecture and network topologies. Knowledge of HPC and ML/DL fundamentals.
  • Experience with Deep Learning Frameworks such as PyTorch, TensorFlow, etc.

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world. NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you

The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr

  • santa clara, United States SP Software Solutions Full time

    Role: Embedded Software EngineerLocation: Santa Clara, CADuration: 12 monthsJob description• Design and develop Mendix applications, following internal best practices and guidelines.• Collaborate with business analysts, designers, and other developers to gather requirements and translate them into functional applications.• Customize and extend Mendix...

  • Software Engineer III

    1 month ago


    santa clara, United States Intelliswift Software Full time

    Must Have skillsJava or PythonGoRESTNoSQLKafkaAWSDocker / KubernetesDescription:We are seeking a highly skilled Server Software Engineer to join our dynamic team. This role will focus on designing, building, and maintaining large-scale distributed systems to support the end-to-end machine learning (ML) lifecycle. The server engineer is responsible for design...

  • Software Engineer III

    1 month ago


    Santa Clara, United States Intelliswift Software Full time

    Must Have skillsJava or PythonGoRESTNoSQLKafkaAWSDocker / KubernetesDescription:We are seeking a highly skilled Server Software Engineer to join our dynamic team. This role will focus on designing, building, and maintaining large-scale distributed systems to support the end-to-end machine learning (ML) lifecycle. The server engineer is responsible for design...


  • Santa Clara, California, United States Apple Full time

    Job DescriptionCompany: AppleJob Title: Software Engineering ManagerDepartment: Software DeliveryLocation: Santa Clara, California, United StatesWe are seeking a highly skilled Software Engineering Manager to lead the Device Services Engineering team within Software Delivery at Apple. The successful candidate will be responsible for supervising and mentoring...


  • Santa Clara, California, United States United Software Group Full time

    We are seeking a skilled Software Development Engineer in Test to join our team at United Software Group. As a key member of our software development team, you will be responsible for designing, developing, and maintaining high-quality software products.The ideal candidate will have a strong background in software development, with experience in languages...


  • Santa Clara, California, United States NVIDIA Full time

    We are the GPU Communications Libraries and Networking team at NVIDIA, and we're looking for a talented Software Engineering Manager to join our team.As a Software Engineering Manager, you will be responsible for leading and mentoring a team of library engineers, planning and executing projects, and ensuring the quality and performance of our libraries.You...


  • Santa Clara, California, United States Apple Full time

    Job SummaryApple is seeking a skilled Software Engineering Manager to lead the Device Services Engineering team within Software Delivery. The ideal candidate will have a strong background in software engineering, experience in managing high-performance teams, and excellent communication skills.Key ResponsibilitiesSupervise and maintain thousands of iOS/macOS...

  • Software Engineer III

    3 weeks ago


    Santa Clara, United States Intelliswift Software Inc Full time

    Must Have skillsJava or PythonGoRESTNoSQLKafkaAWSDocker / KubernetesDescription:We are seeking a highly skilled Server Software Engineer to join our dynamic team. This role will focus on designing, building, and maintaining large-scale distributed systems to support the end-to-end machine learning (ML) lifecycle. The server engineer is responsible for design...


  • Santa Clara, California, United States Applied Materials Full time

    We are seeking a skilled Senior Software Engineering Manager to lead our team in designing, developing, and maintaining scalable and robust software solutions for semiconductor equipment.The ideal candidate will have hands-on experience in developing software that interfaces with hardware, as well as demonstrated experience in hiring, managing, growing,...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RoleWe are seeking a highly skilled Software Engineering Manager to lead our DLP team. As a Manager, you will be responsible for overseeing the development of our DLP software, ensuring it meets the highest standards of quality and security.Key ResponsibilitiesLead a team of software engineers to design, develop, and deliver high-quality DLP...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA is looking for a Software Engineering Manager to lead the DVS SRE team within the IPP Group (Infrastructure, Planning and Process). IPP is a global organization within NVIDIA. Play a pivotal role in shaping the future of technology. In this position, you will have the chance to work on innovative projects, collaborate with highly skilled individuals,...


  • Santa Clara, United States NVIDIA Full time

    We are the GPU Communications Libraries and Networking team at NVIDIA. We deliver communication libraries like NCCL, NVSHMEM, UCX for Deep Learning and HPC. DL and HPC applications have a huge compute demand already and run on scales which go up to tens of thousands of GPUs. The GPUs are connected with high-speed interconnects (eg. NVLink, PCIe) within a...

  • Software Engineer

    3 weeks ago


    santa clara, United States netPolarity, Inc. (Saicon Consultants, Inc.) Full time

    I would like to set up a time to discuss the potential opportunity below: Role: Software EngineerLocation: Santa Clara, CA (Hybrid)Duration: 12 monthsResponsibilities:Design, build, test, monitor, manage and maintain tool chains for critical functions.Automate technologies to improve the velocity and quality of engineering efforts.Requirements:BS/MS Computer...

  • Software Engineer

    3 weeks ago


    Santa Clara, United States netPolarity, Inc. (Saicon Consultants, Inc.) Full time

    I would like to set up a time to discuss the potential opportunity below: Role: Software EngineerLocation: Santa Clara, CA (Hybrid)Duration: 12 monthsResponsibilities:Design, build, test, monitor, manage and maintain tool chains for critical functions.Automate technologies to improve the velocity and quality of engineering efforts.Requirements:BS/MS Computer...

  • Software Engineer

    3 weeks ago


    santa clara, United States netPolarity, Inc. (Saicon Consultants, Inc.) Full time

    I would like to set up a time to discuss the potential opportunity below: Role: Software EngineerLocation: Santa Clara, CA (Hybrid)Duration: 12 monthsResponsibilities:Design, build, test, monitor, manage and maintain tool chains for critical functions.Automate technologies to improve the velocity and quality of engineering efforts.Requirements:BS/MS Computer...

  • Software Engineer

    3 weeks ago


    Santa Clara, United States netPolarity Full time

    I would like to set up a time to discuss the potential opportunity below:Role: Software EngineerLocation: Santa Clara, CA (Hybrid)Duration: 12 monthsResponsibilities:Design, build, test, monitor, manage and maintain tool chains for critical functions.Automate technologies to improve the velocity and quality of engineering efforts.Requirements:BS/MS Computer...


  • Santa Clara, California, United States Sustainable Talent Full time

    Job Title: Software Test EngineerJob Description:We are seeking a highly skilled Software Test Engineer to join our team at Sustainable Talent. As a Software Test Engineer, you will play a key member of the data collection and operation teams, serving as a liaison for the engineering team.You will work with our data engineering team to design engineering...


  • Santa Clara, United States Arista Networks, Inc. Full time

    Job Description As a core member of the Platform software team, the candidate will be part of a fast-paced, high caliber team building diagnostic software to help validate high-speed digital designs and achieve high yields in manufacturing Arista Network products used in the computer networking industry's largest data centers. Who You’ll Work With The...


  • Santa Clara, United States ZipRecruiter Full time

    Job DescriptionCompany DescriptionIt all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today — ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent...

  • Software Engineer

    4 weeks ago


    Santa Clara, California, United States Oracle Full time

    Job Title: Software EngineerOracle is seeking a skilled Software Engineer to join our team. As a Software Engineer, you will be responsible for designing, developing, and testing software applications and operating systems.Key Responsibilities:Design and develop software applications and operating systemsTest and debug software applications and operating...