Senior Deep Learning Algorithm Engineer

6 days ago


Santa Clara, California, United States NVIDIA Full time $148,000 - $287,500 per year

NVIDIA is looking for engineers for our core AI Frameworks (Megatron Core and NeMo Framework) team to design, develop, and optimize diverse real-world workloads. Megatron Core and NeMo Framework are open-source, scalable, and cloud-native frameworks built for researchers and developers working on Large Language Model (LLM) and Multimodal (MM) foundation model pretraining and post-training. Our GenAI Frameworks provide end-to-end model training, including pretraining, reasoning, alignment, customization, evaluation, deployment, and tooling to optimize performance and user experience.

In this critical role, you will expand Megatron Core and NeMo Framework's capabilities, enabling users to develop, train, and optimize models. You will do this by designing and implementing the latest distributed training algorithms, model parallel paradigms, and model optimizations; defining robust APIs; meticulously analyzing and tuning performance; and expanding our toolkits and libraries to be more comprehensive and coherent. You will collaborate with internal partners, users, and members of the open-source community to analyze, design, and implement highly optimized solutions.

What you'll be doing:

  • Develop algorithms for AI/DL, data analytics, machine learning, or scientific computing

  • Contribute to and advance the open-source Megatron Core and NeMo Framework.

  • Solve large-scale, end-to-end AI training and inference challenges, spanning the full model lifecycle from initial orchestration and data pre-processing, through model training and tuning, to model deployment.

  • Work at the intersection of computer architecture, libraries, frameworks, AI applications, and the entire software stack.

  • Innovate and improve model architectures, distributed training algorithms, and model parallel paradigms.

  • Tune and optimize performance, and train and fine-tune models with mixed precision recipes on next-gen NVIDIA GPU architectures.

  • Research, prototype, and develop robust and scalable AI tools and pipelines.

What we need to see:

  • MS or equivalent experience in Computer Science, AI, Applied Math, or related fields and 3+ years of industry experience.

  • Experience with AI Frameworks (e.g. PyTorch, JAX), and/or inference and deployment environments (e.g. TRTLLM, vLLM, SGLang).

  • Proficient in Python programming, software design, debugging, performance analysis, test design and documentation.

  • Consistent record of working effectively across multiple engineering initiatives and improving AI libraries with new innovations.

  • Strong understanding of AI/Deep-Learning fundamentals and their practical applications.

Ways to stand out from the crowd:

  • Hands-on experience in large-scale AI training, with a deep understanding of core compute system concepts (such as latency/throughput bottlenecks, pipelining, and multiprocessing) and demonstrated excellence in related performance analysis and tuning.

  • Expertise in distributed computing, model parallelism, and mixed precision training.

  • Prior experience with Generative AI techniques applied to LLM and Multi-Modal learning (Text, Image, and Video).

  • Knowledge of GPU/CPU architecture and related numerical software.

  • Contributions to open source deep learning frameworks.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working with us. If you're creative and autonomous, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 148,000 USD - 235,750 USD for Level 3, and 184,000 USD - 287,500 USD for Level

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until October 31, 2025.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
