Senior AI Software Engineer, GenAI Framework

4 weeks ago


Santa Clara, California, United States NVIDIA Full time

We are seeking a highly skilled AI Software Engineer to join our team at NVIDIA. As a key member of our team, you will design and implement new model development features and optimizations, define APIs, analyze and tune performance, and expand functionality coverage to build larger, coherent toolsets and libraries.

Key Responsibilities:

  • Develop the open-source GenAI NeMo framework and Megatron Core.
  • Solve large-scale, end-to-end AI training and inference-deployment challenges (data curation, pre-processing, orchestrating and running model training and tuning, model serving).
  • Work at the intersection of deep learning applications, libraries, frameworks, and the entire software stack.
  • Tune and optimize the performance of deep learning framework and software components.
  • Research, prototype and develop effective AI tools and pipelines.

Requirements:

  • MS, PhD or equivalent experience in Computer Science, AI, Applied Math, or related field and 5+ years of industry experience.
  • Experience with AI Frameworks (e.g. PyTorch, JAX), and/or inference and deployment environments (e.g. TRT, ONNX, Triton).
  • Proficient in Python programming, software design, debugging, performance analysis, test design and documentation.
  • Consistent record of working effectively across multiple engineering initiatives and improving AI libraries with new innovations.
  • Solid understanding of deep learning fundamentals and techniques.

Preferred Qualifications:

  • Experience with large-scale AI training and an understanding of compute system concepts (latency/throughput bottlenecks, pipelining, multiprocessing) and the related performance analysis and tuning.
  • Prior experience with Generative AI techniques applied to LLM and multimodal (MM) learning (image, video, speech).
  • Knowledge of GPU/CPU architecture and related numerical software.
  • Experience with cloud computing, e.g. end-to-end pipelines for AI training and inference on CSPs (AWS/Azure/GCP).
  • Contributions to open source deep learning frameworks.

NVIDIA is a leader in the technology industry and we are committed to fostering a diverse and inclusive work environment. We are an equal opportunity employer and welcome applications from qualified candidates from all backgrounds.

The base salary range for this position is $180,000 - $339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.



  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a senior build and continuous integration (CI/CD) engineer for its GenAI Frameworks (NeMo, Megatron Core) team. NVIDIA NeMo is an open-source, scalable, and cloud-native framework built for researchers and developers working on Large Language Models (LLM), Multimodal (MM), and Speech AI. NeMo provides end-to-end model training, including data...

  • AI Systems Engineer

    4 weeks ago


    Santa Clara, California, United States Meshy Full time

    About Meshy: We are a leading 3D generative AI company headquartered in Silicon Valley, on a mission to unleash 3D creativity. We simplify the creation of distinctive 3D assets for both professional artists and hobbyists by transforming text and images into stunning 3D models in minutes. Our global team of experts in computer graphics, AI, and art includes...

  • Principal Scientist

    4 weeks ago


    Santa Clara, California, United States Amazon Full time

    About the Role: We are seeking a highly skilled Principal Scientist to join our team at Amazon. As a key member of our organization, you will be responsible for leading advanced research in Large Language Models (LLMs), Generative AI, and Deep Learning. Key Responsibilities: Conduct research and develop novel algorithms, architectures, and methodologies for...


  • Santa Clara, California, United States ServiceNow Full time

    At ServiceNow, we're transforming the way organizations work by harnessing the power of AI-enhanced technology. As a Senior Staff Software Engineer for Conversational AI Experiences, you'll play a key role in building the frameworks that power our line of Generative AI products. Key Responsibilities: Design and develop high-quality, scalable, and reusable...

  • Senior AI/ML Engineer

    1 month ago


    Santa Clara, California, United States Eightfold LLC Full time

    About Eightfold.ai: We're at the forefront of innovation in the AI-driven HR tech space, shaping the future of how organizations find, manage, and empower their talent. Our groundbreaking AI platform is revolutionizing the industry, and we're looking for exceptional engineers to join our team and drive the next wave of advancements. About the AI/ML Team: Our...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled AI Solution Architect Engineer to join our team at NVIDIA. As a key member of our Solution Architect organization, you will work with the latest breakthroughs in deep learning and AI, driving innovation and excellence in AI software development and deployment. This role offers an excellent opportunity to build your career in...

  • AI 3D Model Engineer

    4 weeks ago


    Santa Clara, California, United States Meshy Full time

    About Meshy: We are a leading 3D generative AI company headquartered in Silicon Valley, on a mission to Unleash 3D Creativity. Our platform simplifies the creation of distinctive 3D assets for both professional artists and hobbyists by transforming text and images into stunning 3D models in minutes. Our global team of 30 experts in computer graphics, AI,...


  • Santa Clara, California, United States NVIDIA Full time

    AI Solution Architect Engineer: We are seeking an experienced AI Solution Architect Engineer to join our team at NVIDIA. As a key member of our Solution Architect organization, you will work with our customers to develop and deploy innovative AI solutions using NVIDIA's cutting-edge technologies. Key Responsibilities: Develop and demonstrate software solutions...


  • Santa Clara, California, United States Amazon Full time

    Job Description: We are seeking a highly skilled GenAI Solutions Architect to join our team at Amazon. As a GenAI Solutions Architect, you will be responsible for designing and implementing scalable GenAI solutions for our customers. You will work closely with our engineering teams to develop and deploy GenAI workloads on AWS, and will facilitate the...


  • Santa Clara, California, United States Couchbase Full time

    Empower Modern Applications: Every day, we tackle new and exciting challenges to empower developers to build modern cloud, mobile, and edge applications that deliver a premium user experience. Couchbase's fast, flexible, and affordable cloud database platform, Capella, enables organizations to quickly build applications that deliver premium experiences to...


  • Santa Clara, California, United States Couchbase, Inc. Full time

    Empower the Future of Database Technology: Couchbase is seeking a highly skilled Senior Software Engineer to join our AI team. As a key member of our engineering team, you will design and implement cutting-edge database and AI features and tools using the latest techniques to evolve Couchbase products and Capella service. Key Responsibilities: Design and...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Engineer, AI to join our team at NVIDIA. Our high-performance computing platforms are powering the AI revolution, and our GPUs deliver industry-leading performance on many applications, including generative AI through our impressive suite of software products like TensorRT and cuDNN. As a member of our team, you...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly experienced AI Senior Software Quality Assurance Engineer to join NVIDIA's Deep Learning SWQA team. The position is in NVIDIA's Deep Learning and AI Software Quality Assurance team that defines, develops, and performs tests to validate robustness and measure the performance of NVIDIA's Deep Learning software and GPU Infrastructure for...

  • Software Engineer

    4 weeks ago


    Santa Barbara, California, United States HPE Full time

    Software Engineer - Generative AI - Early Career at Hewlett Packard Labs: Hewlett Packard Enterprise is at the forefront of High Performance Computing and AI innovations to solve the most difficult and complex problems that we are facing today. In our Physics-Based GenAI team at Hewlett Packard Labs, we are dedicated to pushing the boundaries of what's...


  • Santa Clara, California, United States NVIDIA Full time

    AI Solutions Architect: We are seeking a highly skilled AI Solutions Architect to join our team at NVIDIA. As a key member of our Solution Architect organization, you will work closely with our customers to develop and deploy innovative AI solutions using NVIDIA's cutting-edge technologies. Key Responsibilities: Lead software customer technical engagements with...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior High-Performance AI Training Engineer to join our team at NVIDIA. As a key member of our team, you will be responsible for optimizing AI training workloads on innovative hardware and software platforms. This role offers the opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads...


  • Santa Clara, California, United States Advanced Micro Devices, Inc. Full time

    Unlock the Power of AI with AMD: We're seeking a highly skilled AI Optimization Engineer to join our team at Advanced Micro Devices, Inc. Our mission is to push the boundaries of innovation and solve the world's most complex challenges. As an AI Optimization Engineer, you'll be responsible for designing and optimizing machine learning models for our...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Senior Manager of Software Engineering AIOps to join our team at Palo Alto Networks. As a key member of our engineering team, you will be responsible for leading the development of advanced AI and Machine Learning enabled solutions to detect and remediate network issues, discover problems, provide insights, and...


  • Santa Clara, California, United States NVIDIA Full time

    Job Description: NVIDIA is the world leader in computer graphics, artificial intelligence, and accelerated computing. For over 25 years, we have been at the forefront of research and engineering around the greatest advances in technology. Our history of innovation drives us to solve the world's hardest problems. We are looking for a Senior HPC and AI Solutions...