Senior LLM Research Engineer

2 weeks ago


Santa Clara, California, United States NVIDIA Full time

Job Summary

We are seeking a skilled engineer to join our team and help shape the future of agentic inference systems. As a Senior LLM Research Engineer, you will play a critical role in improving the algorithmic performance and efficiency of large language models.

Responsibilities:

  • Research and development of contemporary research on generative AI, agents, and inference systems.
  • Workload analysis and optimization of agentic LLM workloads to reduce request latency and increase request throughput.
  • Design and implementation of scalable systems to accelerate agentic workflows and handle sophisticated datacenter-scale use cases.
  • Collaboration and communication with diverse teams at NVIDIA and external partners.

Requirements:
  • BS, MS, PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).
  • 3+ years of experience in deep learning and deep learning systems design.
  • Proficiency in Python and C++ programming.
  • Strong understanding of computer architecture, and GPU/parallel datacenter computing fundamentals.
  • Proven interest in analyzing, modeling, and tuning application performance.

NVIDIA benefits:
  • Competitive base salary range: $180,000 - $339,250.
  • Eligibility for equity and benefits.


  • Santa Clara, California, United States ServiceNow Full time

    At ServiceNow, we're pushing the boundaries of AI innovation to create a better world for everyone. As a Senior Research Engineer, you'll be part of our cutting-edge AI research team, dedicated to delivering secure, private, and reliable AI solutions for enterprise settings.The team's mission is to innovate and deliver practical solutions that advance the...


  • Santa Clara, California, United States ServiceNow Full time

    About ServiceNowServiceNow is a global market leader in innovative AI-enhanced technology, serving over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based platform connects people, systems, and processes to empower organizations to work smarter, faster, and better.Job SummaryWe are seeking a highly skilled Senior Research Engineer...

  • Senior AI/ML Engineer

    4 weeks ago


    Santa Clara, California, United States Eightfold LLC Full time

    About Eightfold.aiWe're at the forefront of innovation in the AI-driven HR tech space, shaping the future of how organizations find, manage, and empower their talent. Our groundbreaking AI platform is revolutionizing the industry, and we're looking for exceptional engineers to join our team and drive the next wave of advancements.About the AI/ML TeamOur...


  • Santa Clara, California, United States Couchbase Full time

    Empower Modern ApplicationsCouchbase is seeking a talented Senior Software Engineer to join our AI team. As a key member of our engineering team, you will design and implement cutting-edge database and AI features and tools using the latest techniques to evolve Couchbase products and Capella service.Key ResponsibilitiesCreate the world's best distributed...


  • Santa Clara, California, United States ServiceNow Full time

    AI Security Researcher RoleAt ServiceNow, we're pushing the boundaries of AI-enhanced technology to empower organizations to find smarter, faster, and better ways to work.We're seeking a highly skilled AI Security Researcher to join our team and contribute to the development of secure, private, and reliable AI for enterprise settings.The ideal candidate will...


  • Santa Clara, California, United States Couchbase Full time

    Empower Modern ApplicationsEvery day, we tackle new and exciting challenges to empower developers to build modern cloud, mobile, and edge applications that deliver a premium user experience. Couchbase's fast, flexible, and affordable cloud database platform, Capella, enables organizations to quickly build applications that deliver premium experiences to...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Systems Software Engineer to join our TAO Toolkit Team, where you will be responsible for developing novel, scalable, and automated pipelines to make sense of petabytes of unstructured data. You will collaborate with multiple deep-learning architects and engineers to enable the development of pioneering AI models.Key Responsibilities:...


  • Santa Clara, California, United States Nvidia Full time

    NVIDIA is seeking a highly skilled and experienced engineer to join our growing team. The successful candidate will work at the intersection of GPU chip design and AI, responsible for the design, development, and maintenance of the infrastructure around Nvidia's internal large language model aimed at facilitating chip design.Key Responsibilities:Develop and...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Systems Software Engineer to join our TAO Toolkit Team at NVIDIA. Our team builds frameworks, services, algorithms, and tools that power the largest NVIDIA Multi-Modal Foundation Models and their customization.Key Responsibilities:Design, develop, and support a platform to access large datasets, integrating data from various...


  • Santa Clara, California, United States Amazon Full time

    **Job Overview**We are seeking a talented and motivated Senior AI Researcher to join our team at Amazon AGI, advancing the state of the art in AGI models and directly impacting the experience and engagement of customers interacting with Amazon's products and services.**Salary Range**$136,000 - $222,200 per year, based on geographic location and market...


  • Santa Clara, California, United States XPENG Motors Full time

    At XPeng Motors, we're pushing the boundaries of innovation in the electric vehicle industry. We're seeking a highly skilled Senior Machine Learning Engineer to join our team and contribute to the development of cutting-edge AI software systems for autonomous driving.The ideal candidate will have a strong background in deep learning algorithms and experience...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies.We conduct applied research...


  • Santa Clara, California, United States Polaris Wireless Full time

    At Polaris Wireless, we are seeking a highly skilled Senior Research Engineer to spearhead projects related to our core geo-location and analytics business.The ideal candidate will have a strong background in data analysis, estimation theory, and RF principles, with experience in developing algorithms for use in practical systems.The Senior Research Engineer...


  • Santa Clara, California, United States NVIDIA Full time

    Job DescriptionNVIDIA is seeking a highly skilled Senior Systems Software Engineer to join our TAO Toolkit Deep Learning Architectures team. As a key member of our software team, you will be responsible for developing and implementing cutting-edge deep learning algorithms and solutions.Key Responsibilities:Architect, analyze, develop, and prototype key deep...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Quality Assurance Test Development Engineer to join our team at NVIDIA. As a key member of our QA team, you will be responsible for developing and executing test plans, automating testing, and designing tools to improve productivity and optimize test plans.The ideal candidate will have a strong background in...


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryNVIDIA is seeking a highly skilled Senior Systems Software Engineer to join the ML Data Platform team. The ideal candidate will have a strong background in software engineering, data science, and machine learning.Key ResponsibilitiesDesign, develop, and support a platform to access large datasets, integrating data from various sources.Build...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior Software Engineer to join our Generalist Embodied Agent Research (GEAR) team at NVIDIA. As a key member of our team, you will be responsible for developing robust AI solutions for general-purpose humanoid robots and embodied agents.Key Responsibilities:Work with world-class researchers to develop large-scale AI training...

  • Product Manager

    4 weeks ago


    Santa Clara, California, United States Global AI Platform Corporation Full time

    About Global AI Platform CorporationGlobal AI Platform Corporation is a pioneering company in the AI industry, founded in June 2023. Headquartered in Santa Clara, California, with additional operations in Pangyo, South Korea, we are dedicated to developing cutting-edge AI technologies. Our flagship project is the Personal AI Assistant (PAA), designed to...


  • Santa Clara, California, United States Nvidia Full time

    NVIDIA Job DescriptionWe are seeking a highly skilled Senior Software QA Test Development Engineer to join our team at NVIDIA. As a key member of our QA team, you will be responsible for ensuring the highest quality of our products.Key Responsibilities:Develop and execute test plans, cases, and scripts to validate product functionalityCollaborate with...

  • Senior AI Researcher

    3 weeks ago


    Santa Clara, California, United States Softworld, a Kelly Company Full time

    Drive Innovation in MobilityAs a Senior Research Engineer at Softworld, a Kelly Company, you will lead the development of AI solutions that transform the automotive industry. With a focus on Mobility, Operational Excellence, and Value to our Customers, you will be part of a team that exemplifies ingenuity in everything we do.Key ResponsibilitiesTake charge...