Current jobs related to Senior LLM Performance Optimization Specialist - Santa Clara, California - NVIDIA


  • Santa Clara, California, United States NVIDIA Full time

    Job SummaryWe are seeking a skilled engineer to join our team and help shape the future of agentic inference systems. As a Senior LLM Research Engineer, you will play a critical role in improving the algorithmic performance and efficiency of large language models.Responsibilities:Research and development of contemporary research on generative AI, agents, and...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Performance Optimization EngineerWe are seeking a highly skilled Senior Performance Optimization Engineer to join our AI Applications organization at NVIDIA. As a key member of our team, you will be responsible for optimizing the performance of our distributed cloud native accelerated video analytics applications.Our team is building...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a highly skilled Senior Performance Engineer to join our team of experts in building and optimizing the tools Deep Learning engineers use worldwide to design, develop, and deploy AI applications.We are a diverse and ambitious team that influences all areas of NVIDIA's AI platform and directly contributes to premier Deep Learning frameworks...


  • Santa Clara, California, United States Apple Full time

    At Apple, we're pushing the boundaries of innovation and looking for a talented individual to join our team as a Thermal Performance Optimization Specialist.This role involves working closely with multi-functional teams to optimize mobile devices' thermal performance and participate in advanced IC packaging research and development.Key...


  • Santa Clara, California, United States Nvidia Full time

    Job SummaryNVIDIA is seeking a highly skilled Cloud AI Performance Architect to drive the performance analysis, optimization, and modeling of our AI infrastructure. As a key member of our team, you will work closely with cross-functional teams to define the architecture and design of our cloud-based AI systems.Key ResponsibilitiesDevelop benchmarks and...


  • Santa Clara, California, United States NVIDIA Full time

    Performance Engineer Job DescriptionWe are seeking a highly skilled performance engineer to join our AI Applications organization at NVIDIA. As a performance engineer, you will work with our Application teams to understand the architecture, profile, identify bottlenecks, and optimize our distributed cloud native accelerated video analytics applications.Our...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies.We conduct applied research...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior System Software Engineer to join our team and contribute to the development of the CUDA driver and runtime. As a key member of our team, you will work on optimizing the performance of our platform for accelerating general purpose computation on the GPU.Our team is responsible for analyzing performance issues,...


  • Santa Clara, California, United States NVIDIA Full time

    Job DescriptionWe are seeking a highly skilled Senior System Software Engineer to join our team and contribute to the development of the CUDA driver and runtime. As a key member of our team, you will be responsible for analyzing performance issues, investigating bottlenecks, and delivering features and improvements to enhance the performance of NVIDIA...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is seeking a highly skilled Cloud AI Infrastructure Engineer to drive the performance analysis, optimization, and modeling of NVIDIA DGXTM Cloud clusters.The ideal candidate will have a deep understanding of the methodology to conduct end-to-end performance analysis of critical AI applications running on large-scale parallel and distributed...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a senior software engineer to join our team working on the CUDA driver and runtime, core components of our platform for accelerating general purpose computation on the GPU.Our team analyzes performance of applications, investigates bottlenecks in software or hardware, and delivers features and improvements to better realize the potential of...


  • Santa Clara, California, United States Hitachi Energy Full time

    Job Summary:Hitachi Energy is seeking a Senior Optimization Engineer to join its team in San Jose, CA. This role involves delivering innovative solutions in mathematical modeling, optimization, and numerical methods.Key Responsibilities:Analyze customer engineering, business, and software requirements to propose feasible solutions.Design, develop, test,...


  • Santa Clara, California, United States Apple Full time

    We are seeking a highly motivated and experienced Thermal Performance Optimization Engineer to join our team at Apple. As a key member of our engineering team, you will be responsible for optimizing the thermal performance of our mobile devices.The ideal candidate will have a strong background in thermal analysis and modeling, as well as experience with IC...

  • Senior AI/ML Engineer

    3 weeks ago


    Santa Clara, California, United States Eightfold LLC Full time

    About Eightfold.aiWe're at the forefront of innovation in the AI-driven HR tech space, shaping the future of how organizations find, manage, and empower their talent. Our groundbreaking AI platform is revolutionizing the industry, and we're looking for exceptional engineers to join our team and drive the next wave of advancements.About the AI/ML TeamOur...


  • Santa Clara, California, United States NVIDIA Full time

    About the RoleWe are seeking a highly skilled Senior System Software Engineer to join our team and contribute to the development of the CUDA driver and runtime. As a key member of our team, you will be responsible for analyzing performance issues, investigating bottlenecks, and delivering features and improvements to enhance the performance of NVIDIA...


  • Santa Clara, California, United States Apple Full time

    At Apple, we are seeking a highly skilled Thermal Performance Optimization Engineer to join our team. As a key member of our engineering team, you will be responsible for architecting thermal solutions to address chip energy efficiency and performance scaling.**Key Responsibilities:**- Providing thermal modeling solutions to mobile devices at silicon die and...


  • Santa Clara, California, United States NVIDIA Full time

    Cloud Infrastructure Optimization SpecialistNVIDIA is a leader in computer graphics, PC gaming, and accelerated computing. We're now leveraging AI to define the next era of computing. As a Cloud Infrastructure Optimization Specialist, you'll work with internal teams to optimize cloud resources, reducing costs and improving performance.Key Responsibilities:...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Systems Software Engineer to join our TAO Toolkit Team, where you will be responsible for developing novel, scalable, and automated pipelines to make sense of petabytes of unstructured data. You will collaborate with multiple deep-learning architects and engineers to enable the development of pioneering AI models.Key Responsibilities:...


  • Santa Clara, California, United States Apple Full time

    Job SummaryWe are seeking a highly motivated and ambitious individual to join our team as a Thermal Performance Optimization Engineer. As a key member of our multi-functional team, you will be responsible for architecting thermal solutions to address chip energy efficiency and performance scaling. Your expertise in thermal modeling and analysis will be...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a Senior Systems Software Engineer to join our TAO Toolkit Team at NVIDIA. Our team builds frameworks, services, algorithms, and tools that power the largest NVIDIA Multi-Modal Foundation Models and their customization.Key Responsibilities:Design, develop, and support a platform to access large datasets, integrating data from various...

Senior LLM Performance Optimization Specialist

2 months ago


Santa Clara, California, United States NVIDIA Full time
About the Role

We are seeking a highly skilled Senior LLM Performance Engineer to join our team at NVIDIA. As a key member of our Deep Learning Architecture team, you will play a critical role in optimizing the performance of Large Language Models (LLMs) on state-of-the-art hardware and software platforms.

Key Responsibilities
  • Understand and analyze the performance of LLMs on various hardware and software platforms.
  • Develop and implement production-quality software to optimize LLM performance.
  • Collaborate with cross-functional teams to identify and prioritize performance optimization opportunities.
  • Design and implement tools to automate workload analysis and optimization.
Requirements
  • PhD (or equivalent experience) in Computer Science, Electrical Engineering, or related field, and 5+ years of relevant work experience, or MS and 8+ years of relevant work experience.
  • Strong background in deep learning and neural networks, with a focus on training and large language models.
  • Deep understanding of computer architecture and familiarity with GPU architecture.
  • Proven experience analyzing and tuning application performance, preferably on GPUs.
  • Familiarity with common deep learning software packages like PyTorch and JAX.
  • Proven experience with processor and system-level performance modeling.
  • Programming skills in C++, Python, and CUDA.
About NVIDIA

NVIDIA is a leader in the field of artificial intelligence and deep learning. We are committed to fostering a diverse and inclusive work environment and are proud to be an equal opportunity employer. If you are passionate about performance and interested in working on industry-leading Deep Learning products, we encourage you to apply for this exciting opportunity.