Current jobs related to Senior LLM Performance Optimization Specialist - Santa Clara, California - NVIDIA
-
Senior LLM Research Engineer
1 week ago
Santa Clara, California, United States NVIDIA Full timeJob SummaryWe are seeking a skilled engineer to join our team and help shape the future of agentic inference systems. As a Senior LLM Research Engineer, you will play a critical role in improving the algorithmic performance and efficiency of large language models.Responsibilities:Research and development of contemporary research on generative AI, agents, and...
-
Senior Performance Optimization Engineer
2 months ago
Santa Clara, California, United States NVIDIA Full timeJob Title: Senior Performance Optimization EngineerWe are seeking a highly skilled Senior Performance Optimization Engineer to join our AI Applications organization at NVIDIA. As a key member of our team, you will be responsible for optimizing the performance of our distributed cloud native accelerated video analytics applications.Our team is building...
-
Senior Performance Optimization Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking a highly skilled Senior Performance Engineer to join our team of experts in building and optimizing the tools Deep Learning engineers use worldwide to design, develop, and deploy AI applications.We are a diverse and ambitious team that influences all areas of NVIDIA's AI platform and directly contributes to premier Deep Learning frameworks...
-
Thermal Performance Optimization Specialist
2 weeks ago
Santa Clara, California, United States Apple Full timeAt Apple, we're pushing the boundaries of innovation and looking for a talented individual to join our team as a Thermal Performance Optimization Specialist.This role involves working closely with multi-functional teams to optimize mobile devices' thermal performance and participate in advanced IC packaging research and development.Key...
-
Senior Cloud Performance Architect
3 weeks ago
Santa Clara, California, United States Nvidia Full timeJob SummaryNVIDIA is seeking a highly skilled Cloud AI Performance Architect to drive the performance analysis, optimization, and modeling of our AI infrastructure. As a key member of our team, you will work closely with cross-functional teams to define the architecture and design of our cloud-based AI systems.Key ResponsibilitiesDevelop benchmarks and...
-
Senior Performance Optimization Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timePerformance Engineer Job DescriptionWe are seeking a highly skilled performance engineer to join our AI Applications organization at NVIDIA. As a performance engineer, you will work with our Application teams to understand the architecture, profile, identify bottlenecks, and optimize our distributed cloud native accelerated video analytics applications.Our...
-
Santa Clara, California, United States NVIDIA Full timeNVIDIA is a leader in the generative AI revolution, and our Algorithmic Model Optimization Team is at the forefront of optimizing generative AI models for maximal inference efficiency. Our team focuses on techniques ranging from neural architecture search and pruning to sparsity, quantization, and automated deployment strategies.We conduct applied research...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a highly skilled Senior System Software Engineer to join our team and contribute to the development of the CUDA driver and runtime. As a key member of our team, you will work on optimizing the performance of our platform for accelerating general purpose computation on the GPU.Our team is responsible for analyzing performance issues,...
-
Santa Clara, California, United States NVIDIA Full timeJob DescriptionWe are seeking a highly skilled Senior System Software Engineer to join our team and contribute to the development of the CUDA driver and runtime. As a key member of our team, you will be responsible for analyzing performance issues, investigating bottlenecks, and delivering features and improvements to enhance the performance of NVIDIA...
-
Senior Cloud Performance Architect
2 weeks ago
Santa Clara, California, United States NVIDIA Full timeNVIDIA is seeking a highly skilled Cloud AI Infrastructure Engineer to drive the performance analysis, optimization, and modeling of NVIDIA DGXTM Cloud clusters.The ideal candidate will have a deep understanding of the methodology to conduct end-to-end performance analysis of critical AI applications running on large-scale parallel and distributed...
-
Santa Clara, California, United States NVIDIA Full timeWe are seeking a senior software engineer to join our team working on the CUDA driver and runtime, core components of our platform for accelerating general purpose computation on the GPU.Our team analyzes performance of applications, investigates bottlenecks in software or hardware, and delivers features and improvements to better realize the potential of...
-
Senior Optimization Engineer
2 weeks ago
Santa Clara, California, United States Hitachi Energy Full timeJob Summary:Hitachi Energy is seeking a Senior Optimization Engineer to join its team in San Jose, CA. This role involves delivering innovative solutions in mathematical modeling, optimization, and numerical methods.Key Responsibilities:Analyze customer engineering, business, and software requirements to propose feasible solutions.Design, develop, test,...
-
Thermal Performance Optimization Engineer
3 weeks ago
Santa Clara, California, United States Apple Full timeWe are seeking a highly motivated and experienced Thermal Performance Optimization Engineer to join our team at Apple. As a key member of our engineering team, you will be responsible for optimizing the thermal performance of our mobile devices.The ideal candidate will have a strong background in thermal analysis and modeling, as well as experience with IC...
-
Senior AI/ML Engineer
3 weeks ago
Santa Clara, California, United States Eightfold LLC Full timeAbout Eightfold.aiWe're at the forefront of innovation in the AI-driven HR tech space, shaping the future of how organizations find, manage, and empower their talent. Our groundbreaking AI platform is revolutionizing the industry, and we're looking for exceptional engineers to join our team and drive the next wave of advancements.About the AI/ML TeamOur...
-
Santa Clara, California, United States NVIDIA Full timeAbout the RoleWe are seeking a highly skilled Senior System Software Engineer to join our team and contribute to the development of the CUDA driver and runtime. As a key member of our team, you will be responsible for analyzing performance issues, investigating bottlenecks, and delivering features and improvements to enhance the performance of NVIDIA...
-
Thermal Performance Optimization Engineer
2 weeks ago
Santa Clara, California, United States Apple Full timeAt Apple, we are seeking a highly skilled Thermal Performance Optimization Engineer to join our team. As a key member of our engineering team, you will be responsible for architecting thermal solutions to address chip energy efficiency and performance scaling.**Key Responsibilities:**- Providing thermal modeling solutions to mobile devices at silicon die and...
-
Cloud Infrastructure Optimization Specialist
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeCloud Infrastructure Optimization SpecialistNVIDIA is a leader in computer graphics, PC gaming, and accelerated computing. We're now leveraging AI to define the next era of computing. As a Cloud Infrastructure Optimization Specialist, you'll work with internal teams to optimize cloud resources, reducing costs and improving performance.Key Responsibilities:...
-
Senior Systems Software Engineer
3 weeks ago
Santa Clara, California, United States NVIDIA Full timeWe are seeking a Senior Systems Software Engineer to join our TAO Toolkit Team, where you will be responsible for developing novel, scalable, and automated pipelines to make sense of petabytes of unstructured data. You will collaborate with multiple deep-learning architects and engineers to enable the development of pioneering AI models.Key Responsibilities:...
-
Thermal Performance Optimization Engineer
4 weeks ago
Santa Clara, California, United States Apple Full timeJob SummaryWe are seeking a highly motivated and ambitious individual to join our team as a Thermal Performance Optimization Engineer. As a key member of our multi-functional team, you will be responsible for architecting thermal solutions to address chip energy efficiency and performance scaling. Your expertise in thermal modeling and analysis will be...
-
Senior Systems Software Engineer
2 weeks ago
Santa Clara, California, United States NVIDIA Full timeWe are seeking a Senior Systems Software Engineer to join our TAO Toolkit Team at NVIDIA. Our team builds frameworks, services, algorithms, and tools that power the largest NVIDIA Multi-Modal Foundation Models and their customization.Key Responsibilities:Design, develop, and support a platform to access large datasets, integrating data from various...
Senior LLM Performance Optimization Specialist
2 months ago
We are seeking a highly skilled Senior LLM Performance Engineer to join our team at NVIDIA. As a key member of our Deep Learning Architecture team, you will play a critical role in optimizing the performance of Large Language Models (LLMs) on state-of-the-art hardware and software platforms.
Key Responsibilities- Understand and analyze the performance of LLMs on various hardware and software platforms.
- Develop and implement production-quality software to optimize LLM performance.
- Collaborate with cross-functional teams to identify and prioritize performance optimization opportunities.
- Design and implement tools to automate workload analysis and optimization.
- PhD (or equivalent experience) in Computer Science, Electrical Engineering, or related field, and 5+ years of relevant work experience, or MS and 8+ years of relevant work experience.
- Strong background in deep learning and neural networks, with a focus on training and large language models.
- Deep understanding of computer architecture and familiarity with GPU architecture.
- Proven experience analyzing and tuning application performance, preferably on GPUs.
- Familiarity with common deep learning software packages like PyTorch and JAX.
- Proven experience with processor and system-level performance modeling.
- Programming skills in C++, Python, and CUDA.
NVIDIA is a leader in the field of artificial intelligence and deep learning. We are committed to fostering a diverse and inclusive work environment and are proud to be an equal opportunity employer. If you are passionate about performance and interested in working on industry-leading Deep Learning products, we encourage you to apply for this exciting opportunity.