No more applications are being accepted for this job

GPU programming Expert

3 weeks ago

San Francisco, United States Mistral AI Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco.

The role will involve

-Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity

-Rethinking various part of the generative model architecture to make them more suitable for efficient inference-Integrating low-level efficient code in a high-level MLOps framework

The successful candidate will have

-High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. High expertise on the distributed computation infrastructure of current generation GPU clusters

-Overall understanding of the field of generative AI, knowledge or interest in fine-tuning and using language models for applications

GPU programming expert

3 days ago

San Francisco, United States Mistral Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role will involve - writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity - rethinking various part of the generative model architecture to make them more suitable for efficient inference - integrating...
GPU programming Expert

1 month ago

San Francisco, United States Mistral Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable for...
GPU programming Expert

22 hours ago

San Francisco, CA, United States Mistral AI Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable for...
GPU programming Expert

23 hours ago

San Francisco, CA, United States mistral.ai Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable...
Junior GPU Engineer

4 days ago

San Jose, United States Ampstek Full time

Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
Junior GPU Engineer

2 weeks ago

San Jose, United States Ampstek Full time

Junior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
Junior GPU Engineer

1 week ago

San Diego, United States Ampstek Full time

Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Scroll down to find the complete details of the job offer, including experience required and associated duties and tasks. Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of...
Junior GPU Engineer

2 weeks ago

San Jose, United States Ampstek Full time

Junior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
Junior GPU Engineer

2 weeks ago

San Jose, United States Ampstek Full time

Junior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
Junior GPU Engineer

2 weeks ago

San Jose, United States Ampstek Full time

Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
Expert Applications

22 hours ago

San Francisco, CA, United States Mistral AI Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. Overall understanding of the field of generative AI,...
Expert Applications

23 hours ago

San Francisco, CA, United States mistral.ai Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. Overall understanding of the field of generative AI,...
GPU Modeling Engineer

2 hours ago

San Jose, United States SAMSUNG Full time

Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
GPU Kernel Engineer

1 month ago

San Francisco, United States CareerBuilder Full time

Join us to build and safely deploy aligned, superhuman AI. We are building an AI pair programmer that feels like a full colleague inside your computer - capable, conversational, and reliable across domains. As a GPU Kernel Engineer, you will design efficient implementations of novel model architectures and optimize kernels to ensure high throughput and low...
Senior GPU Performance Engineer

1 month ago

San Mateo, United States Zoox Full time

Zoox is building the world's most advanced self-driving hardware and software solution. The efficiency demands of such a system require an expert fine-tuning of both the compute hardware architecture as well as the algorithms and middleware that runs on it to achieve maximum throughput at the most optimal power levels. The Software Core Performance team's...
GPU Top RTL Integration Lead

4 days ago

San Jose, United States Samsung Electronics Co., Ltd. Full time

Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy - the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
GPU Top RTL Integration Lead

5 days ago

San Jose, United States SAMSUNG Full time

Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
Sr. Sourcing Commodity Manager, GPU

23 hours ago

San Jose, United States Cisco Full time

What You'll Do Cisco Global Supplier Management (GSM) team is seeking a motivated Sr. Sourcing Commodity Manager for GPUs. You will be part of a highly impactful and dynamic organization collaborating with cross-functional teams and suppliers. Responsibilities include: * Primary contact and focal point to the Business Units to address all sourcing and...
Senior Software Engineer, Data Platform

22 hours ago

San Francisco, CA, United States CentML Full time

Our founding team is made up of experts in AI, compilers, and ML hardware and has led efforts at companies like Amazon, Google, Microsoft Research, Nvidia, Intel, Qualcomm, and IBM. Our co-founder and CEO, Gennady Pekhimenko, is a world-renowned expert in ML systems who holds multiple academic and industry research awards from Google, Amazon, Facebook, and...
Performance Engineer

1 month ago

San Francisco, United States Anthropic Limited Full time

Running machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed systems. Strong candidates here will have a track record of solving large-scale...

Americas

Europe

Asia / Oceania

Africa

GPU programming Expert