GPU programming Expert
3 weeks ago
Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco.
The role will involve
-Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity
-Rethinking various part of the generative model architecture to make them more suitable for efficient inference-Integrating low-level efficient code in a high-level MLOps framework
The successful candidate will have
-High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. High expertise on the distributed computation infrastructure of current generation GPU clusters
-Overall understanding of the field of generative AI, knowledge or interest in fine-tuning and using language models for applications
-
GPU programming expert
3 days ago
San Francisco, United States Mistral Full timeMistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role will involve - writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity - rethinking various part of the generative model architecture to make them more suitable for efficient inference - integrating...
-
GPU programming Expert
1 month ago
San Francisco, United States Mistral Full timeMistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable for...
-
GPU programming Expert
22 hours ago
San Francisco, CA, United States Mistral AI Full timeMistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable for...
-
GPU programming Expert
23 hours ago
San Francisco, CA, United States mistral.ai Full timeMistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable...
-
Junior GPU Engineer
4 days ago
San Jose, United States Ampstek Full timeJunior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
Junior GPU Engineer
2 weeks ago
San Jose, United States Ampstek Full timeJunior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
Junior GPU Engineer
1 week ago
San Diego, United States Ampstek Full timeJunior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Scroll down to find the complete details of the job offer, including experience required and associated duties and tasks. Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of...
-
Junior GPU Engineer
2 weeks ago
San Jose, United States Ampstek Full timeJunior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
Junior GPU Engineer
2 weeks ago
San Jose, United States Ampstek Full timeJunior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
Junior GPU Engineer
2 weeks ago
San Jose, United States Ampstek Full timeJunior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
Expert Applications
22 hours ago
San Francisco, CA, United States Mistral AI Full timeMistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. Overall understanding of the field of generative AI,...
-
Expert Applications
23 hours ago
San Francisco, CA, United States mistral.ai Full timeMistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. Overall understanding of the field of generative AI,...
-
GPU Modeling Engineer
2 hours ago
San Jose, United States SAMSUNG Full timePosition Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
-
GPU Kernel Engineer
1 month ago
San Francisco, United States CareerBuilder Full timeJoin us to build and safely deploy aligned, superhuman AI. We are building an AI pair programmer that feels like a full colleague inside your computer - capable, conversational, and reliable across domains. As a GPU Kernel Engineer, you will design efficient implementations of novel model architectures and optimize kernels to ensure high throughput and low...
-
Senior GPU Performance Engineer
1 month ago
San Mateo, United States Zoox Full timeZoox is building the world's most advanced self-driving hardware and software solution. The efficiency demands of such a system require an expert fine-tuning of both the compute hardware architecture as well as the algorithms and middleware that runs on it to achieve maximum throughput at the most optimal power levels. The Software Core Performance team's...
-
GPU Top RTL Integration Lead
4 days ago
San Jose, United States Samsung Electronics Co., Ltd. Full timePosition Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy - the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
-
GPU Top RTL Integration Lead
5 days ago
San Jose, United States SAMSUNG Full timePosition Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
-
Sr. Sourcing Commodity Manager, GPU
23 hours ago
San Jose, United States Cisco Full timeWhat You'll Do Cisco Global Supplier Management (GSM) team is seeking a motivated Sr. Sourcing Commodity Manager for GPUs. You will be part of a highly impactful and dynamic organization collaborating with cross-functional teams and suppliers. Responsibilities include: * Primary contact and focal point to the Business Units to address all sourcing and...
-
Senior Software Engineer, Data Platform
22 hours ago
San Francisco, CA, United States CentML Full timeOur founding team is made up of experts in AI, compilers, and ML hardware and has led efforts at companies like Amazon, Google, Microsoft Research, Nvidia, Intel, Qualcomm, and IBM. Our co-founder and CEO, Gennady Pekhimenko, is a world-renowned expert in ML systems who holds multiple academic and industry research awards from Google, Amazon, Facebook, and...
-
Performance Engineer
1 month ago
San Francisco, United States Anthropic Limited Full timeRunning machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed systems. Strong candidates here will have a track record of solving large-scale...