GPU programming Expert

3 weeks ago


San Francisco, United States Mistral AI Full time

Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco.

The role will involve

-Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity

-Rethinking various part of the generative model architecture to make them more suitable for efficient inference-Integrating low-level efficient code in a high-level MLOps framework

The successful candidate will have

-High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. High expertise on the distributed computation infrastructure of current generation GPU clusters

-Overall understanding of the field of generative AI, knowledge or interest in fine-tuning and using language models for applications



  • San Francisco, United States Mistral Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role will involve - writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity - rethinking various part of the generative model architecture to make them more suitable for efficient inference - integrating...


  • San Francisco, United States Mistral Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable for...

  • GPU programming Expert

    22 hours ago


    San Francisco, CA, United States Mistral AI Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable for...

  • GPU programming Expert

    23 hours ago


    San Francisco, CA, United States mistral.ai Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve -Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -Rethinking various part of the generative model architecture to make them more suitable...

  • Junior GPU Engineer

    4 days ago


    San Jose, United States Ampstek Full time

    Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...

  • Junior GPU Engineer

    2 weeks ago


    San Jose, United States Ampstek Full time

    Junior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...

  • Junior GPU Engineer

    1 week ago


    San Diego, United States Ampstek Full time

    Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Scroll down to find the complete details of the job offer, including experience required and associated duties and tasks. Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of...

  • Junior GPU Engineer

    2 weeks ago


    San Jose, United States Ampstek Full time

    Junior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...

  • Junior GPU Engineer

    2 weeks ago


    San Jose, United States Ampstek Full time

    Junior GPU EngineerLocation: San Jose, CA (Onsite)Min exp : 7+ YearsJob Description.We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...

  • Junior GPU Engineer

    2 weeks ago


    San Jose, United States Ampstek Full time

    Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...

  • Expert Applications

    22 hours ago


    San Francisco, CA, United States Mistral AI Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. Overall understanding of the field of generative AI,...

  • Expert Applications

    23 hours ago


    San Francisco, CA, United States mistral.ai Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. Writing low-level code to take all advantage of high-end GPUs (H100) and max out their capacity -High technical competence for writing custom CUDA kernels and pushing GPUs to their limits. Overall understanding of the field of generative AI,...

  • GPU Modeling Engineer

    2 hours ago


    San Jose, United States SAMSUNG Full time

    Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...

  • GPU Kernel Engineer

    1 month ago


    San Francisco, United States CareerBuilder Full time

    Join us to build and safely deploy aligned, superhuman AI. We are building an AI pair programmer that feels like a full colleague inside your computer - capable, conversational, and reliable across domains. As a GPU Kernel Engineer, you will design efficient implementations of novel model architectures and optimize kernels to ensure high throughput and low...


  • San Mateo, United States Zoox Full time

    Zoox is building the world's most advanced self-driving hardware and software solution. The efficiency demands of such a system require an expert fine-tuning of both the compute hardware architecture as well as the algorithms and middleware that runs on it to achieve maximum throughput at the most optimal power levels. The Software Core Performance team's...


  • San Jose, United States Samsung Electronics Co., Ltd. Full time

    Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy - the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...


  • San Jose, United States SAMSUNG Full time

    Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...


  • San Jose, United States Cisco Full time

    What You'll Do Cisco Global Supplier Management (GSM) team is seeking a motivated Sr. Sourcing Commodity Manager for GPUs. You will be part of a highly impactful and dynamic organization collaborating with cross-functional teams and suppliers. Responsibilities include: * Primary contact and focal point to the Business Units to address all sourcing and...


  • San Francisco, CA, United States CentML Full time

    Our founding team is made up of experts in AI, compilers, and ML hardware and has led efforts at companies like Amazon, Google, Microsoft Research, Nvidia, Intel, Qualcomm, and IBM. Our co-founder and CEO, Gennady Pekhimenko, is a world-renowned expert in ML systems who holds multiple academic and industry research awards from Google, Amazon, Facebook, and...

  • Performance Engineer

    1 month ago


    San Francisco, United States Anthropic Limited Full time

    Running machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed systems. Strong candidates here will have a track record of solving large-scale...