GPU Kernel Engineer

1 month ago


San Francisco, United States CareerBuilder Full time

Join us to build and safely deploy aligned, superhuman AI. We are building an AI pair programmer that feels like a full colleague inside your computer - capable, conversational, and reliable across domains.
As a GPU Kernel Engineer, you will design efficient implementations of novel model architectures and optimize kernels to ensure high throughput and low latency during training and inference.
Responsibilities
Write efficient custom kernels for training and inference in CUDA/CuTe/CUTLASS

Optimize inference for our novel architectures, both by writing more efficient code and by finding places where we can trade accuracy for speed

Understand and optimize for H100 GPUs

Think beyond the kernel level to the broader scheme of how we train these models and suggest improvements

Requirements
Understanding of and hands-on experience with GPU programming, ideally on matmul-heavy workloads

Magic's culture
Integrity.

Words and actions should be aligned.

Hands-on.

Most of us have previously led engineering teams. At Magic, there are no managers. We all spend the vast majority of our time on engineering. If you want to solve hard problems, Magic is the right place for you.

Teamwork.

We move as one team, not N individuals.

Focus.

Ethically deploy AGI. Everything else is noise.

Quality.

We have high standards for ourselves and our products. Magic should feel like magic.

Benefits and perks
Benchmark-based compensation in the 75th or 90th percentile, including base salary, generous equity, and benefits

401(k) with 6% match

Flexible working hours

In-person (SF or Vienna) or remote

A small, fast-paced, highly focused team

FAQ:
What's your motivation?
Automation has led humanity from subsistence farming to becoming a globally connected society. AGI is the ultimate chapter of the story of human tool-building, presenting the potential to decouple productivity and ingenuity from human labor. What if the last 50 years of technological progress happened in 2 days? We want to make this a possibility.
Funding?
We've recently raised $28M.
How do we balance deploying the technology today with ambitions for AGI?
We think deploying AI within the right interfaces is just as important as the technology itself. Building an AI pair programmer helps us do both at the same time. We aim to launch gradually improving AI assistants while pursuing work on what will ultimately become AGI.
Do you train your own models?
Yes
Do you care about the product?
It's funny that this is a question, but many AI companies neglect UX and focus only on their model. Yes, we care.
Can I work from anywhere?
We welcome applications from anyone around the world. We'll look at visa requirements case by case.
I don't meet all the criteria, should I still apply?
If you feel you have something to contribute to the mission and you're a high-energy person, absolutely. We make exceptions for exceptional people. In all hires, we are looking for either 1) difference makers on world-class teams or 2) individuals who would quickly become difference makers if placed on such a team tomorrow.




  • San Francisco, United States Mistral Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for efficient inference; integrating...


  • San Mateo, United States Zoox Full time

    Zoox is building the world's most advanced self-driving hardware and software solution. The efficiency demands of such a system require expert fine-tuning of both the compute hardware architecture and the algorithms and middleware that run on it to achieve maximum throughput at optimal power levels. The Software Core Performance team's...


  • San Francisco, United States Mistral Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for...


  • San Francisco, United States Mistral AI Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for...


  • San Francisco, CA, United States Mistral AI Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for...


  • San Francisco, CA, United States mistral.ai Full time

    Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable...


  • San Francisco, United States Loft Orbital Full time

    Embedded Software Engineer - Driver/Kernel/Linux. San Francisco / Engineering / Full time. About Loft Orbital: Loft Orbital was founded in 2017 and is headquartered in San Francisco, California with offices in Boulder, Colorado and Toulouse, France. Our mission is to make space simple for our customers: we operate microsatellites and fly customer...


  • San Francisco, United States META Full time

    Summary: In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. MTIA Software Team has been developing a comprehensive AI Compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined...


  • San Francisco, CA, United States CentML Full time

    Overview: Do you want to help drive the development of high-performance, power-efficient datacenter solutions for Deep Learning? Do you have an interest in how system architecture across GPU, networking, CPU and IO relate to brand new generative AI capabilities? Come join our team, and bring your experience and interests to help us optimize our next...

  • Kernel Manager

    7 days ago


    San Francisco, United States Agoric Full time

    Agoric is an open-source software development company bringing better security and composability to the decentralized financial infrastructure of today. Agoric is built on a JavaScript library of reusable, composable components coded by experienced community members. Our secure JavaScript smart contract platform allows...

  • Junior GPU Engineer

    6 days ago


    San Jose, United States Ampstek Full time

    Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...

  • Performance Engineer

    1 month ago


    San Francisco, United States Anthropic Limited Full time

    Running machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed systems. Strong candidates here will have a track record of solving large-scale...

  • Junior GPU Engineer

    2 weeks ago


    San Jose, United States Ampstek Full time

    Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp: 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...

  • Junior GPU Engineer

    2 weeks ago


    San Diego, United States Ampstek Full time

    Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of...

  • Junior GPU Engineer

    2 weeks ago


    San Jose, United States Ampstek Full time

    Junior GPU Engineer Location: San Jose, CA (Onsite) Min exp : 7+ Years Job Description. We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...


  • San Francisco, CA, United States Gensyn Full time

    Machine learning models are driving our cars, testing our eyesight, detecting our cancer, giving sight to the blind, giving speech to the mute, and dictating what we consume, enjoy, and think. Soon, we'll conjure unlimited content: from never-ending TV series (where we’re the main character) to personalised tutors that are infinitely...


  • San Francisco, United States Targeted Talent Full time

    Senior Neural Network Kernel Software Development Engineer. Our client is making substantial investments in software to enhance the seamless deployment of neural networks on their hardware, streamlining the experience for researchers and developers. The focus involves the optimization of various common neural networks for optimal...


  • San Jose, United States SAMSUNG Full time

    Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...