GPU Kernel Engineer
1 month ago
Join us to build and safely deploy aligned, superhuman AI. We are building an AI pair programmer that feels like a full colleague inside your computer - capable, conversational, and reliable across domains.
As a GPU Kernel Engineer, you will design efficient implementations of novel model architectures and optimize kernels to ensure high throughput and low latency during training and inference.
Responsibilities
Write efficient custom kernels for training and inference in CUDA/CuTe/Cutlass
Optimize inference for our novel architectures, both by writing more efficient code and by identifying where accuracy can be traded for speed
Understand and optimize for H100 GPUs
Think beyond the kernel level to the broader scheme of how we train these models and suggest improvements
Requirements
Understanding of, and hands-on experience with, GPU programming, ideally on matmul-heavy workloads
Magic's culture
Integrity.
Words and actions should be aligned.
Hands-on.
Most of us have previously led engineering teams. At Magic, there are no managers. We all spend the vast majority of our time on engineering. If you want to solve hard problems, Magic is the right place for you.
Teamwork.
We move as one team, not N individuals.
Focus.
Ethically deploy AGI. Everything else is noise.
Quality.
We have high standards for ourselves and our products. Magic should feel like magic.
Benefits and perks
Benchmark-based compensation in the 75th or 90th percentile, including base salary, generous equity, and benefits
401(k) with 6% match
Flexible working hours
In-person (SF or Vienna) or remote
A small, fast-paced, highly focused team
FAQ:
What's your motivation?
Automation has led humanity from subsistence farming to becoming a globally connected society. AGI is the ultimate chapter of the story of human tool-building, presenting the potential to decouple productivity and ingenuity from human labor. What if the last 50 years of technological progress happened in 2 days? We want to make this a possibility.
Funding?
We've recently raised $28M.
How do we balance deploying the technology today with ambitions for AGI?
We think deploying AI within the right interfaces is just as important as the technology itself. Building an AI pair programmer helps us do both at the same time. We aim to launch gradually improving AI assistants while pursuing work on what will ultimately become AGI.
Do you train your own models?
Yes.
Do you care about the product?
It's funny that this is a question, but many AI companies neglect UX and focus only on their model. Yes, we care.
Can I work from anywhere?
We welcome applications from anyone around the world. We'll look at visa requirements case by case.
I don't meet all the criteria. Should I still apply?
If you feel you have something to contribute to the mission and you're a high-energy person, absolutely. We make exceptions for exceptional people. In all hires, we are looking for either 1) difference makers on world-class teams or 2) individuals who would quickly become difference makers if placed on such a team tomorrow.
-
GPU programming expert
5 days ago
San Francisco, United States Mistral Full time
Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for efficient inference; integrating...
-
Senior GPU Performance Engineer
1 month ago
San Mateo, United States Zoox Full time
Zoox is building the world's most advanced self-driving hardware and software solution. The efficiency demands of such a system require expert fine-tuning of both the compute hardware architecture and the algorithms and middleware that run on it to achieve maximum throughput at optimal power levels. The Software Core Performance team's...
-
GPU programming Expert
1 month ago
San Francisco, United States Mistral Full time
Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for...
-
GPU programming Expert
1 month ago
San Francisco, United States Mistral AI Full time
Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for...
-
GPU programming Expert
3 days ago
San Francisco, CA, United States Mistral AI Full time
Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable for...
-
GPU programming Expert
3 days ago
San Francisco, CA, United States mistral.ai Full time
Mistral AI is hiring an expert in the role of serving and training large language models at high speed on GPUs. The role is based in San Francisco. The role will involve: writing low-level code to take full advantage of high-end GPUs (H100) and max out their capacity; rethinking various parts of the generative model architecture to make them more suitable...
-
Embedded Software Engineer
3 weeks ago
San Francisco, United States Loft Orbital Full time
Embedded Software Engineer - Driver/Kernel/Linux. San Francisco / Engineering / Full time.
About Loft Orbital: Loft Orbital was founded in 2017 and is headquartered in San Francisco, California with offices in Boulder, Colorado and Toulouse, France. Our mission is to make space simple for our customers: we operate microsatellites and fly customer...
-
Software Engineer, Systems ML
6 days ago
San Francisco, United States META Full time
Summary: In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team and part of the bigger industry-leading PyTorch AI framework organization. MTIA Software Team has been developing a comprehensive AI Compiler strategy that delivers a highly flexible platform to train & serve new DL/ML model architectures, combined...
-
Senior Software Engineer
3 days ago
San Francisco, CA, United States CentML Full time
Overview: Do you want to help drive the development of high-performance, power-efficient datacenter solutions for Deep Learning? Do you have an interest in how system architecture across GPU, networking, CPU and IO relate to brand new generative AI capabilities? Come join our team, and bring your experience and interests to help us optimize our next...
-
Kernel Manager
7 days ago
San Francisco, United States Agoric Full time
Agoric is an open-source software development company bringing better security and composability to the decentralized financial infrastructure of today. Agoric is built on a JavaScript library of reusable, composable components coded by experienced community members. Our secure JavaScript smart contract platform allows...
-
Junior GPU Engineer
6 days ago
San Jose, United States Ampstek Full time
Junior GPU Engineer. Location: San Jose, CA (Onsite). Min exp: 7+ years.
Job Description: We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
Performance Engineer
1 month ago
San Francisco, United States Anthropic Limited Full time
Running machine learning (ML) algorithms at our scale often requires solving novel systems problems. As a Performance Engineer, you'll be responsible for identifying these problems, and then developing systems that optimize the throughput and robustness of our largest distributed systems. Strong candidates here will have a track record of solving large-scale...
-
Junior GPU Engineer
2 weeks ago
San Jose, United States Ampstek Full time
Junior GPU Engineer. Location: San Jose, CA (Onsite). Min exp: 7+ years.
Job Description: We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
Junior GPU Engineer
2 weeks ago
San Diego, United States Ampstek Full time
Junior GPU Engineer. Location: San Jose, CA (Onsite). Min exp: 7+ years.
Job Description: We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of...
-
Junior GPU Engineer
2 weeks ago
San Jose, United States Ampstek Full time
Junior GPU Engineer. Location: San Jose, CA (Onsite). Min exp: 7+ years.
Job Description: We are seeking a highly motivated and skilled GPU Resource Specialist to join our team. The ideal candidate will have a strong understanding of graphics processing units (GPUs), their architecture, and their applications across various industries. The Junior GPU Resource...
-
San Francisco, CA, United States Gensyn Full time
Machine learning models are driving our cars, testing our eyesight, detecting our cancer, giving sight to the blind, giving speech to the mute, and dictating what we consume, enjoy, and think. Soon, we'll conjure unlimited content: from never-ending TV series (where we’re the main character) to personalised tutors that are infinitely...
-
San Francisco, United States Targeted Talent Full time
Senior Neural Network Kernel Software Development Engineer
Our client is making substantial investments in software to enhance the seamless deployment of neural networks on their hardware, streamlining the experience for researchers and developers. The focus involves the optimization of various common neural networks for optimal...
-
GPU Modeling Engineer
2 days ago
San Jose, United States SAMSUNG Full time
Position Summary: Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...