Systems/GPU Research Engineer

4 days ago

San Francisco, CA, United States Vast Full time

About Us

Vast.ai's cloud powers AI projects and businesses all over the world. We are democratizing and decentralizing AI computing-reshaping our future for the benefit of humanity.

We are a small, growing, and highly motivated team dedicated to an ambitious technical plan. We operate with a flat mobile organizational structure where all contribute directly to the company's mission. Leadership is earned by those who show initiative and deliver excellence.

We seek engineers/researchers with strong intrinsic drive, a true passion for advancing the state of the art, and a mix of excellent research, coding, and communication skills.

LOCATION: On-site at our office in San Francisco or Westwood, Los Angeles.
About the Role

As a systems/GPU engineer, you will play a crucial role in developing new kernels and algorithms that can improve inference for AI models. You will help develop new high-performance tensor libraries and auto-optimization tools. Collaborating directly with our technical founder and diverse team, you will enhance the performance and efficiency of our AI systems. Your ability to research and stay on top of cutting-edge papers will be vital in staying up-to-date with the latest advancements in AI model inference and GPU programming techniques.

Full-Time
On-site at either our SF or LA offices

Tech Stack

CUDA/C++, GPGPU, Python, Linux
Ideal Experience

Expertise in systems engineering across the tech stack
Deep understanding of GPU architectures
Strong holistic background in neural network performance and tooling
Published research at top AI conferences

Key Responsibilities

Develop or extend parallel generic GPU libraries and kernels
Help design and deploy market-based resource management systems
Quickly investigate and summarize options for new system architectures
Prototype and evaluate novel state-of-the-art methods/models
Investigate and learn new frameworks and tools

Interview Process

After submitting your application, our technical team reviews your credentials. If selected, you'll proceed through the following stages:

Initial screening (virtual, 15 minutes)
Quick dive into Vast, systems and architectures (virtual, 30 minutes)
LLM-assisted coding assessment (virtual, 1 hour)
Meet and greet with coding assessment (on-site, 2 hours)

Our goal is to complete the interview process in two weeks.
Annual Salary Range

$160,000 - $320,000 + equity + benefits

Vast.ai is hiring across all experience levels with compensation commensurate with background, experience and potential.
Benefits

Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders/tech leaders
Ambitious, fast-paced startup culture where initiative is rewarded

Systems Research Engineer, GPU Programming

19 hours ago

San Francisco, CA, United States Together AI Full time

About the Role As a Systems Research Engineer specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI...
Performance Engineer

1 week ago

San Francisco, CA, United States Anthropic Full time

About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role:...
Performance Engineer

1 week ago

San Francisco, CA, United States Anthropic Full time

About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role:...
Performance Engineer, GPU

2 days ago

San Francisco, CA, United States Anthropic Full time

About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role:...
GPU Research Engineer

2 weeks ago

San Diego, CA, United States Qualcomm Full time

Company: Qualcomm Technologies, Inc. Job Area: Engineering Group, Engineering Group > GPU ASICS Engineering General Summary: As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm GPU...
GPU Performance Engineer

4 days ago

San Francisco, CA, United States Genmo Full time

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation. We're seeking a GPU Performance Engineer to squeeze every last FLOP from our H100 infrastructure and optimize our...
GPU Performance Engineer

3 days ago

San Francisco, CA, United States Genmo Full time

We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation. We're seeking a GPU Performance Engineer to squeeze every last FLOP from our H100 infrastructure and optimize our...
Research Engineer, Infrastructure, Training Systems

2 weeks ago

San Francisco, CA, United States Thinking Machines Lab Full time

Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals. We are scientists, engineers, and builders who've created some of the most widely used AI products, including ChatGPT and...
Research Engineer, Infrastructure, Training Systems

1 week ago

San Francisco, CA, United States Thinking Machines Lab Full time

Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals. We are scientists, engineers, and builders who've created some of the most widely used AI products, including ChatGPT and...
Research Engineer, Infrastructure, Training Systems

1 week ago

San Francisco, CA, United States Thinking Machines Lab Full time

Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals. We are scientists, engineers, and builders who've created some of the most widely used AI products, including ChatGPT and...

Americas

Europe

Asia / Oceania

Africa

Systems/GPU Research Engineer