Optimization Expert for AI Model Performance

2 days ago

San Jose, California, United States Adobe Inc. Full time

About the Opportunity

We are seeking a highly skilled Optimization Expert to join our team at Adobe Inc. in a strategic and visible role that applies GPU optimization skills towards improving the training efficiency and performance of our commercially safe AI models.

The Firefly family of creative generative AI models is revolutionizing the way we conceptualize, build, and scale content. As an Optimization Expert, you will be working on optimizing model efficiency for Hopper/Blackwell GPU architectures, leveraging FP8 to accelerate training and inference, and writing high-quality, product-level code that is easy to maintain and test.

Key Responsibilities

Collaborate with model architecture teams to co-design hardware-aware models
Develop efficient kernels for forward and backward passes in CUDA, CUTLASS/CuTe, and Triton
Create optimized custom layers using PyTorch
Use profiling tools such as Nsight and Kineto
Work on other related tasks as needed

Requirements

Bachelor's, Master's, or Ph.D. in Computer Science, Computer Engineering, or a related field, and 5+ years of relevant experience
Proficiency in Linux, Docker
Strong understanding of modern transformer-based model architectures
Expertise in Python, PyTorch, CUDA, Triton, CUTLASS/CuTe, and C++
Familiarity with distributed training fundamentals

What We Offer

We offer an estimated annual salary range of $170,900-$325,200, depending on location and job-related knowledge, skills, and experience. Our compensation reflects the cost of labor across several U.S. geographic markets. We are proud to be an Equal Employment Opportunity and affirmative action employer, committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.

Deep Learning and AI Model Optimization Expert

4 days ago

San Jose, California, United States Syntricate Technologies Full time

Role Summary: We're seeking a Deep Learning and AI Model Optimization Expert to join our team at Syntricate Technologies. As a key member of our R&D department, you will be responsible for exploring and documenting how to port cutting-edge AI models to AMD's devices. Your expertise in AI frameworks like ONNX, PyTorch, or TensorFlow will be crucial in...
Lead AI Model Optimization Expert

3 weeks ago

San Jose, California, United States Adobe Full time

About Us: At Adobe, we're passionate about empowering people to create and deliver exceptional digital experiences. Our company is committed to creating an inclusive environment where everyone has access to equal opportunity. The Role: We're seeking a highly skilled Lead AI Model Optimization Expert to join our team. This role involves applying GPU optimization...
AI Performance Optimization Expert

4 days ago

San Francisco, California, United States ZipRecruiter Full time

Job Title: AI Performance Optimization Expert. Company Overview: We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI.
AI Model Training and Deployment Expert

3 days ago

San Jose, California, United States TikTok Full time

About the Role: The AI Model Training and Deployment Expert will design, architect, and implement backend systems to deploy generative AI models for image and video generation use cases. Responsibilities: Design and implement highly efficient engineering systems for generative AI tasks. Optimize the performance of generative AI model training and serving. Build...
High-Performance GPU Optimization Expert

4 weeks ago

San Jose, California, United States Adobe Inc. Full time

About Adobe Inc.: At Adobe, we're passionate about empowering creatives to push the boundaries of what's possible. With a legacy spanning over 40 years, we've been at the forefront of innovation in digital experiences. Our commitment to creativity and inclusivity drives us to create exceptional products that transform how companies interact with customers...
AI Optimization Solutions Expert

3 days ago

San Jose, California, United States NextDeavor Full time

Job Title: AI Optimization Solutions Expert. About the Role: The GenStudio Optimization Strategist will play a crucial role in helping our customers successfully integrate and leverage our cutting-edge generative AI tool, GenStudio for Performance Marketing. This includes guiding customers through the setup and implementation of our generative AI tool,...
AI Model Compression Expert

1 week ago

San Diego, California, United States Kneron Full time

We are looking for a talented AI Model Compression Expert to join our team at Kneron. As a key member of our team, you will be responsible for developing and implementing model compression techniques, including QAT, model distillation, pruning, quantization, and others for deep learning models. Key Responsibilities: Develop and implement novel deep neural...
AI Optimization Strategist for Performance Marketing

3 weeks ago

San Jose, California, United States Cypress HCM Full time

Job Title: AI Optimization Strategist for Performance Marketing. About Us: Cypress HCM is a leading multimedia and creative software company, pioneering the use of generative AI tools to enhance performance marketing. We are seeking an exceptional AI Optimization Strategist to join our team and help our customers unlock the full potential of our cutting-edge...
High Performance Optimization Engineer

3 weeks ago

San Francisco, California, United States Liquid AI Full time

We are seeking a highly skilled engineer at Liquid AI to optimize inference stacks tailored to diverse hardware platforms. This role is ideal for an expert with extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures. Key Responsibilities: Design and optimize inference stacks for GPUs, CPUs, and...
AI Model Efficiency Expert

3 days ago

San Francisco, California, United States Genmo Full time

About the Role: We are seeking an AI Model Efficiency Expert to join our team at Genmo. In this role, you will analyze and optimize the performance of our massive parallel and distributed systems. You will also implement and fine-tune distributed training strategies for multi-GPU and multi-node environments and develop and maintain benchmarking suites for...
AI Model Deployment Specialist

4 days ago

San Jose, California, United States TikTok Full time

About the Role: We are looking for an experienced AI Model Deployment Specialist to join our team at TikTok. As a key member of our Intelligent Creation - AI Platform team, you will be responsible for designing and implementing efficient engineering systems for generative AI tasks, including model training, optimization, deployment, and applications such as...
Senior AI Performance Optimization Lead

2 weeks ago

San Jose, California, United States Advanced Micro Devices, Inc. Full time

Job Overview: We are seeking a highly skilled Senior AI Performance Optimization Lead to join our team at Advanced Micro Devices, Inc. (AMD). As a key member of our engineering team, you will play a critical role in driving performance improvements and shaping the future of artificial intelligence (AI) on our GPU hardware.
Senior AI Model Optimization Engineer

3 days ago

San Francisco, California, United States Lumicity Full time

About Lumicity: We are a pioneering company in generative video models, pushing the boundaries of AI innovation. With a strong presence in San Francisco and over $10M in funding, we're expanding our team to tackle cutting-edge challenges. Salary: $180,000 - $220,000 per annum. The Role: We're seeking a highly skilled Senior AI Model Optimization Engineer to join...
Data Engineering Expert for AI Model Development

3 weeks ago

San Francisco, California, United States Scale AI Full time

About Scale AI: At Scale AI, our mission is to accelerate the development of AI applications. With 8 years of experience as the leading AI data foundry, we've helped fuel exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. Our recent Series F round has enabled us to accelerate the abundance of frontier data,...
Large Language Model Engineer

4 days ago

San Francisco, California, United States Perplexity AI Full time

Leveraging Expertise in Large Language Models: Are you an expert in large language models and conversational AI? Do you thrive in fast-paced environments where no two days are alike? We're Perplexity AI, a cutting-edge company dedicated to revolutionizing the conversational AI landscape. As a seasoned Large Language Model Engineer, you will play a pivotal role...
Cloud Infrastructure Optimization Expert

3 weeks ago

San Jose, California, United States EVONA Full time

Job Summary: We are seeking a highly skilled Cloud Infrastructure Optimization Expert to join our team at EVONA. As an expert in this role, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. The ideal candidate will have proven experience in managing and optimizing cloud infrastructure, preferably...
GPU Performance Architect

4 days ago

San Francisco, California, United States Liquid AI Full time

Unlocking AI Performance with Liquid AI: Liquid AI is seeking a highly skilled AI Inference Expert to join our team. As a key member of our engineering team, you will be responsible for optimizing inference stacks tailored to various hardware platforms, including GPUs, CPUs, and NPUs. If you have a passion for delivering exceptional performance and low...
Senior AI Model Optimization Specialist

3 weeks ago

San Francisco, California, United States Perplexity AI Full time

Overview: Perplexity AI is at the forefront of conversational search technology, having achieved tremendous growth and adoption since launching the world's first fully functional conversational answer engine. Our AI-powered search assistant has amassed 10 million monthly active users, with our mobile apps installed over 1 million times across iOS and Android...
High-Performance AI Solutions Engineer

2 weeks ago

San Jose, California, United States AMD Full time

We are transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of...
Optimization Engineer

4 days ago

San Francisco, California, United States Liquid AI Full time

Company Overview: Liquid AI is a cutting-edge technology company at the forefront of artificial intelligence innovation. We're dedicated to harnessing the power of machine learning to drive exceptional outcomes in various industries. Salary: $140,000 - $160,000 per annum, depending on experience and qualifications. Job Description: As we prepare to deploy our...
