Optimization Expert for AI Model Performance
2 days ago
About the Opportunity
We are seeking a highly skilled Optimization Expert to join our team at Adobe Inc. in a strategic and visible role that applies GPU optimization skills towards improving the training efficiency and performance of our commercially safe AI models.
The Firefly family of creative generative AI models is revolutionizing the way we conceptualize, build, and scale content. As an Optimization Expert, you will be working on optimizing model efficiency for Hopper/Blackwell GPU architectures, leveraging FP8 to accelerate training and inference, and writing high-quality, product-level code that is easy to maintain and test.
Key Responsibilities
- Collaborate with model architecture teams to co-design hardware-aware models
- Develop efficient kernels for forward and backward passes in CUDA, Cutlass / CuTe, Triton
- Create optimized custom layers using PyTorch
- Leverage profiling tools such as Nsight and Kineto
- Work on other related tasks as needed
Requirements
- Bachelor's, Master's, or Ph.D. in Computer Science, Computer Engineering, or a related field, and 5+ years of relevant experience
- Proficiency in Linux, Docker
- Strong understanding of modern transformer-based model architectures
- Expertise in Python, PyTorch, CUDA, Triton, Cutlass/CuTe, and C++
- Familiarity with distributed training fundamentals
What We Offer
We offer an estimated annual salary range of $170,900-$325,200, depending on location and job-related knowledge, skills, and experience. Our compensation reflects the cost of labor across several U.S. geographic markets. We are proud to be an Equal Employment Opportunity and affirmative action employer, committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.
-
San Jose, California, United States Syntricate Technologies Full time
Role Summary: We're seeking a Deep Learning and AI Model Optimization Expert to join our team at Syntricate Technologies. As a key member of our R&D department, you will be responsible for exploring and documenting how to port cutting-edge AI models to AMD's devices. Your expertise in AI frameworks like ONNX, PyTorch, or TensorFlow will be crucial in...
-
Lead AI Model Optimization Expert
3 weeks ago
San Jose, California, United States Adobe Full time
About Us: At Adobe, we're passionate about empowering people to create and deliver exceptional digital experiences. Our company is committed to creating an inclusive environment where everyone has access to equal opportunity. The Role: We're seeking a highly skilled Lead AI Model Optimization Expert to join our team. This role involves applying GPU optimization...
-
AI Performance Optimization Expert
4 days ago
San Francisco, California, United States ZipRecruiter Full time
Job Title: AI Performance Optimization Expert. Company Overview: We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI.
-
AI Model Training and Deployment Expert
3 days ago
San Jose, California, United States TikTok Full time
About the Role: The AI Model Training and Deployment Expert will design, architect, and implement backend systems to deploy generative AI models for image and video generation use cases. Responsibilities: Design and implement highly efficient engineering systems for generative AI tasks. Optimize the performance of generative AI model training and serving. Build...
-
High-Performance GPU Optimization Expert
4 weeks ago
San Jose, California, United States Adobe Inc. Full time
About Adobe Inc.: At Adobe, we're passionate about empowering creatives to push the boundaries of what's possible. With a legacy spanning over 40 years, we've been at the forefront of innovation in digital experiences. Our commitment to creativity and inclusivity drives us to create exceptional products that transform how companies interact with customers...
-
AI Optimization Solutions Expert
3 days ago
San Jose, California, United States NextDeavor Full time
Job Title: AI Optimization Solutions Expert. About the Role: The GenStudio Optimization Strategist will play a crucial role in helping our customers successfully integrate and leverage our cutting-edge generative AI tool, GenStudio for Performance Marketing. This includes guiding customers through the setup and implementation of our generative AI tool,...
-
AI Model Compression Expert
1 week ago
San Diego, California, United States Kneron Full time
We are looking for a talented AI Model Compression Expert to join our team at Kneron. As a key member of our team, you will be responsible for developing and implementing model compression techniques, including QAT, model distillation, pruning, quantization, and others for deep learning models. Key Responsibilities: Develop and implement novel deep neural...
-
San Jose, California, United States Cypress HCM Full time
Job Title: AI Optimization Strategist for Performance Marketing. About Us: Cypress HCM is a leading multimedia and creative software company, pioneering the use of generative AI tools to enhance performance marketing. We are seeking an exceptional AI Optimization Strategist to join our team and help our customers unlock the full potential of our cutting-edge...
-
High Performance Optimization Engineer
3 weeks ago
San Francisco, California, United States Liquid AI Full time
We are seeking a highly skilled engineer at Liquid AI to optimize inference stacks tailored to diverse hardware platforms. This role is ideal for an expert with extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures. Key Responsibilities: Design and optimize inference stacks for GPUs, CPUs, and...
-
AI Model Efficiency Expert
3 days ago
San Francisco, California, United States Genmo Full time
About the Role: We are seeking an AI Model Efficiency Expert to join our team at Genmo. In this role, you will analyze and optimize the performance of our massive parallel and distributed systems. You will also implement and fine-tune distributed training strategies for multi-GPU and multi-node environments and develop and maintain benchmarking suites for...
-
AI Model Deployment Specialist
4 days ago
San Jose, California, United States TikTok Full time
About the Role: We are looking for an experienced AI Model Deployment Specialist to join our team at TikTok. As a key member of our Intelligent Creation - AI Platform team, you will be responsible for designing and implementing efficient engineering systems for generative AI tasks, including model training, optimization, deployment, and applications such as...
-
Senior AI Performance Optimization Lead
2 weeks ago
San Jose, California, United States Advanced Micro Devices, Inc. Full time
Job Overview: We are seeking a highly skilled Senior AI Performance Optimization Lead to join our team at Advanced Micro Devices, Inc. (AMD). As a key member of our engineering team, you will play a critical role in driving performance improvements and shaping the future of artificial intelligence (AI) on our GPU hardware.
-
Senior AI Model Optimization Engineer
3 days ago
San Francisco, California, United States Lumicity Full time
About Lumicity: We are a pioneering company in generative video models, pushing the boundaries of AI innovation. With a strong presence in San Francisco and over $10M in funding, we're expanding our team to tackle cutting-edge challenges. Salary: $180,000 - $220,000 per annum. The Role: We're seeking a highly skilled Senior AI Model Optimization Engineer to join...
-
Data Engineering Expert for AI Model Development
3 weeks ago
San Francisco, California, United States Scale AI Full time
About Scale AI: At Scale AI, our mission is to accelerate the development of AI applications. With 8 years of experience as the leading AI data foundry, we've helped fuel exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. Our recent Series F round has enabled us to accelerate the abundance of frontier data,...
-
Large Language Model Engineer
4 days ago
San Francisco, California, United States Perplexity AI Full time
Leveraging Expertise in Large Language Models: Are you an expert in large language models and conversational AI? Do you thrive in fast-paced environments where no two days are alike? We're Perplexity AI, a cutting-edge company dedicated to revolutionizing the conversational AI landscape. As a seasoned Large Language Model Engineer, you will play a pivotal role...
-
Cloud Infrastructure Optimization Expert
3 weeks ago
San Jose, California, United States EVONA Full time
Job Summary: We are seeking a highly skilled Cloud Infrastructure Optimization Expert to join our team at EVONA. As an expert in this role, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. The ideal candidate will have proven experience in managing and optimizing cloud infrastructure, preferably...
-
GPU Performance Architect
4 days ago
San Francisco, California, United States Liquid AI Full time
Unlocking AI Performance with Liquid AI: Liquid AI is seeking a highly skilled AI Inference Expert to join our team. As a key member of our engineering team, you will be responsible for optimizing inference stacks tailored to various hardware platforms, including GPUs, CPUs, and NPUs. If you have a passion for delivering exceptional performance and low...
-
Senior AI Model Optimization Specialist
3 weeks ago
San Francisco, California, United States Perplexity AI Full time
Overview: Perplexity AI is at the forefront of conversational search technology, having achieved tremendous growth and adoption since launching the world's first fully functional conversational answer engine. Our AI-powered search assistant has amassed 10 million monthly active users, with our mobile apps installed over 1 million times across iOS and Android...
-
High-Performance AI Solutions Engineer
2 weeks ago
San Jose, California, United States AMD Full time
We are transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of...
-
Optimization Engineer
4 days ago
San Francisco, California, United States Liquid AI Full time
Company Overview: Liquid AI is a cutting-edge technology company at the forefront of artificial intelligence innovation. We're dedicated to harnessing the power of machine learning to drive exceptional outcomes in various industries. Salary: $140,000 - $160,000 per annum, depending on experience and qualifications. Job Description: As we prepare to deploy our...