Optimization Expert for AI Model Performance

2 days ago


San Jose, California, United States Adobe Inc. Full time

About the Opportunity

">

We are seeking a highly skilled Optimization Expert to join our team at Adobe Inc. in a strategic and visible role that applies GPU optimization skills towards improving the training efficiency and performance of our commercially safe AI models.

">

The Firefly family of creative generative AI models is revolutionizing the way we conceptualize, build, and scale content. As an Optimization Expert, you will be working on optimizing model efficiency for Hopper/Blackwell GPU architectures, leveraging FP8 to accelerate training and inference, and writing high-quality, product-level code that is easy to maintain and test.

">

Key Responsibilities

">
  • Collaborate with model architecture teams to co-design hardware-aware models
  • Develop efficient kernels for forward and backward passes in CUDA, Cutlass / CuTe, Triton
  • Create optimized custom layers using PyTorch
  • Leverage profiling tools - Nsight, Kineto, etc.
  • Work on other related tasks as needed
">

Requirements

">
  • Bachelor's, Master's, or Ph.D. in Computer Science, Computer Engineering, or a related field, and 5+ years of relevant experience
  • Proficiency in Linux, Docker
  • Strong understanding of modern transformer-based model architectures
  • Expertise in Python, PyTorch, CUDA, Triton, Cutlass/CuTe, and C++
  • Familiarity with distributed training fundamentals
">

What We Offer

">

We offer an estimated annual salary range of $170,900-$325,200, depending on location and job-related knowledge, skills, and experience. Our compensation reflects the cost of labor across several U.S. geographic markets. We are proud to be an Equal Employment Opportunity and affirmative action employer, committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity.



  • San Jose, California, United States Syntricate Technologies Full time

    Role Summary:We're seeking a Deep Learning and AI Model Optimization Expert to join our team at Syntricate Technologies. As a key member of our R&D department, you will be responsible for exploring and documenting how to port cutting-edge AI models to AMD's devices. Your expertise in AI frameworks like ONNX, Pytorch, or TensorFlow will be crucial in...


  • San Jose, California, United States Adobe Full time

    About UsAt Adobe, we're passionate about empowering people to create and deliver exceptional digital experiences. Our company is committed to creating an inclusive environment where everyone has access to equal opportunity.The RoleWe're seeking a highly skilled Lead AI Model Optimization Expert to join our team. This role involves applying GPU optimization...


  • San Francisco, California, United States ZipRecruiter Full time

    Job Title: AI Performance Optimization ExpertCompany OverviewWe are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI.


  • San Jose, California, United States Tik Tok Full time

    About the RoleThe AI Model Training and Deployment Expert will design, architect, and implement backend systems to deploy generative AI models for image and video generation use cases.Responsibilities:Design and implement highly efficient engineering systems for generative AI tasks.Optimize the performance of generative AI model training and serving.Build...


  • San Jose, California, United States Adobe Inc. Full time

    About Adobe Inc.At Adobe, we're passionate about empowering creatives to push the boundaries of what's possible. With a legacy spanning over 40 years, we've been at the forefront of innovation in digital experiences. Our commitment to creativity and inclusivity drives us to create exceptional products that transform how companies interact with customers...


  • San Jose, California, United States NextDeavor Full time

    Job Title: AI Optimization Solutions ExpertAbout the Role:The GenStudio Optimization Strategist will play a crucial role in helping our customers successfully integrate and leverage our cutting-edge generative AI tool, GenStudio for Performance Marketing. This includes guiding customers through the setup and implementation of our generative AI tool,...


  • San Diego, California, United States Kneron Full time

    We are looking for a talented AI Model Compression Expert to join our team at Kneron. As a key member of our team, you will be responsible for developing and implementing model compression techniques, including QAT, model distillation, pruning, quantization, and others for deep learning models.Key Responsibilities:Develop and implement novel deep neural...


  • San Jose, California, United States Cypress HCM Full time

    Job Title: AI Optimization Strategist for Performance MarketingAbout Us:Cypress HCM is a leading multimedia and creative software company, pioneering the use of generative AI tools to enhance performance marketing. We are seeking an exceptional Ai Optimization Strategist to join our team and help our customers unlock the full potential of our cutting-edge...


  • San Francisco, California, United States Liquid AI Full time

    We are seeking a highly skilled engineer at Liquid AI to optimize inference stacks tailored to diverse hardware platforms. This role is ideal for an expert with extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures.Key ResponsibilitiesDesign and optimize inference stacks for GPUs, CPUs, and...


  • San Francisco, California, United States Genmo Full time

    About the Role: We are seeking an AI Model Efficiency Expert to join our team at Genmo. In this role, you will analyze and optimize the performance of our massive parallel and distributed systems. You will also implement and fine-tune distributed training strategies for multi-GPU and multi-node environments and develop and maintain benchmarking suites for...


  • San Jose, California, United States Tik Tok Full time

    About the RoleWe are looking for an experienced Ai Model Deployment Specialist to join our team at TikTok. As a key member of our Intelligent Creation - AI Platform team, you will be responsible for designing and implementing efficient engineering systems for generative AI tasks, including model training, optimization, deployment, and applications such as...


  • San Jose, California, United States Advanced Micro Devices, Inc. Full time

    Job OverviewWe are seeking a highly skilled Senior AI Performance Optimization Lead to join our team at Advanced Micro Devices, Inc. (AMD). As a key member of our engineering team, you will play a critical role in driving performance improvements and shaping the future of artificial intelligence (AI) on our GPU hardware.


  • San Francisco, California, United States Lumicity Full time

    About LumicityWe are a pioneering company in generative video models, pushing the boundaries of AI innovation. With a strong presence in San Francisco and over $10M in funding, we're expanding our team to tackle cutting-edge challenges.Salary: $180,000 - $220,000 per annumThe RoleWe're seeking a highly skilled Senior AI Model Optimization Engineer to join...


  • San Francisco, California, United States Scale AI Full time

    About Scale AIAt Scale AI, our mission is to accelerate the development of AI applications. With 8 years of experience as the leading AI data foundry, we've helped fuel exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. Our recent Series F round has enabled us to accelerate the abundance of frontier data,...


  • San Francisco, California, United States Perplexity AI Full time

    Leveraging Expertise in Large Language ModelsAre you an expert in large language models and conversational AI? Do you thrive in fast-paced environments where no two days are alike? We're Perplexity AI, a cutting-edge company dedicated to revolutionizing the conversational AI landscape. As a seasoned Large Language Model Engineer, you will play a pivotal role...


  • San Jose, California, United States EVONA Full time

    Job Summary">We are seeking a highly skilled Cloud Infrastructure Optimization Expert to join our team at EVONA. As an expert in this role, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure.">The ideal candidate will have proven experience in managing and optimizing cloud infrastructure, preferably...


  • San Francisco, California, United States Liquid AI Full time

    Unlocking AI Performance with Liquid AILiquid AI is seeking a highly skilled AI Inference Expert to join our team. As a key member of our engineering team, you will be responsible for optimizing inference stacks tailored to various hardware platforms, including GPUs, CPUs, and NPUs. If you have a passion for delivering exceptional performance and low...


  • San Francisco, California, United States Perplexity AI Full time

    OverviewPerplexity AI is at the forefront of conversational search technology, having achieved tremendous growth and adoption since launching the world's first fully functional conversational answer engine. Our AI-powered search assistant has amassed 10 million monthly active users, with our mobile apps installed over 1 million times across iOS and Android...


  • San Jose, California, United States AMD Full time

    We are transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.Underpinning our mission is the AMD culture. We push the limits of...


  • San Francisco, California, United States Liquid AI Full time

    Company Overview: Liquid AI is a cutting-edge technology company at the forefront of artificial intelligence innovation. We're dedicated to harnessing the power of machine learning to drive exceptional outcomes in various industries.Salary: $140,000 - $160,000 per annum, depending on experience and qualifications.Job Description: As we prepare to deploy our...