GPU Performance Architect

2 days ago


San Francisco, California, United States Liquid AI Full time
Unlocking AI Performance with Liquid AI

Liquid AI is seeking a highly skilled AI Inference Expert to join our engineering team. In this role, you will optimize inference stacks tailored to various hardware platforms, including GPUs, CPUs, and NPUs. If you have a passion for delivering exceptional performance and low latency, we want to hear from you.

Job Requirements

The ideal candidate is a highly skilled engineer with extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures. Proficiency in building and enhancing inference stacks using frameworks like ggml, vLLM, and DeepSpeed is essential. Additionally, experience with mobile development and expertise in cache-aware algorithms will be highly valued.

  • Strong ML Experience: Proficiency in Python and PyTorch to effectively interface with the ML team at a deeply technical level.
  • Hardware Awareness: Must understand modern hardware architecture, including cache hierarchies and memory access patterns, and their impact on performance.
  • Proficient in Coding: Expertise in Python and PyTorch, plus at least one of CUDA, Triton, or C++, is essential for this role.
  • Optimization of Low-Level Primitives: Responsible for optimizing core primitives to ensure efficient model execution.
  • Self-Guided and Ownership: Ability to independently take a PyTorch model and inference requirements (e.g., maximize GPU throughput or minimize CPU latency) and deliver a fully optimized stack with minimal guidance.
  • Research-Driven: Should stay up to date with advancements in ML inference, such as new quantization techniques or speculative decoding, while maintaining focus on delivering practical solutions.

Our Benefits

We offer a competitive salary of $190,000 - $230,000 per annum, depending on experience, plus benefits including health insurance, a retirement plan, and generous paid time off. Our team is passionate about innovation and collaboration, and we strive to create a workplace that is inclusive and respectful.



  • San Diego, California, United States Qualcomm Full time

    Qualcomm is a leading technology innovator pushing the boundaries of what's possible to enable next-generation experiences and drive digital transformation. This role involves architecting, designing, implementing, verifying, and optimizing the performance and power of GPU cores. The successful candidate will collaborate with cross-functional teams to meet and...


  • San Jose, California, United States Software Guidance and Assistance, Inc. Full time

    About Us: Software Guidance & Assistance, Inc. (SGA) is a technology and resource solutions provider dedicated to delivering exceptional results. As a women-owned business, we pride ourselves on our personal approach to solving complex IT problems. We are seeking a highly skilled GPU Software Developer - C++ for a contract assignment with one of our...

  • Senior GPU Architect

    3 weeks ago


    San Jose, California, United States TalentBridge Full time

    At TalentBridge, we are seeking a highly skilled Senior GPU Architect to join our team. This role is an excellent opportunity for a talented software engineer with expertise in high-performance computing and modern GPU processing paradigms. About the Role: We are looking for a motivated individual who can work collaboratively with global teams to develop...


  • San Diego, California, United States Qualcomm Full time

    Unlock Next-Generation Experiences with Qualcomm. As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. With our innovative products and services, we're transforming industries, shaping markets, and...


  • San Jose, California, United States Software Guidance and Assistance, Inc. Full time

    Company Overview: Software Guidance & Assistance, Inc. is a technology and resource solutions provider driven to stand out as a women-owned business. We are dedicated to solving big IT problems with a more personal, boutique approach, matching consultants like you to over 1,000 engagements each year. Job Description: Responsibilities: Develop high-performance GPU...


  • San Diego, California, United States LanceSoft Full time

    Why Join Us? LanceSoft is committed to fostering a collaborative and inclusive work environment where our employees can grow professionally and personally. As a High-Performance GPU Developer, you will have the opportunity to work on cutting-edge projects that drive innovation in the field of graphics processing units (GPUs). Job Details: This position involves...


  • San Diego, California, United States Qualcomm Full time

    Job Overview: We are seeking a skilled GPU Performance Optimization Engineer to join our team at Qualcomm. About Qualcomm: Qualcomm is a leading technology innovator pushing the boundaries of what's possible to enable next-generation experiences and drive digital transformation. Responsibilities: Leverage advanced GPU knowledge to architect, design, implement,...


  • San Jose, California, United States Advanced Micro Devices, Inc. Full time

    Job Title: GPU Architect and Software Engineer. Are you a skilled engineer with expertise in GPU architecture and software development? We are seeking a talented GPU Architect and Software Engineer to join our AI team at Advanced Micro Devices, Inc. Estimated Salary: $200,000 - $280,000 per year. About the Role: The successful candidate will have a strong...


  • San Jose, California, United States Advanced Micro Devices, Inc. Full time

    At Advanced Micro Devices, Inc., we're on a mission to revolutionize the world of computing with our cutting-edge technology. As a Powerful GPU Architect Lead, you'll play a vital role in shaping the future of graphics processing units (GPUs) and unlocking new possibilities for our customers, partners, and developers. We're looking for an exceptional...


  • San Jose, California, United States Software Guidance and Assistance, Inc. Full time

    Job Description: We are seeking a highly skilled GPU Software Engineer to join our team at Software Guidance and Assistance, Inc. in San Jose, CA or Seattle, WA. The successful candidate will be responsible for developing GPU components for the video processing pipeline, architecting, coding, and productizing high-performance GPU components, ensuring quality,...


  • San Jose, California, United States Advanced Micro Devices, Inc. Full time

    Job Description: We are seeking a highly skilled Senior GPU Architect Lead to join our team at Advanced Micro Devices, Inc. This is a unique opportunity to transform lives with AMD technology and enrich our industry, communities, and the world. About Us: We are a leader in the development of high-performance hardware architecture. We push the limits of innovation...


  • San Jose, California, United States Software Guidance and Assistance, Inc. Full time

    Job Title: C++ Developer for Video Processing. At Software Guidance and Assistance, Inc., we are looking for a talented C++ Developer to join our team in San Jose, CA or Seattle, WA. The ideal candidate will have a strong background in C++ and GPU programming and be able to design and implement high-performance GPU components for the video processing...


  • San Francisco, California, United States Acceler8 Talent Full time

    Acceler8 Talent is a Fast-Growing Startup. We are looking for a talented High-Performance Processor Architect to join our team and contribute to the development of revolutionary processors designed to significantly outperform traditional GPUs for Generative AI workloads. Responsibilities: Design and optimize memory subsystems, including HBM, to meet...


  • San Jose, California, United States TalentBridge Full time

    TalentBridge is seeking a highly skilled Sr. Software Engineer - GPU Architecture to join our team in San Jose, CA. Estimated Salary: $140,000 - $170,000 per year. We are looking for an experienced professional with 5+ years of experience in Software Engineering and expertise in GPU programming with CUDA, Metal, and OpenCL. The ideal candidate will have strong...


  • San Jose, California, United States Software Guidance and Assistance, Inc. Full time

    About Software Guidance and Assistance, Inc. We're a technology and resource solutions provider dedicated to delivering exceptional results. As a women-owned business, we pride ourselves on our boutique approach to solving big IT problems. Each year, we match consultants like you to over 1,000 engagements. Our mission is to provide personalized support that...

  • Senior GPU Architect

    3 weeks ago


    San Jose, California, United States Samsung Austin Semiconductor Full time

    Company Overview: Samsung Austin Semiconductor, LLC is a leading semiconductor manufacturer that has multiple positions available in San Jose, California. We are seeking highly skilled professionals to join our team. Salary Range: The estimated salary for this position is between $214,207 and $257,336 per year. Job Description: Develop physical design/integration...


  • San Jose, California, United States Ursus Inc Full time

    Job Overview: At Ursus Inc, we are looking for a skilled GPU Subsystem Verification Expert to lead our verification efforts. The successful candidate will be responsible for developing and executing verification plans, working closely with cross-functional teams. Responsibilities: Developing and executing verification plans for GPU subsystems. Managing...


  • San Jose, California, United States AMD Full time

    We are transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of...


  • San Francisco, California, United States ZipRecruiter Full time

    About the Company: ZipRecruiter's client is a pioneering company that designs and manufactures cutting-edge pure digital AI inference chips. They are seeking a skilled Software Architect to lead their software efforts, drive innovation, and advance the software stack that includes ML frameworks, compilers, libraries, and...


  • San Diego, California, United States Qualcomm Full time

    Unlock Next-Generation Gaming, XR, and AI Experiences. We are seeking a highly skilled Senior GPU Machine Learning Engineer to join our team at Qualcomm. As a leading technology innovator, we push the boundaries of what's possible to enable next-generation gaming, XR, and AI experiences. Key Responsibilities: Architect, design, implement, verify, and optimize...