AI GPU Runtime Software Engineer

4 weeks ago


San Jose, United States Advanced Micro Devices , Inc. Full time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

The Role

We are building IREE as an open-source compiler and runtime solution to productionize ML on a variety of usage scenarios and hardware targets. Among them, having wide and performant GPU support is critical. We aim at a broad range of GPU coverage, from mobile to datacenter, via a unified software stack. It requires us to write the most efficient code to interact with the OS and device drivers with minimal dependency and small binary size. There will be no short of intriguing technical challenges to tackle, and there are abundant chances to collaborate with industry experts working at different layers of the stack. If this sounds interesting to you, please don't hesitate to reach out to us

The Person

An ideal candidate should be familiar with GPU runtime APIs, GPU drivers, GPU architectures, OS, parallel/asynchronous programming, efficient resource management. He/she should be comfortable at performing quantitative analysis of workload and drive improvements at suitable software stack layers. Most importantly, the candidate is willing to learn and work across boundaries.

Key Responsibilities:

  • Design, develop, and maintain GPU related runtime implementations in IREE over HIP, CUDA, Vulkan, DirectX, Metal.
  • Design, develop, and maintain multi-GPU runtime and communication solutions including collectives
  • Manage testing and releasing of runtime components
  • Quantitively analyze end-to-end model performance, identify bottlenecks, propose ideas to improve, prototype and productionize solutions
  • Design and implement compiler passes to better schedule and utilize resources
  • Design and implement Python interactions with runtime components
  • Drive towards general solutions that benefit different all GPU targets and the overall community

Preferred Experience in following tools/flows

  • Experience with GPU APIs (HIP, CUDA, Vulkan, DirectX, Metal)
  • Understanding of GPU architectures
  • Understanding of parallel/asynchronous programming
  • Familiarity with operating system internals and resource management
  • Understanding of game engine internals
  • Experience with various system debugging/benchmarking/profiling tools
  • Strong C/C++ understanding and skills
  • Familiarity with IREE, MLIR, LLVM, SPIR-V or other compiler technologies
  • Open-source development ethos

Preferred Academic Credentials

BS/MS (Computer Science, Computer Engineering, Electrical Engineering, or related equivalent)

Location:
San Jose, CA, USA

#LI-G11

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.



  • San Jose, United States AMD Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Jose, United States Advanced Micro Devices, Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Jose, United States Advanced Micro Devices , Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Jose, United States Oho Group Ltd Full time

    An industry leading smart electric vehicle company is looking for a Virtualization Engineer that specialises within GPU.Their focus areas include designing, developing, co-manufacturing, and selling high-end smart electric vehicles. They specialize within autonomous driving, digital technologies, electric powertrains, and battery systems.Roles and...


  • san jose, United States Oho Group Ltd Full time

    An industry leading smart electric vehicle company is looking for a Virtualization Engineer that specialises within GPU.Their focus areas include designing, developing, co-manufacturing, and selling high-end smart electric vehicles. They specialize within autonomous driving, digital technologies, electric powertrains, and battery systems.Roles and...


  • San Francisco, United States ZipRecruiter Full time

    About the Company:Our client is a company building the world's highest-performance pure digital AI inference chip. They are seeking a Software Architect to lead their software efforts and advance the software stack that includes ML frameworks, compilers, libraries, and runtime. As a Software Architect, you will be responsible for designing and developing...


  • San Jose, United States Software Guidance and Assistance, Inc. Full time

    Software Guidance & Assistance, Inc., (SGA), is searching for a GPU Software Developer - C++ for a Contract assignment with one of our premier SaaS clients in San Jose, CA or Seattle, WA . Responsibilities : Work on developing GPU components for the video processing pipeline Work on architecting, coding and productizing the high-performance GPU components...


  • San Jose, United States Software Guidance and Assistance, Inc. Full time

    Software Guidance & Assistance, Inc., (SGA), is searching for a GPU Software Developer - C++ for a Contract assignment with one of our premier SaaS clients in San Jose, CA or Seattle, WA . Responsibilities : Work on developing GPU components for the video processing pipeline Work on architecting, coding and productizing the high-performance GPU...


  • San Jose, United States Advanced Micro Devices, Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded. Underpinning our...

  • AI Solution Architect

    3 weeks ago


    San Jose, United States Advanced Micro Devices , Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...

  • AI Solution Architect

    3 weeks ago


    San Jose, United States Advanced Micro Devices , Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Jose, United States Oho Group Ltd Full time

    We are working with a leading innovator in smart electric vehicles who are seeking GPU Virtualization Engineers. The company specializes in autonomous driving, digital systems, electric powertrains, and batteries. Notable advancements include battery swapping technology, Battery as a Service (BaaS), and Autonomous Driving as a Service (ADaaS). Its diverse...


  • San Jose, United States Advanced Micro Devices, Inc Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...

  • AI Inference Engineer

    2 weeks ago


    San Francisco, United States Perplexity AI Full time

    Job DescriptionJob DescriptionWe are looking for an AI Inference to join our growing team. Our current stack is Python, C++, TensorRT-LLM, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.ResponsibilitiesDevelop APIs for AI inference that will be used by both internal and external...

  • GPU Modeling Engineer

    6 months ago


    San Jose, United States SAMSUNG Full time

    Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...


  • San Carlos, United States Beacon AI Full time

    About Beacon AIBeacon AI is developing AI pilot assistant technology to transform aviation, flight safety, operational efficiency, and pilot capabilities. We are on a mission to leverage the power of artificial intelligence and advanced data analytics to revolutionize the aviation industry. Join us to be at the cutting edge of technological innovation for...


  • San Francisco, United States CentML Full time

    About UsWe believe AI will fundamentally transform how people live and work. CentML's mission is to massively reduce the cost of developing and deploying ML models so we can enable anyone to harness the power of AI and everyone to benefit from its potential.Our founding team is made up of experts in AI, compilers, and ML hardware and has led efforts at...


  • San Francisco, United States Unum AI Full time

    Unum is the deep-tech startup reinventing Data-Lakes for extreme scale and AI! You can think of it as Snowflake and OpenAI combined. We are searching for a passionate and competitive Product Manager to join us in designing next-generation data infrastructure to empower million data-intensive and AI applications! Tasks You will orchestrate software...


  • San Diego, California, United States Qualcomm Full time

    Job Title: Senior Staff Graphics Software EngineerCompany: Qualcomm Technologies, Inc.Job Area: Engineering Group, Engineering Group > Graphics Software EngineeringJob Summary:As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation gaming, XR, and AI experiences. Our team of skilled engineers works...

  • AI Engineer

    3 weeks ago


    San Francisco, United States Hyperbolic Labs Full time

    Who We Are: Hyperbolic Labs is on a mission to democratize AI by breaking down the barriers to computing power with our Open-Access AI Cloud. By making better use of idle computing resources across the globe, we offer an innovative GPU marketplace and AI inference service that promise affordability and accessibility for all. As pioneers at the intersection...