Deep Learning Compiler Engineer for Ryzen AI NPU

1 week ago


San Jose, California, United States Advanced Micro Devices, Inc. Full time

Overview:

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences: the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

Responsibilities:

The Role:

We are looking for a talented Machine Learning (ML) Compiler SW Engineer to join our growing team in the AI group and play a crucial role in developing the SW toolset used to deploy cutting-edge ML models on AMD's XDNA Neural Processing Units (NPUs). You will be responsible for designing, implementing, and optimizing compilers that translate Gen-AI ML inference models such as SDXL-Turbo, Llama2, and Mistral into low-level code for specialized hardware architectures. Your work will directly impact the efficiency, scalability, and reliability of our ML applications. If you thrive in a fast-paced environment and love working on cutting-edge machine learning inference, this role is for you.

Responsibilities:

  • Design and develop novel algorithms for tiling and mapping quantized ML workloads on application-specific hardware platforms.
  • Analyze and transform intermediate representations of ML models (computational graphs) for efficient execution.
  • Collaborate with architecture and runtime software teams to understand optimization requirements and translate them into effective compiler strategies.
  • Collaborate with kernel developers to understand tiling requirements and plan the dataflow and buffer allocation schemes accordingly.
  • Develop back-end optimization passes to convert high-level representations into driver calls.
  • Implement compiler optimizations for performance, resource usage, and compute efficiency.
  • Develop and maintain unit tests and integration tests for the compiler to support different generations of HW architectures.
  • Enable detailed profiling and debugging tools for analyzing performance bottlenecks and deadlocks in the dataflow schemes.
  • Stay up to date on the latest advancements in ML compiler technology and hardware architectures.

Qualifications:

  • Master's degree or PhD in Computer Science, Engineering, or a related field (or Bachelor's degree with significant experience).
  • Strong understanding of compiler design principles (front-end, middle-end, back-end).
  • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch).
  • Experience working with ML compilers (e.g., MLIR, TVM).
  • Experience with ML models such as CNNs, LSTMs, LLMs, and diffusion models is a must.
  • Excellent programming skills in Python, C++, or similar languages.
  • Experience with machine learning hardware architectures (e.g., GPUs, TPUs, VLIW) is a plus.
  • A passion for innovation and a strong desire to push the boundaries of machine learning performance.

#LI-HYBRID

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. Depending on your role, you may be eligible for incentives such as an annual bonus or a sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, and to buy AMD stock at a discount if they voluntarily participate in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.


