AI Performance Engineer

2 weeks ago


San Jose, California, United States AMD Full time

About AMD:

At AMD, we are committed to transforming lives through our innovative technology. Our mission is to deliver exceptional products that enhance computing experiences across various domains, including data centers, artificial intelligence, personal computing, gaming, and embedded systems. We foster a culture of innovation, striving to address significant global challenges while promoting collaboration, humility, and diverse perspectives.

Position Overview:

This role is integral to our efforts in large-scale AI acceleration. You will be responsible for developing advanced tools and methodologies aimed at optimizing system performance for AI tasks on Ryzen AI SoC. Your work will involve exploring innovative optimization techniques to enhance state-of-the-art AI models, particularly in vision, language, and generative applications, while collaborating with leading engineers from AMD's CPU, GPU, and Adaptable Compute teams.

Key Responsibilities:

  • Design and implement tools for analyzing AI workloads and optimizing their mapping to AI accelerators.
  • Work collaboratively with various teams to improve components of the software stack, including AI compilers, frameworks, device drivers, and firmware.
  • Conduct experiments to refine data movement strategies and tiling techniques for the latest AI models, including generative AI.
  • Engage with cross-functional teams to establish requirements for backend libraries, compilers, and toolchains that facilitate AI model acceleration on AMD platforms.

Qualifications:

  • Strong understanding of AI and machine learning principles, with practical experience applying these concepts in research or professional environments.
  • Knowledge of how different compute, memory, and communication configurations impact AI acceleration, along with hardware and software implementation choices.
  • Experience in coding and optimizing for VLIW processors, with a focus on high-performance operations such as CONV, GEMM, and non-linear computations.
  • Familiarity with AI frameworks, especially ONNX.
  • Experience with AI/ML inference stacks, including ONNXRuntime.
  • Prior experience with GPU acceleration, utilizing either AMD or Nvidia GPUs.
  • Ability to collaborate effectively using version control systems, such as git.
  • Proficient in benchmarking methodologies and debugging tools.
  • Excellent communication and problem-solving skills.

Educational Background:

  • A Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field.

Compensation and Benefits:

At AMD, your compensation package includes various elements tailored to your skills, qualifications, and experience. You may also be eligible for performance-based incentives and stock ownership opportunities through our Employee Stock Purchase Plan, along with competitive benefits.

AMD is an equal opportunity employer, committed to inclusivity and welcoming applications from all qualified individuals. We accommodate applicants' needs in accordance with applicable laws throughout the recruitment process.



  • San Jose, California, United States AMD Full time

    Transform the Future with AMDAt AMD, we are committed to innovating technology that improves lives and drives progress in our communities and around the world. Our mission is to develop exceptional products that enhance next-generation computing experiences, forming the backbone of data centers, artificial intelligence, personal computing, gaming, and...


  • San Jose, California, United States AMD Full time

    About AMD:At AMD, we are committed to transforming lives through innovative technology. Our mission is to develop exceptional products that enhance computing experiences across various sectors, including data centers, artificial intelligence, personal computing, gaming, and embedded systems. Our culture emphasizes pushing the boundaries of innovation to...


  • San Francisco, California, United States Untether AI Full time

    Untether AI is looking for a talented AI Applications Engineer to join our Product team to support our customers with SDK for our custom AI accelerator devices. You will be working with data scientists to ensure their AI workloads are ported and running efficiently on Untether AI products. Must be a US citizen to apply.Ideal candidate profileYou have...

  • AI Engineering Lead

    1 week ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking an Engineering Director to spearhead our AI Platform division. This team is responsible for developing cutting-edge software solutions that drive the Snorkel Flow platform. The focus includes creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric methodologies,...

  • AI Engineering Lead

    2 weeks ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to spearhead our AI Platform division. This team is responsible for developing cutting-edge software systems that enhance the Snorkel Flow platform. Responsibilities include creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...

  • AI Engineering Lead

    1 week ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking an experienced Director of Engineering to spearhead our AI Platform division. This team is responsible for developing cutting-edge software systems that drive the Snorkel Flow platform. Key responsibilities include creating services for training and deploying generative AI and machine learning models utilizing innovative...


  • San Francisco, California, United States Fractional AI Full time

    About Fractional AIWe are a cutting-edge technology company specializing in applied AI solutions. Our team of experts helps large enterprises automate complex workflows, leveraging the power of generative AI to drive innovation and efficiency.Our mission is to empower businesses to unlock the full potential of AI, streamlining processes and driving growth....


  • San Jose, California, United States Qubrid AI Full time

    Overview:This pivotal role demands recent experience in marketing AI software solutions leveraging NVIDIA GPU and software technologies. It is a high-stakes position with a modest base salary complemented by substantial commission and equity potential within a rapidly evolving AI startup.Position Overview:Qubrid AI is at the forefront of AI adoption,...


  • San Francisco, California, United States Fractional AI Full time

    About Fractional AIFractional AI is a premier development firm focused on practical AI applications. We tackle complex AI challenges that our clients lack the resources or expertise to address independently, moving beyond technical jargon and flashy presentations to implement AI solutions efficiently.We are convinced that the transformative potential of...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to oversee our AI Platform division. This team is responsible for developing cutting-edge software systems that enhance the Snorkel Flow platform. The focus includes creating services for training and deploying generative AI and machine learning models utilizing advanced data-centric methodologies,...


  • San Jose, California, United States Qubrid AI Full time

    Position Overview:We are seeking a visionary and results-driven Vice President of Revenue for NVIDIA AI Solutions to spearhead our sales initiatives in the rapidly evolving AI landscape. This pivotal role demands a blend of strategic insight and hands-on execution, particularly in selling AI software solutions powered by NVIDIA technologies.About Qubrid...

  • AI Solutions Engineer

    2 weeks ago


    San Francisco, California, United States Fractional AI Full time

    About Fractional AIFractional AI is a premier development firm specializing in applied artificial intelligence. We tackle complex AI-driven challenges that our clients lack the resources or expertise to address independently, streamlining the process to implement AI solutions efficiently.We are convinced that the transformative potential of generative AI...


  • San Jose, California, United States AMD Full time

    About the Role:We are seeking a highly skilled Software Development Engineer to join our team at AMD, where you will play a critical role in enabling AI acceleration at scale. As a key member of our organization, you will work on developing tools and methodologies to optimize and realize full system performance for AI workloads on Ryzen AI SoC.Key...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to oversee our AI Platform division. This team is responsible for developing cutting-edge software systems that drive the Snorkel Flow platform. Responsibilities include creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to spearhead our AI Platform division. This team is responsible for creating cutting-edge software solutions that enhance the Snorkel Flow platform. Responsibilities include developing services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    About the RoleWe are seeking an experienced Engineering Manager to lead our AI Platform team at Snorkel AI, Inc. This is a unique opportunity to join a cutting-edge technology company and contribute to the development of innovative AI solutions.Key ResponsibilitiesLead a team of talented engineers to design, develop, and deploy large-scale data-focused AI...


  • San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.:Decagon AI, Inc. is at the forefront of developing sophisticated conversational AI solutions tailored for enterprise needs. Our innovative AI agents are designed to deliver a customer support experience that closely resembles human interaction, empowering businesses to enhance their customer service capabilities and streamline their...


  • San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.:Decagon AI, Inc. is at the forefront of developing sophisticated conversational AI systems tailored for enterprise applications. Our innovative solutions have attracted a diverse clientele, enhancing customer interactions through human-like support capabilities.Position Overview:We are seeking an experienced AI Engineer to contribute...

  • AI Solutions Engineer

    2 weeks ago


    San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.:Decagon AI, Inc. is at the forefront of developing sophisticated conversational AI solutions tailored for enterprises. Our innovative AI agents are designed to deliver a customer support experience that mirrors human interaction, empowering businesses to enhance their customer service capabilities and streamline their customer...


  • San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.:Decagon AI, Inc. is at the forefront of developing sophisticated conversational AI solutions tailored for enterprise needs. Our innovative AI agents are designed to deliver a customer support experience that mirrors human interaction, empowering businesses to enhance their customer service capabilities and streamline their operational...