Data Center GPU Validation Technical Lead

2 months ago


Austin, United States Advanced Micro Devices, Inc. Full time

Overview:

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

Responsibilities:

THE ROLE:

The Datacenter Accelerated Computing Validation Team is looking for dynamic and energetic engineers to join our growing team. As a key contributor to the validation of AMD's GPU-based datacenter accelerators, you will work in cross-functional teams to deliver industry-leading products for Artificial Intelligence (AI), Machine Learning (ML), and High-Performance Computing (HPC) applications.

Specifically, this role is focused on manufacturing test automation, validation, and cross-functional program management to help our manufacturing partners deliver AMD's leading datacenter GPUs.

THE PERSON:

A candidate who:

  • Has strong analytical thinking and problem-solving skills with excellent attention to detail
  • Has experience working with cross-functional stakeholders to drive yield improvements to the manufacturing process
  • Is a team player who can also work efficiently with minimal supervision
  • Has a strong interest in the manufacturing of GPU hardware and knowledge of AI, ML, and HPC products
  • Is very familiar with Linux and computer hardware validation

KEY RESPONSIBILITIES:

As a DC GPU Manufacturing Validation and Test Automation Engineer, you will work with a cross-functional organization to plan, develop, and validate test content and automation for AMD's manufacturing test program.

Responsibilities include:

  • Identify AI, ML, and HPC workloads needed to validate and stress AMD DC GPUs
  • Collaborate with SW and HW teams to create and automate validation programs for high-volume manufacturing environments
  • Develop Python-based automated test suites and content for computer hardware validation
  • Work with stakeholders to drive yield and test quality improvements
  • Work with stakeholders globally to develop documentation, improve processes, and support the DC GPU manufacturing program
  • Develop data analysis and reporting to support stakeholders with data-driven decision making
  • Participate in and lead discussions on manufacturing test program updates, constraints, and optimizations

IDEAL CANDIDATE:

  • Experience with industry standard benchmarks, AI/ML/HPC applications
  • History of applied Python development with a focus on object-oriented design and adherence to best practices
  • Experience with hardware validation with a focus on high-volume and contract manufacturing
  • Experience working systematically through manufacturing yield issues and generating solutions and follow-ups
  • Must have strong analytical skills for test creation and debug
  • Must have strong communication and collaboration skills
  • Must be a self-starter and be able to independently drive tasks to completion
  • Hands-on experience in datacenter system architecture is an asset

ACADEMIC CREDENTIALS:

  • Bachelor's or master's degree in Electrical/Computer Engineering, Mathematics, Computer Science, or an equivalent field preferred

LOCATION:

Markham, ON

Qualifications:

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan. You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.


