Datacenter GPU Platform Performance Engineer

3 months ago


Santa Clara, United States Advanced Micro Devices , Inc. Full time

Overview:

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the worlds most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance_

Responsibilities:

Datacenter GPU Platform Performance Engineer

THE ROLE:

As part of our team, you'll be responsible for ensuring that AMD Instinct GPU-accelerated systems are operating at peak performance before being deployed to solve the world's most challenging problems. We're looking for a highly motivated candidate with expertise in GPU performance and familiarity with performance monitoring and tuning tools. The ideal candidate should also possess data science and communication skills to effectively convey their findings to engineering and business teams.


We value curiosity and innovation, and we're committed to providing a challenging and supportive environment where you can learn and grow. As you collaborate with your peers, you'll have the opportunity to make a real impact and contribute to our organization's success. We're looking for someone who's passionate about improving their skills and constantly seeking new ways to drive performance and efficiency. If you're passionate about performance engineering and want to make a real impact with customers deploying the latest AI breakthroughs, we encourage you to apply.

?

KEY RESPONSIBILITIES:

  • Define performance suite and best practices for measuring GPU-accelerated workloads to assess scalability and efficiency of AI models and algorithms
  • Benchmark and analyze AI workloads in single and multi-node configurations comparing against previous generations and our competitors
  • Perform comprehensive performance analysis and report findings for the entire platform including GPU, CPU, interconnects, network, software stack, etc.
  • Identify performance bottlenecks that impact data center GPU-accelerated workloads, tune and collaborate with other software teams to improve performance
  • Stay up to date with emerging technologies and trends and explore ways to improve the performance of GPU-accelerated workloads at scale

PREFERRED EXPERIENCE:

  • Solid knowledge of Artificial Intelligence (AI) and Machine Learning (ML) concepts and techniques, including deep learning, reinforcement learning, natural language processing, generative AI, and computer vision, as well as practical experience applying these concepts to solve real-world problems through research or work experience
  • Experience in benchmarking methodologies, performance analysis, workload profiling, performance monitoring and debugging tools
  • Advanced Linux OS, container (e.g. Docker) and GitHub skills
  • Programming skills with relevant languages such as Python or C/C++
  • Experience with deep learning frameworks like PyTorch and TensorFlow
  • Knowledge and interest in computer and GPU architecture
  • Experience with GPU acceleration with either AMD or Nvidia GPU compute products a plus
  • Inquiring mind, excellent problem-solving skills, and automation mindset

ACADEMIC CREDENTIALS:

  • B.S./M.S./PhD in Computer Science or Engineering or similar field
  • #HYBRID
  • #LI-RL1
Qualifications:

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position. You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMDs Employee Stock Purchase Plan. Youll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants needs under the respective laws throughout all stages of the recruitment and selection process.



  • Santa Clara, United States NVIDIA Full time

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers,...


  • Santa Clara, United States Nvidia Full time

    Senior Platform Software Engineer, AI Server - GPUlocationsUS, CA, Santa ClaraUS, Remotetime typeFull timejob requisition idJR1980965NVIDIA’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning —...


  • Santa Clara, United States NVIDIA Full time

    We are now looking for a Datacenter Product Engineer! NVIDIA Corporation is a world leader in visual computing technology. The GPU, which the company invented, serves as the visual cortex of modern computers and is at the heart of their products and services. NVIDIA has transformed into a specialized platform company that targets four large markets –...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and...


  • Santa Clara, California, United States AMD Full time

    JOIN AMD AND MAKE A DIFFERENCEAt AMD, we are dedicated to revolutionizing lives through our advanced technology, enhancing our industry, communities, and the global landscape. Our vision is to create exceptional products that propel next-generation computing experiences, serving as the foundation for data centers, artificial intelligence, personal computing,...


  • Santa Clara, California, United States NVIDIA Full time

    We are currently seeking a Lead Architect for GPU System Performance Optimization. The NVIDIA Platform Architecture team is in search of exceptional computer architects who possess a genuine enthusiasm for GPU-enhanced deep learning, data analysis, and high-performance computing. This role is pivotal in designing and developing the forthcoming generation of...


  • Santa Clara, United States TekWissen ® Full time

    Job Title: GPU development engineerWork Location: Santa Clara, CA 95054Duration: 9 Months Work Type: Contract Job Type: OnsitePay Rate: $57.79 - 86.69/Hourly/W2Overview: TekWissen Group is a workforce management provider throughout the USA and many other countries in the world. This Client is an American multinational semiconductor company based in Santa...

  • Datacenter Engineer

    2 weeks ago


    Santa Clara, United States Sustainable Talent Full time

    Job DescriptionJob DescriptionSustainable Talent is partnering with Nvidia a global leader who's been transforming computer graphics, PC gaming, and accelerated computing for over 25 years. We are looking for a Datacenter Engineer to support our client's IPP (Infrastructure, planning, and process) team. This is a W-2 full-time contract position based...


  • Santa Clara, United States Sustainable Talent Full time

    Job DescriptionJob DescriptionSustainable Talent is partnering with Nvidia a global leader who's been transforming computer graphics, PC gaming, and accelerated computing for over 25 years. We are looking for a Datacenter Systems Engineer to support our client's IPP (Infrastructure, planning, and process) team. This is a W-2 full-time contract...


  • Santa Clara, United States US Tech Solutions Full time

    Duration: 12 months contract Job Description: · This position is for an experienced engineer with GPU programming and optimizations skills, with a proven ability to analyse GPU codes and delivery of highly parallel solutions. · You will be part of a team developing and tuning a computational geometry application for Clients CPU and GPU...


  • Santa Clara, United States US Tech Solutions Full time

    Duration: 12 months contract Job Description: · This position is for an experienced engineer with GPU programming and optimizations skills, with a proven ability to analyse GPU codes and delivery of highly parallel solutions. · You will be part of a team developing and tuning a computational geometry application for Clients CPU and GPU platforms....


  • Santa Clara, California, United States Advanced Micro Devices, Inc Full time

    Overview:YOUR WORK AT AMD MAKES A DIFFERENCEWe are dedicated to enhancing lives through AMD technology, impacting our industry, communities, and the globe. Our goal is to create outstanding products that propel next-generation computing experiences – the foundational elements for data centers, artificial intelligence, personal computing, gaming, and...


  • Santa Clara, California, United States NVIDIA Full time

    We are currently seeking a Lead GPU System Architect to join our dynamic GPU team.NVIDIA's innovation in graphics and parallel computing is a cornerstone of our success, allowing us to deliver unparalleled performance in graphics processing. We are continually exploring avenues to enhance our GPU architecture and uphold our leadership position in the...


  • Santa Clara, United States netPolarity, Inc. (Saicon Consultants, Inc.) Full time

    Location: Santa Clara, CA (HYBRID) - Flexible - West coast candidates preferredDuration: 6-9 months contract W2 only + Extension Note: GPU programming skills critical (CUDA/ROCm, C++), parallel processingThe Person:A GPU software development / Library engineer with experience in writing GPU code to solve problems in computational geometry, capable of...


  • Santa Clara, United States netPolarity, Inc. (Saicon Consultants, Inc.) Full time

    Location: Santa Clara, CA (HYBRID) - Flexible - West coast candidates preferredDuration: 6-9 months contract W2 only + Extension Note: GPU programming skills critical (CUDA/ROCm, C++), parallel processingThe Person:A GPU software development / Library engineer with experience in writing GPU code to solve problems in computational geometry, capable of...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is on the lookout for a driven and innovative engineer with a strong foundation in system software and security expertise to join our Server Platform Software team. Your primary focus will be on offensive security initiatives for our Data Center Systems, including NVIDIA HGX, DGX, and MGX.Key Responsibilities:Detect vulnerabilities in our Data Center...

  • Datacenter Technician

    2 weeks ago


    Santa Clara, United States Sustainable Talent Full time

    Job DescriptionJob DescriptionSustainable Talent is partnering with Nvidia a global leader who's been transforming computer graphics, PC gaming, and accelerated computing for over 25 years. We are looking for a Datacenter Technician to support our client's IPP (Infrastructure, planning, and process) team. This is a W-2 full-time contract position...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA’s invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a highly skilled Senior GPU Performance Architect to join our AI Applications team at NVIDIA. As a key member of our architecture group, you will play a critical role in driving innovation and delivering cutting-edge performance in the field of artificial intelligence.The ideal candidate will have a strong background in computer science,...


  • Santa Clara, California, United States NVIDIA Full time

    We are seeking a dedicated Lead Software Performance Engineer to become a vital part of our innovative and dynamic team.NVIDIA has been at the forefront of technological innovation since the inception of the GPU, which has transformed the landscape of computer graphics and parallel computing. As we continue to pioneer advancements in AI, we are looking to...