GPU Performance Optimization Engineer

1 week ago


Austin Texas, United States Advanced Micro Devices, Inc Full time

Overview:

MAKE A DIFFERENCE WITH AMD TECHNOLOGY
At AMD, we are committed to transforming lives through our innovative technology, enhancing our industry, communities, and the world.

Our goal is to create exceptional products that accelerate next-generation computing experiences, serving as the foundation for data centers, artificial intelligence, personal computing, gaming, and embedded systems.

Our culture at AMD is centered around pushing the boundaries of innovation to tackle the world's most pressing challenges. We aim for excellence in execution while fostering a direct, humble, collaborative, and inclusive environment that values diverse perspectives.

Together at AMD, we advance_

Responsibilities:

YOUR ROLE:
We are looking for a driven and skilled GPU Performance Optimization Engineer to join our innovative team. In this position, you will play a crucial role in enhancing and achieving optimal performance for GPU clusters.

The ideal candidate will possess a robust background in GPU architectures, parallel computing, and practical experience in system-level performance tuning and debugging techniques.

Our team promotes continuous technical innovation, celebrating successes while supporting ongoing career development.

KEY RESPONSIBILITIES:

Performance Enhancement:

Collaborate with hardware and software teams to improve the overall performance of GPU clusters, concentrating on elements such as RDMA throughput, latency, and collective communications.


Benchmarking and Evaluation:

Design and implement thorough benchmarking strategies to evaluate baseline performance, identify bottlenecks, and pinpoint areas for enhancement within GPU cluster environments.


Scalability Assessment:
Test the scalability of GPU clusters by conducting comprehensive evaluations under diverse workloads, ensuring optimal performance across various cluster sizes, configurations, and networking technologies (IB & RoCE).

Performance Analysis:
Employ profiling tools and methodologies to assess and identify performance bottlenecks, delivering actionable insights for improvement.

Performance Optimization:
Execute optimization strategies, including but not limited to protocol enhancements, load balancing techniques, and parallel processing optimizations.

Documentation:

Produce detailed documentation of performance analyses, tuning efforts, and results, providing clear and concise reports for internal teams and stakeholders.


Collaboration:

Engage closely with cross-functional teams, including hardware engineers, software developers, and system architects, to integrate performance enhancements into the GPU cluster architecture.


Continuous Development:

Stay informed about the latest advancements in GPU architectures, parallel processing, and emerging technologies to drive ongoing improvements in GPU cluster performance.


PREFERRED EXPERIENCE:
Demonstrated experience in optimizing GPU cluster performance.

Strong grasp of GPU architectures, parallel computing principles, and network protocols.

Proficiency in scripting languages (e.g., Python, Bash) for automation and performance evaluation.

Experience with system-level performance analysis tools and methodologies for GPU clusters.

Analytical mindset with exceptional problem-solving and debugging abilities.

Familiarity with cluster management tools and systems.

Excellent communication and teamwork skills for effective collaboration.

Experience with RDMA network configuration, troubleshooting, and performance optimization.

Knowledge of Linux kernel networking.

Experience in machine learning and/or HPC system design.

ACADEMIC CREDENTIALS:
Bachelor's or Master's degree in computer science or equivalent experience.

Qualifications:
At AMD, your base pay is just one component of your total rewards package.

Your base pay will depend on your skills, qualifications, experience, and location within the hiring range for the position.

You may qualify for incentives based on your role, such as an annual bonus or sales incentive.

Many AMD employees have the opportunity to own shares of AMD stock and receive discounts when purchasing AMD stock through AMD's Employee Stock Purchase Plan.

You will also be eligible for competitive benefits, which are described in more detail.

AMD is an equal opportunity employer and welcomes applications from all qualified candidates without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.



  • Austin, Texas, United States Advanced Micro Devices , Inc. Full time

    Overview:MAKE A DIFFERENCE WITH AMD TECHNOLOGYAt AMD, we are dedicated to transforming lives through our innovative technology, enhancing our industry, communities, and the world. Our mission is to develop exceptional products that propel next-generation computing experiences, serving as the foundation for data centers, artificial intelligence, personal...


  • Austin, Texas, United States Advanced Micro Devices, Inc Full time

    Overview:MAKE A DIFFERENCE WITH AMD TECHNOLOGYAt AMD, we are committed to enhancing lives through our innovative technology that impacts our industry, communities, and the globe. Our goal is to create exceptional products that propel the future of computing experiences – essential components for data centers, artificial intelligence, personal computing,...


  • Austin, United States Samsung Full time

    Sr. GPU Performance Engineerremote typeHybridlocations3900 N Capital of Texas Hwy, Austin, TX, USA3655 N 1st St, San Jose, CA, USAjob requisition idR88199Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin...


  • Austin, Texas, United States Samsung Full time

    Sr. GPU Performance Engineerremote typeHybridlocations3900 N Capital of Texas Hwy, Austin, TX, USA3655 N 1st St, San Jose, CA, USAjob requisition idR88199Position SummarySamsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin...


  • Austin, Texas, United States NVIDIA Full time

    At NVIDIA, we are at the forefront of innovation across various sectors, including Automotive, Virtual Reality, Gaming, Deep Learning, and High-Performance Computing. Experience the impact of your contributions as developers utilize your tools to debug, profile, and analyze the performance of their systems and applications through the low-level library you...


  • Austin, United States Samsung Electronics Perú Full time

    Sr. GPU Performance Engineer Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for...


  • Austin, United States Samsung Electronics Co., Ltd. Full time

    Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy - the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...


  • Austin, Texas, United States NVIDIA Full time

    Are you eager to explore how GPU efficiency drives advancements in fields like gaming, artificial intelligence, and autonomous vehicles? Do you thrive on pushing your limits and wish to make a significant impact at a leading technology firm? We are seeking a Graphics Performance Optimization Engineer to enhance performance and contribute to the development...


  • Austin, Texas, United States Apple Full time

    SummaryPosted: Nov 28, 2023Role Number: Imagine what you could do here At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Dynamic, hard-working people and inspiring, innovative technologies are the norm...


  • Austin, Texas, United States Apple Full time

    SummaryPosted: Nov 30, 2023Role Number: Imagine what you could do here At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Dynamic, hard-working people and inspiring, innovative technologies are the norm...


  • Austin, Texas, United States Advanced Micro Devices, Inc Full time

    About the Role:The Power Attainment Engineer - Data Center GPU will assume responsibility for mostly post-silicon activities related to power attainment and optimization of Advanced Micro Devices, Inc. Datacenter products. This role is essential to the success of Advanced Micro Devices, Inc. as a growing company.Key Responsibilities:Actively participate in...


  • Austin, Texas, United States Apple Full time

    Job SummaryWe are seeking a highly skilled GPU FE Design Integration Specialist to join our Silicon Technologies group at Apple. As a key member of our team, you will play a critical role in designing and manufacturing our next-generation, high-performance, power-efficient GPU.Key ResponsibilitiesOwn RTL integration, design analysis, qualification,...


  • Austin, Texas, United States NVIDIA Full time

    About the RoleWe are seeking an experienced Senior Compiler Optimization Engineer to join our Compute Compiler Team at NVIDIA.Our team is responsible for enhancing CUDA and other compute compilers to fully leverage the power of NVIDIA GPUs across various computational workloads like deep learning, scientific computation, and self-driving technology.Key...


  • Austin, Texas, United States Apple Full time

    SummaryPosted: Aug 10, 2023Role Number: Do you love crafting elegant solutions to highly sophisticated challenges? As part of our Silicon Technologies group, you'll help design and manufacture our next-generation, high-performance, power-efficient processor, system-on-chip (SoC) You'll ensure Apple products and services can seamlessly and efficiently handle...


  • Austin, Texas, United States NVIDIA Full time

    About the RoleWe are seeking a highly skilled Senior Compiler Optimization Engineer to join our Compute Compiler Team at NVIDIA. As a key member of our team, you will play a critical role in delivering features and improvements to CUDA and other compute compilers, enabling the realization of NVIDIA GPUs' full potential for a wide range of computational...


  • Austin, Texas, United States SAMSUNG Full time

    **Job Summary**Samsung, a global leader in advanced semiconductor technology, is seeking a highly skilled GPU RTL Power Architect to join our team at the Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL). As a key member of our GPU RTL team, you will be responsible for optimizing power efficiency for our GPU blocks and...


  • Austin, United States NVIDIA Full time

    We are looking for an experienced Senior Compiler Optimization Engineer for an exciting role in our Compute Compiler Team. We deliver features and improvements to CUDA and other compute compilers to better realize the potential of NVIDIA GPUs for a growing range of computational workloads, ranging from deep learning, scientific computation, and self-driving...


  • Austin, United States Nvidia Full time

    We are searching for a Senior Backend Compiler Engineer for an exciting and fun role in our GPU Software organization. Our Compiler team is responsible for constructing and emitting the highest performance GPU machine instructions for Graphics (OpenGL, Vulkan, DX) and Compute (CUDA, PTX, OpenCL, Fortran, C++). This team is comprised of worldwide leading...


  • Austin, United States Samsung Electronics America Inc Full time

    Position Summary Samsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy - the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...


  • Austin, Texas, United States Advanced Micro Devices, Inc Full time

    Job SummaryWe are seeking a highly skilled Server Performance Architect to join our team at Advanced Micro Devices, Inc. This is a unique opportunity to work on cutting-edge server performance optimization and simulation.Key ResponsibilitiesDevelop and implement advanced server performance simulation methodologies to analyze and optimize high-performance...