Accelerator Microarchitecture Performance Modeling
4 days ago
Responsibilities and opportunities in this role include - functional and cycle-accurate simulator development, architectural and microarchitectural design-space exploration for programmable accelerators, as well as analysis and optimization of modern, highly-parallel applications.
Our mission is to reimagine silicon and create accelerated computing platforms that will transform the industry. You will have the opportunity to work with some of the most talented and passionate engineers in the world to create designs that push the envelope on performance, energy-efficiency, programmability and scalability.
You will also have the opportunity to explore many adjacent areas of research and engineering, cross-cutting many levels of abstraction that must be scaled when building computing machinery - ISA design, application software, compiler optimization, RTL design, RTL correlation, design verification, test writing, and power/area analysis.
We offer a fun, creative, collaborative and flexible work environment, where you can contribute to our vision of building server-class compute machines that fulfill the promise and potential of hardware-software co-design, while also learning every day. Requirements
- In-depth knowledge of CPU/GPU Computer Architecture and Microarchitecture.
- Excellent coding skills in C/C++ languages
- Strong understanding of workloads and benchmarks in the Machine Learning space
- Solid appreciation for the basics of SIMT processing, cache and memory hierarchies
- Knowledge of performance modeling concepts - analytical, functional and cycle-accurate modelingKnowledge of performance improvement concepts - bottleneck analysis, latency hiding, speculative execution, shared resource arbitration, scheduling, buffer sizing, replacement policies
- Ability to work well in a team, take ownership of tasks, embrace aggressive schedules, be self motivated to learn, seek help, think clearly and communicate effectively
- Performance modeling - develop functional and timing simulators in C++ modeling the programmable processing cores in a Data Parallel Accelerator.
- Performance analysis - configure and use the simulator to explore the architectural and microarchitectural design space.
- Design Space Exploration - influence the design choices based on experiments and studies
- Performance testing - develop tests to evaluate quality of model and RTL design
- Performance debug - identify and fix performance bottlenecks in tests/workloads/simulator
- Performance correlation - identify correct performance targets for tests/workloads and ensure that the RTL design meets that target
- Workload analysis - develop a deep understanding of the characteristics of workloads in the target market - machine learning, data analytics, graph analytics
- Bachelor's degree with 2-4 years of experience in a relevant field
- Master's degree with 1-2 years of experience in a relevant field
- PhD with internship experience in a relevant field
-
CPU Micro-Architect
2 days ago
Austin, Texas, United States Samsung Electronics Full time $180,200 - $297,200Position SummarySamsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
-
Austin, Texas, United States Apple Full time $200,000 - $250,000 per yearDo you intrinsically see the importance in every detail? As part of our Silicon Technologies group, you'll help design and manufacture our next-generation, high-performance, power-efficient GPU You'll ensure Apple products and services can seamlessly and expertly handle the tasks that make them beloved by millions. Joining this group means crafting and...
-
Performance Architect
12 hours ago
Austin, Texas, United States Advanced Micro Devices, Inc Full timeWHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...
-
Senior RTL Design Engineer
7 days ago
Austin, Texas, United States Mythic Full time $120,000 - $225,000 per yearWe're hiring experienced RTL Design Engineers to play a key role in designing and implementing the components that will bring our next-generation AI processors to life.About UsMythic is building the future of AI computing with breakthrough analog technology that delivers 100× the performance of traditional digital systems at the same power and cost. This...
-
Senior RTL Design Engineer
7 days ago
Austin, Texas, United States Mythic Full time $120,000 - $225,000We're hiring experienced RTL Design Engineers to play a key role in designing and implementing the components that will bring our next-generation AI processors to life.About Us:Mythic is building the future of AI computing with breakthrough analog technology that delivers 100× the performance of traditional digital systems at the same power and cost. This...
-
Lead CPU Architect
1 day ago
Austin, Texas, United States Samsung Electronics Full timePosition SummarySamsung, a world leader in advanced semiconductor technology, is founded on a simple philosophy – the endless pursuit of excellence will create a better world for all. At Samsung Austin Research and Development Center (SARC) and Advanced Computing Lab (ACL), we are building a center of excellence for Intellectual Property (IP) that is...
-
Austin, Texas, United States Qualcomm Full time $127,000 - $190,800 per yearCompanyQualcomm Technologies, Inc.Job AreaEngineering Group, Engineering Group > DSP Architecture and DesignGeneral SummaryQualcomm is seeking a low-level embedded engineer with a strong foundation in software and processor architecture to help shape architectural features and deliver measurable performance enhancements on Qualcomm's Neural Processing Unit...
-
Principal Performance Engineer
7 days ago
Austin, Texas, United States Arm Full time $200,000 - $300,000 per yearJob ID Date posted Oct. 28, 2025Location Austin, TexasCategory Hardware EngineeringArm technology is becoming the platform of choice for compute and AI. The Arm System Engineering team's mission is to architect, design, and develop server and rack-level infrastructure for at-scale datacenter deployments. The team capabilities span across system hardware,...
-
Austin, Texas, United States Advanced Micro Devices, Inc Full time $100,000 - $150,000 per yearWHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...
-
Austin, Texas, United States Meta Full time $93,360 - $196,688 per yearReality Labs (RL) focuses on delivering Meta's vision through AI-first devices that leverage Mixed Reality (MR) and Augmented Reality (AR). The compute performance and power efficiency requirements of Mixed and Augmented Reality require custom silicon. Reality Labs Silicon team is driving the state of the art forward with highly integrated SoCs that leverage...