Staff Performance Modelling Engineer

6 days ago

San Francisco CA, United States Flux Full time

The Role Were searching for a Staff Performance Modelling Engineer ( San Francisco) , to create and own the analytical and simulation models that steer OTPU architecture and software evolution. You will build functional simulators as well as high-fidelity, cycle-accurate models of our optical compute system. This role is critical to explore what-if design spaces, and deliver insights that directly influence our software, hardware, and optical roadmaps. This role sits at the crossroads of hardware architecture, software tooling and machine-learning workload analysis, perfect for an engineer who loves data-driven decision-making and fast iteration. Responsibilities Ownership: Define and deliver the technical vision and roadmap for your team that unlocks key strategic technical and business goals that are essential to the success of Flux. Collaboration: Partner closely with all engineering teams to help shape our overall system architecture and delivery while ensuring models reflect reality and reality meets performance goals. Champion Modelling: Educate peers on modelling methodology and champion data-driven design culture. Functional Simulator: Design, build, and maintain a functional simulator of the OPTU subsystem and full pipeline. Performance Simulator: Design and maintain architectural & cycle-accurate models of the OPTU subsystems and pipeline. Identify throughput, latency and utilisation hot-spots; propose architectural, or scheduling fixes. Workload Analysis & Bottleneck Hunting: Instrument benchmarks (LLMs, diffusion, graph workloads) to collect detailed traces. Design-Space Exploration: Run massive parameter sweeps with your functional and to understand tradeoffs and guide the software, hardware, and optical teams. Tooling & Automation: Develop Python/C++ tooling for trace parsing, statistical analysis and visualisation.Integrate models into CI so that every RTL commit gets a performance smoke test. Skills & Experience 7+ years building performance or power models for CPUs, GPUs, ASICs, or accelerators. Proven track record providing technical leadership to a team of 5~10 engineers, resulting in significant business impact. Strong coding ability in C++ and Python; experience with discrete-event or cycle-accurate simulators (e.g., gem5, SystemC, custom in-house). Strong grasp of computer-architecture fundamentals: memory systems, interconnects, queuing theory, Amdahl/Gustafson analysis. Familiarity with machine-learning workloads and common frameworks (PyTorch, TensorFlow, JAX). Comfort reading RTL or schematics and discussing micro-architectural trade-offs with hardware designers. Excellent data-visualisation and communication skills: able to turn millions of simulation samples into one decisive slide. Bachelors in EE, CS, Physics, Applied Maths or related; advanced degree preferred but not required. Personal or open-source projects in simulators, ML kernels, or performance analysis are a significant plus. Compensation & Benefits Competitive salary and stock options, youre not just part of the journey, you will own a piece of it. Based in our office in central San Francisco To foster collaboration in our high-growth environment, we require all employees to work from our SF office and live within a 45-minute commute. We offer an extra ($24,000/year) incentive for those living within 20 minutes. Due to U.S. export control regulations, candidates eligibility to work at Flux depends on their most recent citizenship or permanent residency status. We are generally unable to consider applicants whose most recent citizenship or permanent residence is in certain restricted countries (currently including Iran, North Korea, Syria, Cuba, Russia, Belarus, China, Hong Kong, Macau, and Venezuela). Applicants who have subsequently obtained citizenship or permanent residency in another country not subject to these restrictions may still be eligible. We do not accept unsolicited CVs from recruitment agencies, will not be liable for any fees, and prohibit unauthorised use of our company name in recruitment activities. #J-18808-Ljbffr

Performance Modelling Engineer

2 weeks ago

San Francisco, CA, United States PageBolt WordPress Full time

The Role Below covers everything you need to know about what this opportunity entails, as well as what is expected from applicants. We’re searching for a Staff Performance Modelling Engineer to create and own the analytical and simulation models that steer OTPU architecture and software evolution. You will build functional simulators as well as...
Staff CPU Performance Modeling Engineer

2 weeks ago

San Jose, United States Samsung Semiconductor, Inc. Full time

A leading technology firm in San Jose is seeking a highly skilled Staff Engineer specializing in CPU Performance Modeling. The role involves developing performance models, collaborating with architecture and design teams, and validating processor designs. Ideal candidates will have extensive experience in CPU architecture and performance modeling, strong...
Performance Modelling Engineer

1 week ago

San Francisco, CA, United States PageBolt WordPress Full time

The Role Were searching for a Staff Performance Modelling Engineer to create and own the analytical and simulation models that steer OTPU architecture and software evolution. You will build functional simulators as well as high-fidelity, cycle-accurate models of our optical compute system. This role is critical to explore what-if design spaces, and deliver...
Performance Modeling Engineer

1 week ago

San Jose, CA, United States Mirafra Technologies Full time

Performance Modeling Engineer – SystemC/TLM2 Performance Modeling and verification Develop, enhance, and maintain SystemC/TLM2 models for memory controllers, peripherals and interconnects, ensuring they accurately simulate the behavior and performance characteristics of the hardware. Collaborate with cross teams to integrate models into AMD tools used...
Staff Engineer, CPU Performance Modeling Engineer

2 weeks ago

San Jose, United States Samsung Semiconductor, Inc. Full time

Position Title: Staff Engineer, CPU Performance Modeling Engineer Please Note: To provide the best candidate experience amidst our high application volumes, each candidate is limited to 10 applications across all open jobs within a 6-month period. Advancing the World's Technology Together Our technology solutions power the tools you use every day –...
Architecture & Performance Modeling Engineer

6 days ago

San Francisco, California, United States Eridu AI Full time $140,000 - $200,000 per year

About Eridu AIEridu AI is a Silicon Valley-based hardware startup pioneering infrastructure solutions that accelerate training and inference for large-scale AI models. Today's AI performance is frequently limited by system-level bottlenecks. Eridu AI delivers multiple industry-first innovations across semiconductors, software, and systems to unlock greater...
Software Engineer

3 weeks ago

San Francisco, United States AI Fund Full time

Overview Join to apply for the Software Engineer - Model Performance role at AI Fund. Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make...
Software Engineer

3 weeks ago

San Francisco, United States AI Fund Full time

Overview Join to apply for the Software Engineer - Model Performance role at AI Fund. Are you passionate about advancing the application of artificial intelligence? We are looking for a Software Engineer focused on ML performance to join our dynamic team. This role is ideal for someone who thrives in a fast-paced startup environment and is eager to make...
Engineering Manager

20 hours ago

San Francisco, CA, United States Baseten Full time

ABOUT BASETEN Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. With...
Engineering Manager

4 weeks ago

San Francisco, United States Baseten Full time

Join Our Dynamic Team at BasetenJoin our dynamic team at Baseten, where we're revolutionizing AI deployment with cutting-edge inference infrastructure. Backed by premier investors such as IVP, Spark Capital, Greylock, and Conviction, we're trusted by leading enterprises and AI-driven innovatorsincluding Descript, Bland.ai, Patreon, Writer, and Robust...

Americas

Europe

Asia / Oceania

Africa

Staff Performance Modelling Engineer