Software Engineer, AI Inference Co Design

4 weeks ago

Palo Alto, United States Tesla Motors, Inc. Full time

What to ExpectThe AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research, compiler development, kernel optimization, math and HW design. You will work extensively with AI engineers and come up with novel techniques to quantize models, improve precision and explore non-standard alternate architectures. You will be developing optimized micro kernels using a cutting-edge MLIR compiler and solve the performance bottlenecks needed to achieve real-time latency needed for self-driving and humanoid robots. You will work closely with the HW team and bring state-of-the-art HW architecture techniques to our next generation HW SoCs.What You\'ll DoResearch and implement state-of-the-art machine learning techniques to achieve high performance on our edge hardwareOptimize bottlenecks in the inference flow, make precision/performance tradeoff decisions and figure out novel techniques to improve hardware utilization and throughputImplement/improve highly performant micro kernels for Tesla\'s AI ASICWork with AI teams to design edge friendly neural network architecturesCollect extensive performance benchmarks (latency, throughput, power) and work with HW teams to shape the next generation of inference hardware, balancing performance with versatilityExperiment with numerical methods and alternative architecturesCollaborate with the compiler infrastructure for programmability and performanceWhat You\'ll BringDegree in Engineering, Computer Science or equivalent in experience and evidence of exceptional abilityProficiency with Python and C++, including modern C++ (14/17/20)Experience with AI networks, such as CNNs, transformers, and diffusion model architectures, and their performance characteristicsUnderstanding of GPU, SIMD, multithreading and/or other accelerators with vectorized instructionsExposure to computer architecture and chip architecture/micro-architectureSpecialized experience in one or more of the following machine learning/deep learning domains: Model compression, hardware aware model optimizations, hardware accelerators architecture, GPU/ASIC architecture, machine learning compilers, high performance computing, performance optimizations, numerics and SW/HW co-designCompensation and BenefitsBenefitsAetna PPO and HSA plans > 2 medical plan options with $0 payroll deductionFamily-building, fertility, adoption and surrogacy benefitsDental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contributionCompany Paid (Health Savings Account) HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSAHealthcare and Dependent Care Flexible Spending Accounts (FSA)401(k) with employer match, Employee Stock Purchase Plans, and other financial benefitsCompany paid Basic Life, AD&D, short-term and long-term disability insuranceEmployee Assistance ProgramSick and Vacation time (Flex time for salary positions), and Paid HolidaysBack-up childcare and parenting support resourcesVoluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insuranceWeight Loss and Tobacco Cessation ProgramsTesla Babies programCommuter benefitsEmployee discounts and perks programExpected Compensation$132,000 - $330,000/annual salary + cash and stock awards + benefitsPay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment. #J-18808-Ljbffr

Software Engineer, Optimus Inference Co Design

4 weeks ago

Palo Alto, United States Tesla Full time

Software Engineer, Optimus Inference Co DesignJoin to apply for the Software Engineer, Optimus Inference Co Design role at TeslaSoftware Engineer, Optimus Inference Co DesignJoin to apply for the Software Engineer, Optimus Inference Co Design role at TeslaWhat To ExpectThe AI inference co-design team’s goal is to take research models and make them run...
Software Engineer, AI Inference Co Design

3 weeks ago

Palo Alto, United States Tesla Full time

What to Expect The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research, compiler development, kernel optimization, math and HW design. You will work extensively with AI engineers and...
Software Engineer, Optimus Inference Co Design

3 weeks ago

Palo Alto, United States Tesla Full time

What to Expect The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Optimus humanoid robot programs, with applications extending to Autopilot. This unique role lies at the intersection of AI research, compiler development, kernel optimization, math and HW design...
Internship, Software Engineer, AI Inference Co-design

3 weeks ago

Palo Alto, United States Tesla Full time

What to Expect Consider before submitting an application: This position is expected to start around January 2026 and continue through the Winter/Spring term (approximately April 2026) or into Summer 2026 if available and there is an opportunity to do so. We ask for a minimum of 12 weeks, full-time and on-site, for most internships. Our internship program is...
Internship, Software Engineer, AI Inference Co-design

2 weeks ago

Palo Alto, CA, United States Tesla Full time

What to Expect Consider before submitting an application: This position is expected to start around January 2026 and continue through the Winter/Spring term (approximately April 2026) or into Summer 2026 if available and there is an opportunity to do so. We ask for a minimum of 12 weeks, full-time and on-site, for most internships. Our internship program is...
Software Engineer, AI Inference Codesign

3 weeks ago

Palo Alto, United States Tesla Full time

What to Expect The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research, compiler development, kernel optimization, math and HW design. You will work extensively with AI engineers and...
Software Engineer, AI Performance Modeling

3 weeks ago

Palo Alto, United States Tesla Full time

What to Expect The AI co-design team is dedicated to developing and optimizing AI systems that can scale efficiently to thousands of compute nodes, enabling large-scale training, reinforcement learning at scale, and real-time inference for Autopilot and Optimus. Our goal is to push the boundaries of performance, power, and latency on our custom-designed...
Software Engineer, AI Performance Modeling

4 weeks ago

Palo Alto, United States Tesla Motors, Inc. Full time

What to Expect The AI co-design team is dedicated to developing and optimizing AI systems that can scale efficiently to thousands of compute nodes, enabling large-scale training, reinforcement learning at scale, and real-time inference for Autopilot and Optimus. Our goal is to push the boundaries of performance, power, and latency on our custom-designed...
Software Engineer, AI Inference Codesign

5 days ago

Palo Alto, CA, United States Tesla Full time

What to Expect The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research, compiler development, kernel optimization, math and HW design. You will work extensively with AI engineers and...
Software Engineer, AI Inference Codesign

1 week ago

Palo Alto, CA, United States Tesla Full time

What to Expect The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Autopilot and Optimus programs. This unique role lies at the intersection of AI research, compiler development, kernel optimization, math and HW design. You will work extensively with AI engineers and...

Americas

Europe

Asia / Oceania

Africa

Software Engineer, AI Inference Co Design