Software Engineer, ML Inference Compiler

2 weeks ago


Palo Alto, CA, United States Tesla Full time
What to Expect

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus.

In this role, you will be responsible for developing and maintaining a compiler toolchain to support pre-silicon development, enabling efficient compilation and optimization of neural networks for Tesla's next generation of custom hardware accelerators. You will collaborate closely with AI Engineers, Hardware Engineers, and cross-functional teams to build out the full inference stack, ensuring maximum performance extraction from our hardware through simulation and emulation in pre-silicon environments.

The inference stack development is purpose-driven: deployment and analysis of production models inform the team's direction, and the team's work immediately impacts performance and the ability to deploy more and more complex models. With a cutting-edge co-designed MLIR compiler and runtime architecture, and full control of the hardware, the compiler has access to traditionally unavailable features that can be leveraged via novel compilation approaches to generate higher performance models, even in pre-silicon stages.

What You'll Do
  • Write, debug, and maintain robust software for the Autopilot and Humanoid robot AI inference stack, with a focus on toolchain infrastructure for pre-silicon support
  • Work with AI and HW teamsto enable efficient compilation, simulation/emulation of AI models on next generation hardware architectures
  • Bring up compiler backends for future versions of Tesla hardware accelerators, including presilicon validation through emulation and simulation tools
  • Focus on advanced compiler algorithms (heterogeneous scheduling, accelerator instruction scheduling, memory placement, etc.) tailored for custom ASICs
  • Keep up-to-date and collaborate with the ML/compiler community to ensure the stack remains compatible with the latest developments

What You'll Bring
  • Strong familiarity with CPU/GPU/ASICs
  • Experience with pre-silicon workflows, including hardware emulation and simulation
  • Proficient C/C++ programming C/C++ including modern C/C++ (C++14/17/20)
  • Basic Python proficiency
  • Prior experience working with compilers, with a focus on middle and backend. Experience with ML compilers and frameworks is a plus (e.g. MLIR, XLA, TensorRT, LLVM, JAX, TVM)
  • Degree in Engineering, Computer Science, or equivalent in experience and evidence of exceptional ability

Compensation and Benefits Benefits

Along with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits at day 1 of hire:

  • Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contribution
  • Company Paid (Health Savings Account) HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA
  • Healthcare and Dependent Care Flexible Spending Accounts (FSA)
  • 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
  • Company paid Basic Life, AD&D, short-term and long-term disability insurance
  • Employee Assistance Program
  • Sick and Vacation time (Flex time for salary positions), and Paid Holidays
  • Back-up childcare and parenting support resources
  • Voluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
  • Weight Loss and Tobacco Cessation Programs
  • Tesla Babies program
  • Commuter benefits
  • Employee discounts and perks program
Expected Compensation $132,000 - $330,000/annual salary + cash and stock awards + benefits

Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment.


  • ML Compiler Engineer

    2 weeks ago


    Palo Alto, CA, United States Rivian Full time

    About RivianRivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. As a company, we constantly challenge what's possible, never simply accepting what has always been done. We reframe old problems, seek new solutions and operate...


  • Palo Alto, CA, United States Rivian Full time

    About RivianRivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. As a company, we constantly challenge what's possible, never simply accepting what has always been done. We reframe old problems, seek new solutions and operate...


  • Palo Alto, CA, United States Rivian Full time

    About RivianRivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. As a company, we constantly challenge what's possible, never simply accepting what has always been done. We reframe old problems, seek new solutions and operate...


  • Palo Alto, CA, United States Tesla Full time

    What to Expect As a member of the Foundation Inference Infrastructure team, you will design & implement a diverse set of backend services and tools that power autonomy software and hardware development processes. The systems you build will have a large impact on autonomy, from accelerating chip design & Machine Learning workflows, compiler development, model...


  • Palo Alto, CA, United States Tesla Full time

    What to Expect As a member of the Foundation Inference Infrastructure team, you will design & implement a diverse set of backend services and tools that power autonomy software and hardware development processes. The systems you build will have a large impact on autonomy, from accelerating chip design & Machine Learning workflows, compiler development, model...


  • Palo Alto, CA, United States Tesla Full time

    What to Expect As a member of the Foundation Inference Infrastructure team, you will design & implement a diverse set of backend services and tools that power autonomy software and hardware development processes. The systems you build will have a large impact on autonomy, from accelerating chip design & Machine Learning workflows, compiler development, model...


  • Palo Alto, CA, United States Tesla Full time

    What to Expect As a member of the Foundation Inference Infrastructure team, you will design & implement a diverse set of backend services and tools that power autonomy software and hardware development processes. The systems you build will have a large impact on autonomy, from accelerating chip design & Machine Learning workflows, compiler development, model...


  • Palo Alto, CA, United States Tesla Full time

    What to Expect Tesla's AI team is pushing the frontier of real-world machine learning, building models that reason, predict, and act with human-level physical intelligence. We train and deploy large-scale ML systems powering products from Autopilot to Optimus. As part of the Model Optimization group, you will work at the intersection of machine learning and...


  • Palo Alto, CA, United States SambaNova Systems Full time

    The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale. SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized...


  • Palo Alto, CA, United States SambaNova Systems Full time

    The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale. SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized...