AI Engineer, Reinforcement Learning

3 weeks ago


Palo Alto, United States Tesla Motors, Inc. Full time

What to Expect Scaling transformers, as well as more recent advances in Reinforcement Learning with Verifiable Rewards (RLVR), has created models with PhD level intelligence in a wide variety of subject areas - from Math to Social Sciences. Yet these models continue to struggle in real-world physical reasoning, often struggling to tell left from right. At Tesla AI, we want to develop Olympiad-level physical intelligence that will enable highly capable robots, both wheeled and legged. These models should be able to anticipate and reason about future movements of any object or scene at the level of a race car driver or professional athlete. To accomplish this, you will have access to petabytes of multimodal (video, audio, action etc.) real-world data from our global fleet of cars and robots, as well as Tesla's state-of-the-art compute resources. An added level of complexity with robotics is the requirement to run at real-time on local compute. This requirement acts as a forcing function for Tesla AI to develop models that optimize for the highest intelligence per byte of parameters. In this role, you will have the opportunity to work on post-training and reinforcement learning of large multimodal models with an emphasis on real-world physical intelligence. You will also get the chance to distill these large models to smaller models that will run on our state-of-the-art inference hardware. What You'll Do Create multimodal post-training and RLVR datasets that utilize Tesla's fleet of cars and robots Develop training infra necessary to run reinforcement learning on large multimodal models Develop downstream evaluations that can guide the tuning of these large models Distill large models into smaller models that can run in real-time on the local compute of our cars and robots What You'll Bring Proven experience in scaling and optimizing large AI models, with a strong understanding of infrastructure challenges and solutions, especially in the domain of reinforcement learning Proficiency in Python and a deep understanding of software engineering best practices In-depth knowledge of deep learning fundamentals, including distillation and reinforcement learning Experience with deep learning frameworks such as PyTorch, TensorFlow, or JAX Strong expertise in distributed computing and parallel processing techniques Demonstrated ability to work collaboratively in a cross-functional team environment Strong problem-solving skills and the ability to troubleshoot complex system-level issues Compensation and Benefits Benefits Aetna PPO and HSA plans > 2 medical plan options with $0 payroll deduction Family-building, fertility, adoption and surrogacy benefits Dental (including orthodontic coverage) and vision plans, both have options with a $0 paycheck contribution Company Paid (Health Savings Account) HSA Contribution when enrolled in the High Deductible Aetna medical plan with HSA Healthcare and Dependent Care Flexible Spending Accounts (FSA) 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits Company paid Basic Life, AD&D, short-term and long-term disability insurance Employee Assistance Program Sick and Vacation time (Flex time for salary positions), and Paid Holidays Back-up childcare and parenting support resources Voluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance Weight Loss and Tobacco Cessation Programs Tesla Babies program Commuter benefits Employee discounts and perks program Expected Compensation $140,000 - $420,000/annual salary + cash and stock awards + benefits Pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience. The total compensation package for this position may also include other elements dependent on the position offered. Details of participation in these benefit plans will be provided if an employee receives an offer of employment. #J-18808-Ljbffr



  • Palo Alto, CA, United States 1X Technologies AS Full time

    AI Research Engineer, Reinforcement Learning | AI & Robotics Location: Palo Alto, CA (on-site)About 1X We build humanoid robots that work alongside people to solve labor shortages and create abundance. The Role As a Research Engineer focused on Reinforcement Learning, you will be responsible for teaching NEO new capabilities through RL algorithms. You'll...


  • Palo Alto, CA, United States 1X Technologies AS Full time

    AI Research Engineer, Reinforcement Learning | AI & Robotics Location: Palo Alto, CA (on-site) We build humanoid robots that work alongside people to solve labor shortages and create abundance. The Role As a Research Engineer focused on Reinforcement Learning, you will be responsible for teaching NEO new capabilities through RL algorithms. You'll work across...


  • Palo Alto, United States Tesla Full time

    What to Expect Scaling transformers, as well as more recent advances in Reinforcement Learning with Verifiable Rewards (RLVR), has created models with Ph-D level intelligence in a wide variety of subject areas - from Math to Social Sciences. Yet these models continue to struggle in real-world physical reasoning, often struggling to tell left from right. At...


  • Palo Alto, United States Tesla Full time

    Machine Learning Engineer, Motion Planning, Self-DrivingJoin to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at TeslaMachine Learning Engineer, Motion Planning, Self-Driving1 year ago Be among the first 25 applicantsJoin to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at TeslaWhat To ExpectTesla...


  • Palo Alto, United States Tesla Full time

    Machine Learning Engineer, Motion Planning, Self-Driving Join to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at Tesla Machine Learning Engineer, Motion Planning, Self-Driving 1 year ago Be among the first 25 applicants Join to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at Tesla What To Expect...


  • Palo Alto, CA, United States Tesla Full time

    Machine Learning Engineer, Motion Planning, Self-Driving Join to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at Tesla Machine Learning Engineer, Motion Planning, Self-Driving 1 year ago Be among the first 25 applicants Join to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at Tesla What To Expect...


  • Palo Alto, CA, United States Tesla Full time

    Machine Learning Engineer, Motion Planning, Self-Driving Join to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at Tesla Machine Learning Engineer, Motion Planning, Self-Driving 1 year ago Be among the first 25 applicants Join to apply for the Machine Learning Engineer, Motion Planning, Self-Driving role at Tesla What To Expect...


  • Palo Alto, United States Tesla Full time

    What to Expect Tesla AI is solving robust embodied intelligence through humanoid robots.The goal ofourreinforcement learningteamis to build anddemonstratea general robot learning system that canleverageAI to perform complex physical tasks, ranging from full body locomotion, precise manipulation, and more.Our reinforcement and imitation learningengineersare...


  • Palo Alto, CA, United States Tesla Full time

    What to Expect Tesla AI is solving robust embodied intelligence through humanoid robots.The goal ofourreinforcement learningteamis to build anddemonstratea general robot learning system that canleverageAI to perform complex physical tasks, ranging from full body locomotion, precise manipulation, and more.Our reinforcement and imitation learningengineersare...


  • Palo Alto, CA, United States Tesla Full time

    What to Expect Tesla AI is solving robust embodied intelligence through humanoid robots.The goal ofourreinforcement learningteamis to build anddemonstratea general robot learning system that canleverageAI to perform complex physical tasks, ranging from full body locomotion, precise manipulation, and more.Our reinforcement and imitation learningengineersare...