LLM Algorithmic Optimization Engineer

2 weeks ago


San Jose, California, United States NIO Full time
Job Title: LLM Algorithmic Optimization Engineer

NIO Inc. is a pioneering company in the premium smart electric vehicle market, dedicated to shaping a joyful lifestyle. Our mission is to build a community through smart electric vehicles, sharing joy and growing together with users.

We design, develop, and manufacture premium smart electric vehicles, driving innovations in next-generation technologies, including autonomous driving, digital technologies, electric powertrains, and batteries. Our unique approach differentiates us through continuous technological breakthroughs and innovations, such as industry-leading battery swapping technologies, Battery as a Service (BaaS), and proprietary autonomous driving technologies and Autonomous Driving as a Service (ADaaS).

Job Description:Key Responsibilities:
  • Conduct research and apply cutting-edge technologies to optimize Large Language Models (LLMs) and multimodal models, exploring and implementing core algorithmic optimization on heterogeneous architectures for efficient LLM inference and deployment across distributed and heterogeneous hardware environments.
  • Focus on model optimization from a systems perspective, ensuring efficient deployment in the vehicle's digital cockpit and advanced driving (AD) domain.
  • Collaborate with cross-functional teams to integrate optimized models into real-world automotive applications.
  • Contribute to the entire pipeline from research, development, and testing to deployment on hardware, including GPUs and other distributed systems.
Requirements:
  • Currently pursuing or completed a PhD or Master's degree in Computer Science, Computer Engineering, Applied Mathematics, Communications, Electronics, or a related field with relevant research projects and publications.
  • Strong understanding of GPU/NPU architecture and optimization techniques to identify and address bottlenecks.
  • Proficient in LLM and VLM architectures and algorithms, familiar with transformer-based NLP / Audio / CV algorithms and technologies.
  • Proficiency in Python and experience with AI-related training and inference tools such as PyTorch.
  • Proficiency in C/C++ programming, familiar with at least one commonly used LLM inference engine.
  • Hands-on experience with model-serving frameworks such as Open Neural Network Exchange (ONNX).
  • Familiarity with debugging code in distributed computing environments.
  • Experience in LLM inference optimization on resource-constrained edge devices is a plus.
Preferred Qualifications:
  • Ph.D. in computer science, artificial intelligence, or related fields; or Masters degree + 3 years of relevant industry experience.
  • Experience in inference optimization techniques of deep learning models or libraries on hardware architectures.
  • Familiar with microkernel architecture, Linux kernel, hypervisor, middleware, and application framework.
  • Those who have good publication records and have published high-impact, innovative papers are preferred.
Compensation and Benefits:

NIO offers a competitive compensation package, including a base salary range of $165,000.00 to $205,000.00, depending on work location and additional factors. The company also provides a range of benefits, including medical, dental, and vision plans, 401(k) with Brokerage Link option, company-paid Basic Life, AD&D, short-term and long-term disability insurance, and more.



  • San Jose, California, United States NIO Full time

    Job Title: Senior Large Model Algorithm EngineerNIO is a pioneering company in the premium smart electric vehicle market, founded in 2014. Our mission is to shape a joyful lifestyle by building a community starting with smart electric vehicles.We design, develop, and manufacture premium smart electric vehicles, driving innovations in next-generation...


  • San Francisco, California, United States Truva Full time

    About TruvaTruva is a pioneering SaaS company at the forefront of innovation, specializing in automating tasks and optimizing workflows with the power of Large Language Models (LLMs).Backed by top VCs such as YCombinator and Fintech Collective, Truva is led by experienced founders with a combined background of over 25 years in developing applied ML solutions...


  • San Francisco, California, United States Truva Full time

    About TruvaTruva is a pioneering SaaS company at the forefront of innovation, specializing in automating tasks, optimizing workflows, and delivering unparalleled operational efficiency with Large Language Models (LLMs).Backed by top venture capital firms such as YCombinator and Fintech Collective, Truva is led by experienced founders with a combined...


  • San Jose, California, United States Collabera Full time

    About the RoleWe are seeking a highly skilled Senior/Tech Lead AI/LLM Network Software Development Engineer to join our cutting-edge team in San Jose, CA. In this role, you will design and implement high-speed network technologies and infrastructure to support large-scale AI/LLM applications.Key ResponsibilitiesDesign, implement, and deploy high-speed...


  • San Jose, California, United States Hireio, Inc. Full time

    Job Title: Tech Lead, Large Language ModelsAbout the TeamHireio, Inc. is a leading innovator in the field of artificial intelligence, and our team is working on the next generation of large language model-based applications. Our focus is on developing sophisticated AI algorithms that can understand, learn, predict, and improve user...


  • San Jose, California, United States WeRide Full time

    Job Title: Autonomous Driving Algorithm EngineerWeRide is a leading global company that develops autonomous driving technologies from Level 2 to Level 4. We are seeking an experienced algorithm engineer to join our team and contribute to the development and optimization of Multi-Agent Planning, 3D Reconstruction, and World Model tasks.Key...


  • San Jose, California, United States Tik Tok Full time

    About the RoleWe are seeking a talented Machine Learning Engineer Intern to join our team at TikTok. As a key member of our Business Risk Integrated Control (BRIC) team, you will be responsible for researching and developing algorithms for AI native applications and optimizing Large Language Models (LLMs).ResponsibilitiesDevelop and implement novel solutions...


  • San Jose, California, United States AMD Full time

    About the RoleWe are seeking a talented AI Software Engineer to join our team at AMD, working on Generative AI inference solutions. As a key member of our AI Group, you will explore and improve upon state-of-the-art research in both academia and industry, innovating in software development, model optimization, and compression algorithms for Generative AI...


  • San Jose, California, United States Solvvy Full time

    Job SummaryWe are seeking a highly skilled Senior Video Processing Algorithm Engineer to join our team. As a key member of our video processing pipeline, you will be responsible for designing and implementing innovative algorithms that meet feature requirements.Key ResponsibilitiesAnalyze feature development at both algorithm and coding levels, including...


  • San Jose, California, United States AMD Full time

    About the RoleWe are seeking a highly skilled Machine Learning Software Engineer to join our AI team at AMD. As a key member of our team, you will be responsible for developing and optimizing Generative AI inference solutions on our products.Key ResponsibilitiesAccelerate inference of Generative AI on AMD's products.Develop tools and techniques for model...


  • San Jose, California, United States NIO Full time

    Job Title: Large Model Algorithm Engineer/DeveloperNIO Inc. is a pioneering company in the premium smart electric vehicle market, dedicated to shaping a joyful lifestyle. As a Large Model Algorithm Engineer/Developer, you will play a crucial role in driving innovations in next-generation technologies, including autonomous driving, digital technologies,...


  • San Francisco, California, United States DoorDash USA Full time

    About the RoleWe're seeking a seasoned Machine Learning Engineer to join our team at DoorDash USA. As a key member of our engineering team, you will be responsible for developing and improving the models that power our three-sided marketplace of consumers, merchants, and dashers.Key ResponsibilitiesLead the development of our support chatbot and LLM system,...


  • San Jose, California, United States Zoom Full time

    Job SummaryWe are seeking a highly skilled Senior Video Processing Algorithm Engineer to join our team at Zoom. As a key member of our engineering team, you will be responsible for designing and implementing innovative algorithms that meet feature requirements.Key ResponsibilitiesResearch and develop cutting-edge algorithms for video processing, leveraging...


  • San Jose, California, United States Hireio, Inc. Full time

    Explore a Unique Role at Hireio, Inc.About the DepartmentWe are a forward-thinking group focused on the innovation of state-of-the-art applications powered by large language models. Our mission is to leverage artificial intelligence to develop sophisticated algorithms that enhance user engagement and interaction. We aim to transform the way humans interact...


  • San Jose, California, United States Hireio, Inc. Full time

    Exciting Career Opportunity at Hireio, Inc.About the DepartmentBecome a vital part of our innovative department focused on the development of state-of-the-art applications utilizing large language models. We are committed to leveraging artificial intelligence to craft sophisticated algorithms that enhance learning, prediction, and user engagement. Our...


  • San Jose, California, United States NIO Full time

    Job Title: Large Model Algorithm Engineer/DeveloperNIO Inc. is a pioneering company in the premium smart electric vehicle market, dedicated to shaping a joyful lifestyle. As a Large Model Algorithm Engineer/Developer, you will play a crucial role in driving innovations in next-generation technologies, including autonomous driving, digital technologies,...


  • San Jose, California, United States ASML Full time

    About the RoleWe are seeking a highly skilled Senior Software Engineer to join our Computational Geometry Algorithm Team. As a key member of our team, you will be responsible for developing and implementing algorithms in C/C++ to solve complex problems in the semiconductor fab industry.Key ResponsibilitiesDesign and implement algorithms in C/C++ to optimize...


  • San Francisco, California, United States CriticalRiver Inc Full time

    Job Title: Data Scientist/Senior LLM/AI/ML EngineerWe are seeking a highly skilled Data Scientist/Senior LLM/AI/ML Engineer to join our team at CriticalRiver Inc. This role involves developing and optimizing large language models (LLMs) with a focus on enhancing AI interactions.Key Responsibilities:Design, develop, and optimize LLMs for various NLP...


  • San Jose, California, United States Hireio, Inc. Full time

    Exciting Career Opportunity at Hireio, Inc.About the TeamBecome a part of our innovative group focused on creating state-of-the-art applications utilizing large language models. We are committed to leveraging artificial intelligence to develop sophisticated algorithms that enhance user engagement and interaction. Our mission is to transform the way humans...


  • San Jose, California, United States LeadStack Inc. Full time

    Job Summary:This position is responsible for taking over an extensive simulation platform that simulates the behavior of a display over its lifetime. The model is developed in C/C++ and MATLAB and is used to evaluate the performance of various image enhancement and compression algorithms.Key Responsibilities:Take ownership of an extensive simulation platform...