LLM Research Engineer

4 weeks ago


Palo Alto, United States CHAI: AI Platform Full time

AI Research Engineer (LLM Optimization)

$250-350K | PALO ALTO, CA


Chai is one of the fastest-growing, generative AI startups in Silicon Valley. YouTube but for LLM's - we have over 1 million active users.


Who we are looking for:


We need a relentless engineer with 3+ years of experience overseeing and being responsible for optimizing our LLMs. Ensuring they are performant, scaleable, and cost-efficient. You will work alongside equally talented and driven teammates implementing cutting-edge AI inference engines. We need someone who is reliable and has high standards.



Here's why we might not be the right fit for you:

• We work hard and have a high-velocity environment with lots of growth opportunities.

• We value exceptional performance and continuous improvement. We believe that if you aren't constantly learning, you aren't growing.

• You will be responsible and accountable for making high-impact decisions that determine Chai's future


Here are the top 2 reasons why you should join us:

• Exponential growth. 1 Million MAU. Join the team that gets us to 100 million MAU

• Craftsmanship. Build something beautiful


Requirements:

• Familiar with vLLM, quantization, and current techniques of LLM optimization

• 3+ years of experience in software engineering

• Bachelor or Master degree from a leading academic institution


Here is our tech stack:

• Front end: Python, Flutter, Dart

• Back end: Python, GCP, Redis, Kubernetes


Process:

Exceptionally fast, application to offer within 7 days

1. Apply here

2. First round video interview, system design interview, then onsite

3. Reference checks, negotiation, and offer



  • Palo Alto, California, United States PlayHT Full time

    About Us:PlayHT is at the forefront of generative voice and conversational LLMs. With our Speech Synthesis and Voice Cloning models, we are building the SOTA conversational AI products.We are building a platform and infrastructure for Conversational AI Voice Agents so that every business, developer, or tinkerer can easily build talking human-like AI agents...


  • Palo Alto, United States Play Full time

    About Us: PlayHT is at the forefront of generative voice and conversational LLMs. With our Speech Synthesis and Voice Cloning models, we are building the SOTA conversational AI products. We are building a platform and infrastructure for Conversational AI Voice Agents so that every business, developer, or tinkerer can easily build talking human-like AI agents...


  • Palo Alto, United States Acceler8 Talent Full time

    About the Company: We are a well-funded Stanford Spinout, based in Palo Alto, on a mission to redefine efficiency and affordability in hardware engineering. We are already partnered with some of the world's largest semiconductor companies and are rapidly expanding our customer base...As a company, we are dedicated to revolutionizing the hardware engineering...


  • Palo Alto, United States Acceler8 Talent Full time

    About the Company: We are a well-funded Stanford Spinout, based in Palo Alto, on a mission to redefine efficiency and affordability in hardware engineering. We are already partnered with some of the world's largest semiconductor companies and are rapidly expanding our customer base...As a company, we are dedicated to revolutionizing the hardware engineering...


  • Palo Alto, United States Acceler8 Talent Full time

    About the Company: We are a well-funded Stanford Spinout, based in Palo Alto, on a mission to redefine efficiency and affordability in hardware engineering. We are already partnered with some of the world's largest semiconductor companies and are rapidly expanding our customer base...As a company, we are dedicated to revolutionizing the hardware engineering...


  • Palo Alto, United States Acceler8 Talent Full time

    About the Company: We are a well-funded Stanford Spinout, based in Palo Alto, on a mission to redefine efficiency and affordability in hardware engineering. We are already partnered with some of the world's largest semiconductor companies and are rapidly expanding our customer base...As a company, we are dedicated to revolutionizing the hardware engineering...

  • Research Engineer

    3 weeks ago


    Palo Alto, United States Pika 1.0 Full time

    ROLE: RESEARCH ENGINEER Summary: As a Research Engineer specializing in Machine Learning and Systems Engineering at our company, you will be instrumental in pioneering sophisticated AI solutions. This role demands a unique blend of leadership in conducting end-to-end research projects and technical expertise in building scalable systems. You'll be part of an...


  • Palo Alto, United States Pinterest Full time

    What you’ll do:Own the vision and strategy of ML-based product solutions for Pinterest’s largest user acquisition channel: SEO. Work closely with design, product, and backend teams to find the gap in current ML practices and propose technical solutions to tackle engineering problems.Work side-by-side with backend/fullstack product engineers, data...

  • UX Researcher

    1 week ago


    Palo Alto, California, United States SAP Full time

    We help the world run better At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...

  • UX Researcher

    6 days ago


    Palo Alto, California, United States SAP Full time

    We help the world run betterAt SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and...


  • Palo Alto, United States Knitit.ai Full time

    We are looking for a AI/ML Engineer to join a small team of ambitious people that are building an AI-powered assistant product in the Palo Alto, CA. We build innovative solutions that aim to amplify the power of our users through intelligent interactions. This position is ideal for a talented expert eager to apply their skills in the production of highly...

  • Research Engineer

    3 weeks ago


    Palo Alto, United States Harmonic Full time

    Harmonic is a seed-stage AI startup building the world’s most advanced mathematical reasoning engine. We are building an elite team and are backed by some of the world’s most prominent investors. We are seeking a highly motivated and skilled Research Scientist to join our AI & Formal Methods team. The initial focus of this position will be on advancing...

  • Software Engineer

    4 weeks ago


    Palo Alto, California, United States PipeIQ Full time

    PipeIQ is an early stage startup building AI Co-pilots to accelerate marketing and sales pipelines. We do this via an orchestration engine that leverages Large Language Models (LLMs) to build highly personalized chatbots and email bots, among others. Our founder has deep domain expertise in Marketing Automation and Martech, which is very relevant for the...

  • Founding Engineer

    1 week ago


    Palo Alto, United States Bolo AI Full time

    About Us: At Bolo AI, we are pioneering the future of knowledge management in heavy industries, starting with Oil & Gas. Our groundbreaking platform combines domain-specific models with cutting-edge AI technology to redefine how millions of heavy industry professionals access and create vital knowledge. Our products, Bolo AI Answer and Bolo AI Writer,...

  • Founding Engineer

    1 week ago


    Palo Alto, United States Bolo AI Full time

    About Us: At Bolo AI, we are pioneering the future of knowledge management in heavy industries, starting with Oil & Gas. Our groundbreaking platform combines domain-specific models with cutting-edge AI technology to redefine how millions of heavy industry professionals access and create vital knowledge. Our products, Bolo AI Answer and Bolo AI Writer,...


  • Palo Alto, California, United States PipeIQ Full time

    PipeIQ is an early stage startup that leverages Generative AI to accelerate marketing and sales pipelines. We do this via an orchestration engine that leverages Large Language Models (LLMs) to build highly personalized chatbots and email bots, among others. Our founder has deep domain expertise in Marketing Automation and Martech, which is very relevant for...


  • Palo Alto, California, United States PipeIQ Full time

    PipeIQ is an early stage startup building AI Co-pilots to accelerate marketing and sales pipelines. We do this via an orchestration engine that leverages Large Language Models (LLMs) to build highly personalized chatbots and email bots, among others. Our founder has deep domain expertise in Marketing Automation and Martech, which is very relevant for the...


  • Palo Alto, United States InnoPeak Technology Full time

    RESEARCH ENGINEER To R&D machine learning app & data proc/analytics for human hand tracking, img/vid proc. & related on VR/XR devices. Salary range $184,662 to $216,000/yr. Work site/resume to: InnoPeak Technology, Inc., 2479 E Bayshore Rd, Ste 110, Palo Alto, CA 94303; or


  • Palo Alto, United States InnoPeak Technology, Inc. Full time

    RESEARCH ENGINEER To R&D machine learning app & data proc/analytics for human hand tracking, img/vid proc. & related on VR/XR devices. Salary range $184,662 to $216,000/yr. Work site/resume to: InnoPeak Technology, Inc., 2479 E Bayshore Rd, Ste 110, Palo Alto, CA 94303; or HR@innopeaktech.com


  • Palo Alto, United States InnoPeak Technology Full time

    RESEARCH ENGINEER To R&D machine learning app & data proc/analytics for human hand tracking, img/vid proc. & related on VR/XR devices. Salary range $184,662 to $216,000/yr. Work site/resume to: InnoPeak Technology, Inc., 2479 E Bayshore Rd, Ste 110, Palo Alto, CA 94303; or HRinnopeaktech.com