AI Engineer — LLM Infra

2 days ago


San Francisco, California, United States Yutori Full time

Yutori is reimagining how people interact with the web by building AI agents that can reliably do everyday digital tasks. We are building the entire stack to be agent-first, from training our own models to generative product interfaces.

Toward this goal, we are looking for a member of the AI technical staff to join the founding team: someone technically strong and excited about building superhuman AI agents that take actions on the web.

Our founders — Devi Parikh, Abhishek Das, Dhruv Batra — have decades of experience in AI research and product spanning generative, multimodal and embodied AI at Meta. Our team combines AI experience with design-minded product thinking to build and deliver on Yutori's mission.

Yutori is backed by a stellar set of visionary investors — Elad Gil, Sarah Guo, Jeff Dean, Fei-Fei Li, Amjad Masad, Guillermo Rauch, Akshay Kothari, Soleio, Oliver Cameron, Julien Chaumond, Logan Kilpatrick, Bryan McCann, Vladlen Koltun, Jamie Cuffe, Michele Catasta, etc.

Responsibilities:

  • Scale infra for post-training of multimodal LLMs (CPT, SFT, RL, search, reward models)

  • Scale infra for agentic inference (throughput and latency of perception-planning-action loops)

  • Build the foundations of a superhuman generalist web-agent

  • Work closely with product engineers to translate cutting-edge AI capabilities into reliable product experiences

What we're looking for:

  • Experience with ML infrastructure (GPU clusters) and supporting networking (NCCL)

  • Experience optimizing post-training and inference performance of multimodal LLMs (data/tensor/pipeline/context/expert parallelism, optimizing MFU, throughput, latency)

  • Low-level systems experience (Triton, CUDA)

  • High IQ, high EQ, high agency, high craftsmanship, low ego. Proactive, clear communication.

Benefits and perks:

  • Competitive salary and equity

  • Visa sponsorship and relocation stipend to bring you to SF

  • Generous health, dental, and vision insurance for you and your dependents

  • 20 days of paid time off per year

  • Work laptop and budget to set up your work office

  • Daily team lunches

  • Commuter benefits

  • Small, focused team of high-potential individuals. In-person in SF.


