Staff ML Infrastructure Engineer

2 days ago


Hayward CA, United States Cubiq Recruitment Full time

Staff / Lead ML Infrastructure Engineer San Francisco, CA — Onsite
Salary - Over market average + equity

We are building one of the world’s leading generative video and multimodal AI platforms, and we’re looking for a senior infrastructure engineer to drive the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI/CD pipelines that support complex ML workloads.

Core ML Platform Architecture: Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
Build and optimize GPU/TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
End-to-End CI/CD for ML: Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
Multimodal Data Infrastructure: Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
Internal Developer Experience: Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
Technical Leadership: Mentor engineers, set platform standards, and influence long-term architectural direction.

Experience architecting and operating large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
Built or owned mission-critical CI/CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
Deep experience with distributed compute across GPUs/accelerators, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
Strong engineering fundamentals in Python, Go, or equivalent languages.
Previous exposure to ML training pipelines—especially systems that handle heavy video, multimodal, or high-dimensional data.
Experience with video processing systems, large-scale media pipelines, or streaming architectures.
Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
Background working in high-growth AI startups or research-focused environments.
Security and compliance considerations for models that generate or process user content.

Shape the underlying platform powering one of the most advanced generative video systems in the world.
Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
Highly competitive compensation, meaningful equity, and strong in-person engineering culture in San Francisco.



  • Menlo Park, CA, United States Strativ Group Full time

    ML Infrastructure Engineer We are partnered with a Stealth AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Staff ML Infrastructure Engineer. This company push the boundaries of real-time generative models, building the core infrastructure that enables next-generation video AI. Their...

  • Staff ML Engineer

    3 weeks ago


    Sonoma, CA, United States Synergis Full time

    Staff ML Engineer Direct Hire Detroit, MI or San Francisco, CA $195K-$295K About the Team: The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, reliable, and cost-efficient platform that powers our client's AI efforts. We're proud to serve as the AI infrastructure...


  • Menlo Park, CA, United States Strativ Group Full time

    ML Infrastructure Engineer We are partnered with a Stealth AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Staff ML Infrastructure Engineer. This company push the boundaries of real-time generative models, building the core infrastructure that enables next-generation video AI. Their...


  • San Francisco, CA, United States Andiamo Full time

    Overview Principal Staff Engineer - AI Infrastructure. We are seeking a Principal Staff Engineer to lead the architecture and development of our next-generation AI infrastructure. This role sits at the intersection of large-scale distributed systems and cutting-edge machine learning, powering the platforms that enable researchers and engineers to build,...


  • Hayward, CA, United States Plenful Full time

    About Plenful Plenful is on a mission to transform healthcare operations from the inside out. Fresh off our $50M Series B and backed by Bessemer Venture Partners, Notable Capital, TQ Ventures, Susa/Kivu Ventures, and other leading investors, we’re building the category-defining AI agentic operating platform that healthcare teams rely on to operate smarter,...


  • San Francisco, CA, United States Attentive Full time

    Attentive® is the AI-powered mobile marketing platform transforming the way brands personalize consumer engagement. Attentive enables marketers to craft tailored journeys for every subscriber, driving higher recurring revenue and maximizing campaign performance. Activating real-time data from multiple channels and advanced AI, the platform personalizes...


  • Hayward, United States Quantix Search Full time

    Member of Technical Staff Applied AI / ML EngineerSan Francisco, Union Square (Onsite) Up to $325K + stock optionsWere partnering with an AI startup in San Francisco that operates in the tax and accounting automation space to hire a Member of Technical Staff focused on Applied AI / ML. The company builds GenAI-driven systems that turn complex, unstructured...


  • Mountain View, CA, United States Nuro Full time

    Who We Are Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver™, to support a wide range of applications, from robotaxis and commercial fleets...


  • San Francisco, CA, United States Symbolica Full time

    DevOps Engineering Lead - ML Infrastructure Please make sure you read the following details carefully before making any applications. About Us Symbolica is an AI research lab pioneering the application of category theory to enable logical reasoning in machines. We’re a well-resourced, nimble team of experts on a mission to bridge the gap between...


  • Sunnyvale, CA, United States General Motors Full time $200,000 - $300,000 per year

    Job DescriptionPrincipal AI/ML Engineer, AV ML InfraWe're General Motors (GM), a company driving the future of mobility with advanced self-driving and electric vehicle technologies.We're building the world's most innovative autonomous vehicles to safely connect people to the places, things, and experiences they care about. We believe self-driving vehicles...