Staff ML Infrastructure Engineer
2 days ago
Staff / Lead ML Infrastructure Engineer San Francisco, CA — Onsite
Salary - Over market average + equity
We are building one of the world’s leading generative video and multimodal AI platforms, and we’re looking for a senior infrastructure engineer to drive the backbone that makes it possible. This role is ideal for an engineer from a top-tier tech company who has built cloud-scale systems, high-performance compute platforms, and battle-tested CI/CD pipelines that support complex ML workloads.
Core ML Platform Architecture: Design and evolve the infrastructure that supports large-scale generative video and multimodal model training, evaluation, and deployment.
Build and optimize GPU/TPU clusters, distributed training systems, and orchestration layers tailored for video-heavy pipelines.
End-to-End CI/CD for ML: Lead the development of automated pipelines for model training, validation, artifact management, and production rollout.
Multimodal Data Infrastructure: Build systems to ingest, version, transform, and serve large-scale video, audio, and text datasets with high reliability.
Internal Developer Experience: Partner with research, product, and applied ML teams to build intuitive internal tooling for experiment tracking, model lineage, and resource scheduling.
Technical Leadership: Mentor engineers, set platform standards, and influence long-term architectural direction.
Experience architecting and operating large-scale infrastructure at a cloud provider, hyperscaler, or leading AI company.
Built or owned mission-critical CI/CD systems, high-capacity compute platforms, or data infrastructure supporting ML teams.
Deep experience with distributed compute across GPUs/accelerators, Kubernetes, and cloud infrastructure (AWS/GCP/Azure).
Strong engineering fundamentals in Python, Go, or equivalent languages.
Previous exposure to ML training pipelines—especially systems that handle heavy video, multimodal, or high-dimensional data.
Experience with video processing systems, large-scale media pipelines, or streaming architectures.
Familiarity with modern multimodal or video-generation frameworks (PyTorch, JAX, diffusers, custom accelerators).
Experience with Ray, Triton, CUDA optimization, or specialized scheduling for ML workloads.
Background working in high-growth AI startups or research-focused environments.
Security and compliance considerations for models that generate or process user content.
Shape the underlying platform powering one of the most advanced generative video systems in the world.
Influence the future of multimodal AI by building infrastructure that directly accelerates research and product breakthroughs.
Work closely with experienced founding engineers, researchers, and platform builders from leading tech companies.
Highly competitive compensation, meaningful equity, and strong in-person engineering culture in San Francisco.
-
ML Infrastructure Engineer
3 days ago
Menlo Park, CA, United States Strativ Group Full timeML Infrastructure Engineer We are partnered with a Stealth AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Staff ML Infrastructure Engineer. This company push the boundaries of real-time generative models, building the core infrastructure that enables next-generation video AI. Their...
-
Staff ML Engineer
3 weeks ago
Sonoma, CA, United States Synergis Full timeStaff ML Engineer Direct Hire Detroit, MI or San Francisco, CA $195K-$295K About the Team: The ML Inference Platform is part of the AI Compute Platforms organization within Infrastructure Platforms. Our team owns the cloud-agnostic, reliable, and cost-efficient platform that powers our client's AI efforts. We're proud to serve as the AI infrastructure...
-
ML Infrastructure Engineer
3 days ago
Menlo Park, CA, United States Strativ Group Full timeML Infrastructure Engineer We are partnered with a Stealth AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Staff ML Infrastructure Engineer. This company push the boundaries of real-time generative models, building the core infrastructure that enables next-generation video AI. Their...
-
Principal Staff Engineer AI Infrastructure
4 days ago
San Francisco, CA, United States Andiamo Full timeOverview Principal Staff Engineer - AI Infrastructure. We are seeking a Principal Staff Engineer to lead the architecture and development of our next-generation AI infrastructure. This role sits at the intersection of large-scale distributed systems and cutting-edge machine learning, powering the platforms that enable researchers and engineers to build,...
-
Senior Backend Engineer, ML Ops
4 weeks ago
Hayward, CA, United States Plenful Full timeAbout Plenful Plenful is on a mission to transform healthcare operations from the inside out. Fresh off our $50M Series B and backed by Bessemer Venture Partners, Notable Capital, TQ Ventures, Susa/Kivu Ventures, and other leading investors, we’re building the category-defining AI agentic operating platform that healthcare teams rely on to operate smarter,...
-
Staff Software Engineer, ML Platform
3 weeks ago
San Francisco, CA, United States Attentive Full timeAttentive® is the AI-powered mobile marketing platform transforming the way brands personalize consumer engagement. Attentive enables marketers to craft tailored journeys for every subscriber, driving higher recurring revenue and maximizing campaign performance. Activating real-time data from multiple channels and advanced AI, the platform personalizes...
-
Member of Technial Staff
2 days ago
Hayward, United States Quantix Search Full timeMember of Technical Staff Applied AI / ML EngineerSan Francisco, Union Square (Onsite) Up to $325K + stock optionsWere partnering with an AI startup in San Francisco that operates in the tax and accounting automation space to hire a Member of Technical Staff focused on Applied AI / ML. The company builds GenAI-driven systems that turn complex, unstructured...
-
Mountain View, CA, United States Nuro Full timeWho We Are Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world's most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver™, to support a wide range of applications, from robotaxis and commercial fleets...
-
DevOps Engineering Lead
3 weeks ago
San Francisco, CA, United States Symbolica Full timeDevOps Engineering Lead - ML Infrastructure Please make sure you read the following details carefully before making any applications. About Us Symbolica is an AI research lab pioneering the application of category theory to enable logical reasoning in machines. We’re a well-resourced, nimble team of experts on a mission to bridge the gap between...
-
Principal Staff AI/ML Engineer
4 days ago
Sunnyvale, CA, United States General Motors Full time $200,000 - $300,000 per yearJob DescriptionPrincipal AI/ML Engineer, AV ML InfraWe're General Motors (GM), a company driving the future of mobility with advanced self-driving and electric vehicle technologies.We're building the world's most innovative autonomous vehicles to safely connect people to the places, things, and experiences they care about. We believe self-driving vehicles...