Current jobs related to ML Inference Infrastructure Lead, Video Streaming - San Francisco - Twelve Labs


  • San Francisco, United States OpenAI Full time

    The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference.Our priorities are to maximize training throughput (how quickly we can train a new model) and researcher...


  • San Francisco, United States Apollo Solutions Full time

    Founding Machine Learning Engineer - InfrastructureWe are searching for a Founding ML Infrastructure Engineer who is excited about going a pre-seed start-up and building from the ground up.They have been backed by top tier Venture Capital and are building the infrastructure for real-time AI applications such as voice and video.You will play a crucial role in...


  • san francisco, United States Apollo Solutions Full time

    Founding Machine Learning Engineer - InfrastructureWe are searching for a Founding ML Infrastructure Engineer who is excited about going a pre-seed start-up and building from the ground up.They have been backed by top tier Venture Capital and are building the infrastructure for real-time AI applications such as voice and video.You will play a crucial role in...


  • San Francisco, United States Apollo Solutions Full time

    Founding Machine Learning Engineer - InfrastructureWe are searching for a Founding ML Infrastructure Engineer who is excited about going a pre-seed start-up and building from the ground up.They have been backed by top tier Venture Capital and are building the infrastructure for real-time AI applications such as voice and video.You will play a crucial role in...

  • AI Video Engineer

    2 weeks ago


    San Francisco, California, United States Tavus Inc. Full time

    About Tavus Inc.Tavus Inc. is a leading AI synthetic media startup that is revolutionizing modern marketing and product experiences with AI videos. Our team has raised $25m to make personalized video experiences scalable.We are looking for an experienced ML-focused Software Engineer to work with our applied ML team. Our ideal candidate is a self-starter with...


  • San Francisco, California, United States Kuzco Full time

    About KuzcoWe are a cutting-edge technology company that specializes in developing innovative AI solutions. Our team of expert engineers is dedicated to building a distributed LLM inference network that combines idle GPU capacity from around the world into a single cohesive plane of compute.Our network is designed to handle millions of large language model...


  • San Francisco, California, United States Warner Media, LLC. Full time

    About Warner Bros. DiscoveryWarner Bros. Discovery is a global leader in the media and entertainment industry, with a rich history of creating iconic content and beloved brands. Our company is built on a foundation of innovation, creativity, and a passion for storytelling.Job SummaryWe are seeking a highly skilled Staff Software Engineer to join our Content...


  • San Francisco, California, United States Philo Full time

    At Philo, we are a collective of innovators in technology and product development, dedicated to revolutionizing the television landscape by merging cutting-edge technology with the art of storytelling. Our mission is to create the ultimate TV experience that resonates with our vision. This involves utilizing cloud infrastructure, advanced technology stacks,...

  • Software Engineer

    2 weeks ago


    San Francisco, United States Tavus Full time

    At Tavus, we're at the forefront of AI generative video technology, offering advanced APIs to developers and product teams. Our cutting-edge models enable a wide range of applications, from text-to-video with AI avatars to real-time interactions, powering innovation across industries like video communication, marketing, sales, education, and more. As a...

  • Software Engineer

    3 days ago


    San Jose, California, United States TikTok Full time

    About the RoleWe are seeking a highly skilled Software Engineer to join our Ads Machine Learning Infrastructure team. As a key member of this team, you will be responsible for building and operating scalable and reliable ads ranking infrastructure systems.Key ResponsibilitiesLead projects to design and implement scalable and reliable ads ranking...


  • San Jose, California, United States TikTok Full time

    About the RoleWe are seeking a highly skilled Senior Software Engineer to join our Ads Machine Learning Infrastructure team at TikTok. As a key member of our team, you will be responsible for leading the development and operation of scalable and reliable ads ranking infrastructure systems.Key ResponsibilitiesLead projects to build and operate...


  • San Francisco, California, United States Philo, Inc. Full time

    At Philo, we are a collective of innovators and product specialists dedicated to redefining the television landscape by integrating cutting-edge technology with the art of storytelling. Our mission is to create the ultimate TV experience that we envision for ourselves. This involves utilizing cloud technology, contemporary tech stacks, machine learning, and...


  • San Francisco, California, United States Philo, Inc. Full time

    About Philo, Inc.At Philo, we are a dedicated team of technology and product professionals focused on revolutionizing the television landscape. Our mission is to merge cutting-edge technology with the captivating medium of television, creating the ultimate viewing experience that we have always envisioned. This involves utilizing cloud-based solutions,...


  • San Francisco, California, United States Strativ Group Full time

    Senior ML Engineering LeadCompensation: up to $450k base + ~1-1.5% equityWe are collaborating with a well-funded startup that is transforming the domain of language models (LLMs) through advanced intelligence and optimization techniques, allowing both businesses and individuals to leverage LLMs tailored to their specific requirements.This organization has...


  • San Antonio, United States Apollo Inc Full time

    About the Role: We are looking for a seasoned Senior Machine Learning Engineer to architect, build, and optimize ML inference platform. The role demands an individual with significant expertise in Machine Learning engineering and infrastructure, with an emphasis on building Machine Learning inference systems. Proven experience in building and scaling ML...


  • San Francisco, California, United States Acceler8 Talent Full time

    About the RoleWe're seeking a highly skilled Staff ML Infrastructure Engineer to join our pioneering team at the forefront of AI and ML technology. As a key member of our team, you'll collaborate with researchers and product engineers to create innovative product experiences powered by large language models.Your expertise will be instrumental in designing...


  • San Antonio, Texas, United States Rackspace Full time

    About the RoleWe are seeking a seasoned Principal ML OPS Engineer to lead the design, development, and optimization of our ML inference platform. The ideal candidate will have significant expertise in Machine Learning engineering and infrastructure, with a focus on building scalable and efficient ML inference systems.Key ResponsibilitiesArchitect and...


  • San Francisco, California, United States OpenAI Full time

    About the RoleWe are seeking a highly skilled Stream Infrastructure Architect to join our team at OpenAI. As a key member of our Applied Data Platform team, you will be responsible for designing, building, and operating the foundational data infrastructure that enables products and teams at OpenAI.Key Responsibilities:Maintain the health and operability of...


  • San Francisco, California, United States Philo, Inc. Full time

    At Philo, we are a collective of innovators and product specialists dedicated to revolutionizing the television landscape by integrating cutting-edge technology with the most engaging medium ever created. Our mission is to construct the television experience we have always envisioned for ourselves. This involves utilizing cloud-based delivery, contemporary...

  • Software Engineer

    6 days ago


    San Francisco, CA, United States Genai Works Full time

    Backed by Sequoia Capital, Y-Combinator, and other top valley firms, our team has raised $25m to revolutionizing modern marketing and product experiences with AI videos. is a leading AI synthetic media startup focused on making personalized video experiences scalable. Tavus uses artificial intelligence deepfake technology to generate realistic videos that...

ML Inference Infrastructure Lead, Video Streaming

4 months ago


San Francisco, United States Twelve Labs Full time
Who we are

We're a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world's most powerful video understanding infrastructure to market. As a part of achieving this mission, we are building foundation AI models that can accurately and instantly search exact moments within petabytes of video archives, generate coherent text summaries of videos, perform prompt-based video generation, and many more. The Twelve Labs platform provides access to its Large Visual Language Models (VLMs) through a suite of APIs that are trained on massive video datasets and learn to understand the meaning and context behind the visuals, conversations, and sounds within videos.

Twelve Labs recently raised $17M in seed funding, recognized as one of CB Insights' AI 100 companies within a year of its founding, and secured a massive compute resource through partnering with Oracle. We are hyper focused on delivering the Twelve Labs platform to our customers so they can build video understanding into their products and power dream features they could have only imagined.

Part of the pathway to our rapid growth has been paved by the outstanding group of people united by the company's mission. Beyond prominent venture capital firms such as Index Ventures and Radical Ventures, the Twelve Labs mission is backed by category building luminaries like Fei-Fei Li (Stanford HAI), Silvio Savarese (Salesforce), Oren Etzioni (AI2), Alexandr Wang (Scale), Lukas Biewald (W&B), Jack Conte (Patreon) and more.

We are committed to creating a diverse and inclusive work environment where our team members can bring their full selves to work, bring out their potential, and most importantly, thrive together. We welcome kind, brilliant, and open minded people from all walks of life to our team. If joining this mission speaks to you, we encourage you to apply

About the Role:

As the Lead ML Systems Engineer at Twelve Labs, you will lead the ML Engineering team, driving the development of optimal machine learning systems for video foundation (VFM) and language model (VLM) in production. Your role encompasses the entire spectrum of machine learning engineering, from optimizing and scaling the inference infrastructure, which involves extensive video processing both in the cloud and on-premise, to model deployment and operations, and data infrastructure. VFMOps & VLMOps is central to our user experience as it dictates the latency and deployment speed of the trained model.

For the first 3 to 6 months, you will be hands on and actively contribute as an individual contributor in our development process. As the lead, you will set the technical strategies and goals, recruit top talent, and be responsible for your team's success, ensuring our machine learning systems exceed user expectations in terms of speed, efficiency, and reliability. Your expertise will be key in overcoming challenges related to processing vast amounts of video data and deploying sophisticated models in production. Together with your team, you will work to enhance our VFMOps & VLMOps, contributing to a superior user experience that distinguishes Twelve Labs from its competitors. Your leadership, technical expertise, and commitment to excellence will be critical to our team's success and our users' satisfaction.

You will:
    • Prioritize the team's work in building and improving our machine learning systems in production for video foundation and language model (VFM & VLM), in collaboration with senior engineers and other stakeholders
    • Inference Infrastructure: Construct the most performant, scalable, and reliable inference engine optimized for Twelve Lab's video foundation and language models.
    • ML Deployment & Operations (VFMOps / VLMOps): Lead the initiative in serving the model in the most optimized manner, deploying the pipeline, and automating the model training to deployment process.
    • Data: Oversee the data infrastructure and preparation of high-quality video data for our training runs.
    • Design processes (e.g. postmortem review, incident response, on-call rotations) that help the team operate effectively
    • Coach and develop your reports to decide how they would like to advance in their careers and help them do so
    • Run the team's recruiting efforts through a period of rapid growth
You may be a good fit if you have:
    • 10+ years of software development experience, including experience in machine learning engineering
    • 5+ years of experience in building end-to-end machine learning systems encompassing infrastructure, MLOps, and data management
    • You have experience working with engineers at different levels and have coached them in their career development
    • 2+ years of experience managing high output engineering teams
    • Proficiency in working with video processing and data pipelining
    • Experience in establishing and maintaining secure software and system development environments
Desired Experience:
    • MS or PhD in Computer Science, Math, or equivalent real-world experience
    • Fast-paced startup engineering experience
    • Experience working with large scale models
    • Experience working with both cloud and on-premise environment
    • ML research experience would be helpful, as this role requires interchangeable effort on both research side and software side
    • Experience in handling large-scale computing system and firm understanding on scale-up and scale-out approach in cloud environment
Relevant Tech Stack:
    • Language: Python, Golang, C++, CUDA
    • ML / Platform: PyTorch, Docker, Kubernetes, Terraform
    • ML Demo page: Gradio, Streamlit
    • MLOps: MLFlow, Weights and Biases
    • Data: Pachyderm, DVC
    • Automation: Airflow, Kubeflow
    • Model serving: Triton, FasterTransformer


Interview and Onboarding Process

Recruiter Phone Screen -> Phone Interview -> Technical Screen -> Onsite Interview -> Reference Checks

We're also excited to share that we'll do global onboarding in Seoul for all new hires (company-sponsored travel).

Even if there are a few checkboxes that aren't ticked through your prior experience, we still encourage you to apply If you are a 0-to-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at Twelve Labs.

We welcome applicants from all walks of life and are committed to equal-opportunity employment. We cherish and celebrate diversity not just because it is the right thing to do, but because it makes our company much stronger.

Benefits and Perks

An open and inclusive culture and work environment.

Work closely with a collaborative, mission-driven team on cutting-edge AI technology.

Full health, dental, and vision benefits

Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Years.

Remote-flexible, offices in San Francisco and Seoul and coworking stipend

VISA support (such as H1B and OPT transfer for US employees)