Project Manager for AI Infrastructure

2 weeks ago


San Francisco, California, United States Together AI Full time

About the Job: We invite you to join a passionate group of researchers on our journey to build next-generation AI infrastructure. You will be responsible for managing GPU hardware inventory, developing and maintaining a system to log and track GPU outages, and continuously seeking opportunities to improve GPU tracking processes and systems.

About Together AI: We are a research-driven artificial intelligence company dedicated to significantly lowering the cost of modern AI systems by co-designing software, hardware, algorithms, and models.

Responsibilities:

  • Monitor and manage GPU hardware inventory
  • Develop and maintain a system to log and track GPU outages
  • Improve GPU tracking processes and systems
  • Work with engineering, customer success, and operations to resolve outages

Requirements:

  • Bachelor's degree in business, information technology, or an engineering-related field
  • At least 3 years of experience in technical program management, inventory management, and/or data center operations / project management


  • San Francisco, California, United States Together AI Full time

    Company Overview: Towards a More Transparent AI Future. Together AI is revolutionizing the field of artificial intelligence by co-designing software, hardware, algorithms, and models. Our mission is to significantly lower the cost of modern AI systems, making them more accessible to everyone. With contributions to leading open-source research, models, and...


  • San Francisco, California, United States Together AI Full time

    Job Description: As a key member of Together AI's hardware team, you will be responsible for optimizing and scaling our decentralized GPU resources. This critical role involves ensuring the efficient operation of thousands of GPUs distributed across multiple data centers. Your expertise will enable cutting-edge AI advancements that democratize access to AI...


  • San Francisco, California, United States Snorkel AI Full time

    We're on a mission to make machine learning accessible to everyone. At Snorkel AI, we're building the definitive AI data development platform. The AI landscape has undergone significant changes over the years, but one thing remains constant: high-quality data is essential for achieving differentiation, high performance, and production-ready systems. Our...


  • San Francisco, California, United States WaveForms AI Full time

    Job title: Software Engineer, AI Infrastructure (Training + Inference) / Member of Technical Staff. Who We Are: WaveForms AI is an Audio Large Language Models (LLMs) company building the future of audio intelligence through advanced research and products. Our models will transform human-AI interactions, making them more natural, engaging, and immersive. Role...


  • San Francisco, California, United States The Rundown AI, Inc. Full time

    About Our Company: The Rundown AI, Inc. is a fast-growing, Series B startup revolutionizing the field of AI data infrastructure. We specialize in providing cutting-edge data pipeline solutions for Machine Learning, LLM, and GenAI solutions to large enterprise clients, helping them leverage the power of AI to transform their businesses. We have a fully...


  • San Francisco, California, United States Altana AI Full time

    Company Overview: Altana AI is a pioneering company that applies artificial intelligence to the world's largest organized body of supply chain data. Our mission is to create a more resilient, secure, and sustainable model of global commerce by harnessing the power of AI. We collaborate with leading organizations and government agencies worldwide to build a...


  • San Francisco, California, United States Together AI Full time

    About Together AI: We are a research-driven artificial intelligence company. Our mission is to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. Our team has made significant contributions to open-source research, models, and datasets that advance the frontier of AI. We invite you to join our...


  • San Francisco, California, United States Together AI Full time

    Responsibilities: • Monitor and manage GPU hardware inventory across multiple decentralized data centers • Track the lifecycle of GPUs, including acquisition, deployment, usage, maintenance, and decommissioning • Develop and maintain a system to log and track all GPU outages or malfunctions, including root cause analysis, downtime duration, and...


  • San Francisco, California, United States The Rundown AI, Inc. Full time

    About the Role: We are seeking a highly skilled Technical Project Manager to oversee and drive the execution of AI data projects. In this role, you will manage the entire lifecycle of data creation projects for enterprise clients, from initial client consultation through final delivery. This position combines technical expertise, meticulous attention to detail,...


  • San Francisco, California, United States The Rundown AI, Inc. Full time

    About the Role: The Rundown AI, Inc. is seeking a highly skilled Machine Learning Systems Engineer to join its Model Evaluations team. As a member of this team, you will be responsible for designing, building, and maintaining scalable systems that enable researchers to effectively evaluate models and conduct inference tasks critical to the organization's...


  • San Francisco, California, United States Waveforms AI, Inc Full time

    Job title: Software Engineer, AI Infrastructure (Training + Inference) / Member of Technical Staff. Who We Are: WaveForms AI is an Audio Large Language Models (LLMs) company building the future of audio intelligence through advanced research and products. Our models will transform human-AI interactions, making them more natural, engaging, and immersive. Role...


  • San Francisco, California, United States Together AI Full time

    Company Overview: Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our...


  • San Francisco, California, United States Together AI Full time

    Key Responsibilities: • Manage GPU hardware inventory across multiple decentralized data centers • Develop and maintain a system to log and track all GPU outages or malfunctions, including root cause analysis, downtime duration, and replacement cycles • Generate reports on utilization, availability, and performance trends, and recommend improvements •...


  • San Francisco, California, United States Together AI Full time

    Project Manager, Hardware & Business Operations. Location: San Francisco, CA (Hybrid). Role: As the first Project Manager for hardware at a pioneering AI infrastructure company, you will be at the core of optimizing and scaling our decentralized GPU resources. Your role is crucial in ensuring that the backbone of our AI models, thousands of GPUs distributed...


  • San Francisco, California, United States Jobleads-US Full time

    We are seeking an experienced AI Infrastructure Project Director to oversee the construction of our hyperscale AI data centers. In this role, you will be responsible for ensuring the successful delivery of complex projects on time, within budget, and to the highest quality standards. You will develop and implement quality management plans, conduct regular...


  • San Francisco, California, United States Together AI Full time

    As a Senior AI Infrastructure Engineer, you will be responsible for building the next-generation, highly available, global, multi-cloud PaaS platform with open-source technologies to enable and accelerate Together AI's rapid growth. This system spans many diverse environments (Kubernetes, VMs, bare metal compute, and edge deployments) and provides a cohesive...


  • San Francisco, California, United States Together AI Full time

    About the Role: As a Senior AI Infrastructure Engineer, you will be responsible for building the next-generation, highly available, global, multi-cloud PaaS platform with open-source technologies to enable and accelerate Together AI's rapid growth. This system spans many diverse environments (Kubernetes, VMs, bare metal compute, and edge deployments) and...


  • San Francisco, California, United States Distyl AI Full time

    About Distyl AI: We develop AI-native technologies for humans & AI to collaborate and power the operations of the Global Fortune 1000. Our platform, Distillery, along with our team of AI Engineers, Researchers, and Strategists, is pioneering AI-native systems of work. Job Description: We're looking for an experienced AI Platform Engineer to design and...