Platform ML Engineering Manager, Inference

4 weeks ago


San Francisco, California, United States Openai Full time

About the Team



The Platform ML team is responsible for building the ML side of our internal training framework, which is used to train cutting-edge models.



We work on distributed model execution, as well as the interfaces and implementation for model code, training, and inference.



Our priorities are to maximize training throughput and researcher throughput, with the goal of accelerating progress towards AGI.



We frequently collaborate with other teams to speed up the development of new capabilities.



About the Role



We are seeking an experienced engineering manager to help lead critical work on our shared internal inference stack and grow the team.



Our inference stack is primarily built by the Applied AI engineering team, and we will improve and extend it for research use cases.



In this role, you will:




  • Get SOTA throughput for our most important research models.
  • Reduce the time it takes to get efficient inference for new model architectures.
  • Collaborate closely with Applied AI engineering to maximize the benefits of our shared internal inference stack.
  • Hire world-class AI systems engineers in one of the most competitive hiring markets.
  • Coordinate the inference needs of OpenAI's research teams.


Create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think.



You might thrive in this role if you:




  • Have 3+ years of experience in engineering management and 7+ years as an IC working with high scale distributed systems and ML systems.
  • Have experience with ML systems, particularly high scale distributed training or inference for modern LLMs.
  • Have familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented.
  • Care deeply about diversity, equity, and inclusion, and have a track record of building inclusive teams.


About OpenAI



OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.



We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products.



AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.



We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.



OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement



We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.



OpenAI Global Applicant Privacy Policy



At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared.



  • San Francisco, California, United States Genmo Inc. Full time

    At Genmo Inc., we are a research lab dedicated to building state-of-the-art models for video generation. Our goal is to unlock the potential of Artificial General Intelligence (AGI).Job OverviewWe are seeking a senior/staff software engineer to join our inference team. This role involves designing and scaling our inference systems to support millions of...


  • San Francisco, California, United States OpenAI Full time

    Platform Engineering Team LeadWe are seeking an experienced engineering manager to lead our Platform ML team in building the ML side of our internal training framework. This framework is used to train our cutting-edge models, and our team works on distributed model execution, interfaces, and implementation for model code, training, and inference.Key...

  • ML Platform Engineer

    3 weeks ago


    San Francisco, California, United States Abridge Full time

    About the RoleAbridge is seeking a highly skilled ML Platform Engineer to join our team and help us scale our AI infrastructure. As a key member of our engineering team, you will be responsible for designing, implementing, and deploying machine learning models at scale.Our ideal candidate has a strong background in Python, Kubernetes, and cloud environments,...


  • San Francisco, California, United States Liquid AI Full time

    At Liquid AI, we're seeking a highly skilled engineer to optimize inference stacks tailored to various hardware platforms.The ideal candidate has extensive experience in CUDA, C++, and Triton, as well as a deep understanding of GPU, CPU, and NPU architectures.They should be self-motivated, capable of working independently, and driven by a passion for...


  • San Francisco, California, United States Liquid AI Full time

    About the RoleWe're seeking a highly skilled engineer to join our team at Liquid AI, where you'll play a critical role in optimizing inference stacks for our AI models.As a key member of our team, you'll be responsible for taking our models and delivering highly optimized inference stacks that leverage existing frameworks like ggml, vllm, and DeepSpeed to...

  • AI/ML Engineer

    4 weeks ago


    San Francisco, California, United States WEX Inc Full time

    About the RoleWe are seeking a highly motivated and results-oriented AI/ML Engineer to join our fast-growing team at WEX Inc. As a key member of our AI Engineering team, you will play a critical role in driving our strategic vision to integrate artificial intelligence into the core of our product and business.ResponsibilitiesCollaborate with stakeholders to...

  • AI Platform Engineer

    3 weeks ago


    San Francisco, California, United States Labelbox Full time

    About the RoleLabelbox is seeking a skilled AI Platform Engineer to join our team. As a key member of our engineering organization, you will be responsible for building and maintaining a scalable AI platform that utilizes foundation models for real-world applications.Your Day to DayEnhance and improve Labelbox's core machine learning capabilities, including...


  • San Francisco, California, United States Perplexity AI Full time

    We are seeking an experienced AI Inference Systems Engineer to join our growing team at Perplexity AI. Our current stack includes Python, C++, TensorRT-LLM, and Kubernetes, providing a unique opportunity to work on large-scale deployment of machine learning models for real-time inference.Key Responsibilities:Develop APIs for AI inference that will be used by...


  • San Francisco, California, United States OpenAI Full time

    Key Role: We're seeking a skilled Software Engineer to join our team at OpenAI and contribute to the development of our critical inference infrastructure.About the Job: As an Inference Infrastructure Engineer, you will work alongside machine learning researchers, engineers, and product managers to bring our latest technologies into production. Your primary...


  • San Jose, California, United States Adobe Full time

    Transforming Digital Experiences with AdobeWe're a company that's passionate about empowering people to create beautiful and powerful digital experiences. Our mission is to give everyone the tools they need to design and deliver exceptional experiences across every screen.The OpportunityWe're seeking an exceptional Site Reliability Engineering Manager to...


  • San Jose, California, United States Adobe Full time

    Job Title: Site Reliability Engineering Manager, AI PlatformAbout the Role:We are seeking an experienced Site Reliability Engineering Manager to lead our AI Inference Platform team at Adobe. As a key member of our Engineering organization, you will be responsible for developing and implementing strategies to ensure the reliability, scalability, and security...


  • San Francisco, California, United States Abridge Full time

    Job DescriptionAbridge is a pioneering healthcare technology company that's revolutionizing the way medical conversations are recorded and understood. As an ML Infrastructure Engineer, you'll play a critical role in scaling and deploying machine learning models to handle increasing traffic demands and integrate them with various platforms.Our team is...


  • San Francisco, California, United States Magical Tome Full time

    About TomeTome is a cutting-edge platform that empowers enterprise sellers and account managers to simplify complex research and strategic planning. Our state-of-the-art models leverage thousands of data sources to surface actionable knowledge about customers. A team of experienced sellers, engineers, and researchers tunes and customizes our system to meet...


  • San Francisco, California, United States Genmo Full time

    Role OverviewWe are seeking a senior software engineer to join our inference team at Genmo, a research lab dedicated to building open, state-of-the-art models for video generation. The successful candidate will be responsible for designing and scaling our inference systems to support millions of users across multiple data centers.Key ResponsibilitiesDevelop...


  • San Jose, California, United States PayPal Full time

    At PayPal, we're revolutionizing commerce globally, and we need a Senior AI/ML Platform Manager to help us scale our AI/ML infrastructure and platform.We're looking for a strong Senior Product Manager with a deep understanding of the AI/ML Platform stack and a strong business acumen to partner with Data Scientists and ML Engineers in delivering a...


  • San Jose, California, United States PayPal Full time

    Job Title: Senior AI/ML Platform ManagerAt PayPal, we're revolutionizing commerce globally, and we need a Senior AI/ML Platform Manager to help us scale our AI/ML infrastructure.Job Summary:We're looking for a strong Senior Product Manager with a deep understanding of the AI/ML Platform stack and a strong business acumen to partner with Data Scientists and...


  • San Francisco, California, United States Genmo Full time

    Job DescriptionWe are seeking a highly skilled Senior Staff AI Infrastructure Engineer to join our team at Genmo, a research lab dedicated to building open, state-of-the-art models for video generation. The ideal candidate will have a strong background in software engineering, with a focus on backend systems and ML infrastructure.Key Responsibilities:Design...


  • San Jose, California, United States PayPal, Inc. Full time

    Job Title: Senior AI/ML Platform ManagerJob Summary:PayPal, Inc. is seeking a Senior AI/ML Platform Manager to lead the development and implementation of our AI/ML platform. The successful candidate will have a strong background in AI/ML and experience in managing cross-functional teams.Key Responsibilities:* Develop and execute a long-term strategy for the...


  • San Francisco, California, United States Together AI Full time

    Job ResponsibilitiesInfrastructure Development:Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable AI/ML solutions.AI/ML Solutions:Develop advanced AI/ML infrastructure solutions to enhance the efficiency of our ML teams, leveraging expertise in distributed systems and large-scale data processing.System Design:Design and...


  • San Francisco, California, United States OpenAI Full time

    About the TeamThe Platform ML team at OpenAI is responsible for building the ML side of our state-of-the-art internal training framework used to train cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference.Our priorities are to maximize training throughput and...