Current jobs related to Engineering Manager, AI Inference Systems - San Francisco, California - Genai Works


  • San Francisco, California, United States Perplexity AI Full time

    AI Model Inference Specialist at PerplexityWe are looking for a skilled AI Model Inference Specialist to contribute to our innovative team at Perplexity. If you have a strong interest in creating AI solutions and engaging with advanced technologies, this role may be an excellent fit for you.Role Overview:Become a vital part of a rapidly expanding team at...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled Systems Research Engineer to join our team at Together AI. As a key member of our research team, you will play a crucial role in designing and building the next generation AI platform.Key ResponsibilitiesOptimize and fine-tune existing training and inference platforms to achieve better performance and...


  • San Francisco, California, United States Invisible AI Inc. Full time

    About Invisible AI Inc.At Invisible AI, we are pioneering advancements in computer vision technology. Our primary objective is to create a comprehensive platform that enhances manufacturing processes through digitization. By deploying edge AI cameras, we aim to transform manual assembly tasks, ensuring accuracy, reliability, and safety in people-driven...


  • San Francisco, California, United States Invisible AI Inc. Full time

    About Invisible AI Inc.At Invisible AI, we are pioneering advancements in computer vision technology. Our primary mission is to create a comprehensive platform that transforms manufacturing processes. By utilizing edge AI cameras, we aim to enhance the accuracy, reliability, and safety of manual assembly tasks, thereby revolutionizing people-driven...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled Systems Research Engineer to join our team at Together AI. As a key member of our research team, you will play a crucial role in researching and building the next generation AI platform.Key ResponsibilitiesDesign and develop large-scale distributed training systems and low-latency/high-throughput inference...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled Systems Research Engineer to join our team at Together AI. As a key member of our research team, you will play a crucial role in designing and building the next generation AI platform.Key ResponsibilitiesOptimize and fine-tune existing training and inference platforms to achieve better performance and...


  • San Francisco, California, United States Hyperbolic Labs Full time

    About Hyperbolic LabsWe're a pioneering company at the intersection of AI and open-source technology, on a mission to democratize AI by breaking down barriers to computing power. Our Open-Access AI Cloud offers an innovative GPU marketplace and AI inference service, making AI innovation accessible, secure, and affordable for all.The RoleWe're seeking an AI...


  • San Francisco, California, United States Zoom Corporation Full time

    OverviewAs an AI Infrastructure Engineer at Zoom Corporation, you will be responsible for the development and management of our advanced AI systems and frameworks. Your contributions will significantly enhance the training, deployment, and operational aspects of AI, ensuring improved functionality, scalability, and reliability. This role is essential in...


  • San Francisco, California, United States Descript, Inc. Full time

    About the RoleWe are seeking an experienced Engineering Manager to lead our AI Platform team at Descript, Inc. This is a unique opportunity to work at the intersection of research and applied AI, bringing cutting-edge technology to our users.Key ResponsibilitiesTeam Leadership: Manage, build out, and mentor a team of high-performing AI engineers to drive...


  • San Jose, California, United States Hume AI Full time

    About Hume AIHume AI is a pioneering company dedicated to developing artificial intelligence that prioritizes human well-being. Our mission is to create AI systems that are guided by human values, addressing the most critical challenge of the 21st century.Our ApproachWe employ a novel approach called reinforcement learning from human expression (RLHE), which...


  • San Jose, California, United States Hume AI Full time

    About Hume AIHume AI is a pioneering company dedicated to developing artificial intelligence that prioritizes human well-being. Our mission is to create AI systems that are guided by human values, addressing the most pressing challenge of the 21st century.Our ApproachWe employ a novel approach called reinforcement learning from human expression (RLHE), which...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    About the RoleWe are seeking an experienced Engineering Manager to lead our AI Platform team at Snorkel AI, Inc. This is a unique opportunity to join a cutting-edge technology company and contribute to the development of innovative AI solutions.Key ResponsibilitiesLead a team of talented engineers to design, develop, and deploy large-scale data-focused AI...

  • AI Engineering Lead

    4 weeks ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to spearhead our AI Platform division. This team is responsible for developing cutting-edge software systems that enhance the Snorkel Flow platform. Responsibilities include creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...

  • AI Engineering Lead

    3 weeks ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking an Engineering Director to spearhead our AI Platform division. This team is responsible for developing cutting-edge software solutions that drive the Snorkel Flow platform. The focus includes creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric methodologies,...

  • AI Engineering Lead

    3 weeks ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking an experienced Director of Engineering to spearhead our AI Platform division. This team is responsible for developing cutting-edge software systems that drive the Snorkel Flow platform. Key responsibilities include creating services for training and deploying generative AI and machine learning models utilizing innovative...


  • San Francisco, California, United States Magic Inc Full time

    Join Our Team at Magic IncBecome a pivotal part of our mission to construct and securely implement cutting-edge, superhuman AI technologies. We are developing an AI companion for programmers that operates seamlessly within their systems—intelligent, engaging, and dependable across various fields.Role Overview: As a Senior Software Engineer, you will be...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to oversee our AI Platform division. This team is responsible for developing cutting-edge software systems that drive the Snorkel Flow platform. Responsibilities include creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to oversee our AI Platform division. This team is responsible for developing cutting-edge software systems that enhance the Snorkel Flow platform. The focus includes creating services for training and deploying generative AI and machine learning models utilizing advanced data-centric methodologies,...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to spearhead our AI Platform division. This team is responsible for creating cutting-edge software solutions that enhance the Snorkel Flow platform. Responsibilities include developing services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...


  • San Francisco, California, United States The Learning Experience #363 Full time

    About The Learning Experience #363 The Learning Experience #363 is dedicated to fostering innovative and effective educational solutions. Our goal is to create environments where learning thrives, ensuring that our systems are not only efficient but also enhance the overall educational experience. Position Overview: As an Infrastructure Engineer specializing...

Engineering Manager, AI Inference Systems

3 months ago


San Francisco, California, United States Genai Works Full time

About the Team

The Applied AI team safely brings OpenAI's technology to the world. We released ChatGPT, Plugins, DALL·E, and the APIs for GPT-4, GPT-3, embeddings, and fine-tuning. We also operate inference infrastructure at scale. There's a lot more on the immediate horizon.

We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

We serve end-users directly through ChatGPT, and serve developers through our APIs, which power product features that were never before possible.

About the Role

Model inference at OpenAI is powered through a single service we call our "Engine". The Engine wraps the PyTorch transformers which are GPT-4 and ChatGPT. We are looking for an engineering manager to help lead some of the critical work for this service and grow the team.

In this role, you will:

  • Own substantial portions of our inference stack
  • Ensure we have the ability to run GPT-4, ChatGPT, and future models at increasingly high scale with increasing efficiency
  • Hire world-class AI systems engineers in one of the most competitive hiring markets
  • Coordinate the inference needs of OpenAI's teams and products
  • Create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think

You might thrive in this role if you:

  • Have 3+ years of experience in engineering management and 7+ years as an IC working with high scale distributed systems and ML systems.
  • Have experience with ML systems, particularly high scale distributed inference for modern LLMs.
  • Have experience with highly available, reliable, production grade systems at scale
  • Have familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented
  • Care deeply about diversity, equity, and inclusion, and have a track record of building inclusive teams
  • Have experience closing extremely competitive candidates for your team, and the ability to craft and convey compelling visions of the future
  • Have a voracious and intrinsic desire to learn and fill in missing skills—and an equally strong talent for sharing learnings clearly and concisely with others
  • Are comfortable with ambiguity and rapidly changing conditions. You view changes as an opportunity to add structure and order when necessary

As technical context: at the heart of our infrastructure is a large-scale deployment of GPU nodes running in dozens of Kubernetes clusters across regions. Some core technologies we build with include Python, PyTorch, CUDA, Triton, Redis, Infiniband, NCCL, NVLink

This role is exclusively based in our San Francisco HQ. We offer relocation assistance to new employees.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via thislink.

OpenAI Global Applicant Privacy Policy

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.


#J-18808-Ljbffr