AI Infrastructure Engineer

2 weeks ago


San Francisco, California, United States Zoom Corporation Full time

Overview

As an AI Infrastructure Engineer at Zoom Corporation, you will be responsible for the development and management of our advanced AI systems and frameworks. Your contributions will significantly enhance the training, deployment, and operational aspects of AI, ensuring improved functionality, scalability, and reliability. This role is essential in refining and advancing Zoom's AI capabilities.

Team Insights

We are on the lookout for a dedicated AI Infrastructure Engineer to join our dynamic AI infrastructure team. Our mission is to oversee the comprehensive Machine Learning Platform, which encompasses model training and the underlying infrastructure. We strive to boost efficiency, optimize GPU training, and enhance the throughput and latency of language model inference.

Key Responsibilities

  • Designing and implementing the management system for the Machine Learning Platform.
  • Creating the necessary toolchains, services, and pipelines for model development workflows and serving architectures.
  • Establishing priorities for various metrics related to model training and inference monitoring.
  • Building and maintaining a high-performance GPU infrastructure for LLM training and cluster management.
  • Gaining insights into autoscaling for inference services and managing multiple models for dynamic loading.
  • Providing support, troubleshooting, and resolving issues that arise during training and inference processes.

Candidate Profile

  • A degree in Computer Science or a related field.
  • A strong grasp of AI principles, Software Engineering, or Machine Learning.
  • Proficiency in Python and PyTorch, along with familiarity in Git and software development methodologies.
  • Experience with cloud computing platforms such as AWS, Azure, or Google Cloud, and familiarity with AI frameworks like TensorFlow and Nvidia/CUDA.
  • Expertise in Docker, with practical experience in Kubernetes, YAML, Deployment, ConfigMap, and PV/PVC.
  • Experience with operating systems like Linux and Ubuntu, and proficiency in Shell scripting.

Compensation Overview

At Zoom, we value your skills and experience, offering a competitive salary range that reflects your qualifications.

Work Environment

Our hybrid work model is designed to support both office and remote work settings, ensuring flexibility for our team members.

Employee Benefits

We are committed to fostering a positive workplace culture and offer a comprehensive benefits program aimed at supporting our employees' physical, mental, emotional, and financial well-being.

About Zoom Corporation

At Zoom, we empower individuals to connect and collaborate effectively. Our goal is to create the leading collaboration platform for enterprises, enhancing communication through our diverse range of products.

Commitment to Diversity

We believe that the unique contributions of every team member drive our success. We are an equal opportunity employer and value diversity in our workforce, ensuring a welcoming environment for all individuals.



  • San Francisco, California, United States Perplexity AI Full time

    Job DescriptionPerplexity AI is seeking a highly skilled Backend Software Engineer to join our team of innovators revolutionizing the way people interact with the internet. As a key member of our engineering team, you will be responsible for designing, implementing, and scaling systems that power our API products.Key Responsibilities:Design and Develop APIs...

  • AI Solutions Engineer

    2 weeks ago


    San Francisco, California, United States Perplexity AI Full time

    Job OverviewPerplexity AI is at the forefront of developing an innovative answer engine, enabling users to discover information in more effective and engaging ways. We leverage large language models (LLMs) for knowledge retrieval at scale, catering to millions of users globally. To further our mission, we are seeking skilled engineers to design...


  • San Francisco, California, United States Unum AI Full time

    About Unum AIUnum AI is a deep-tech startup revolutionizing data infrastructure for extreme scale and AI applications.Job DescriptionWe are seeking a highly skilled Senior DevOps Engineer: Data Infrastructure Specialist to join our team in designing and implementing next-generation data infrastructure solutions.Key ResponsibilitiesOrchestrate software...


  • San Francisco, California, United States Unum AI Full time

    **About Unum AI**We are a deep-tech startup revolutionizing data infrastructure for extreme scale and artificial intelligence. Our mission is to empower the next million data-intensive and AI applications.**Job Summary**We are seeking passionate and competitive Senior C++ Research Engineers to join our team in designing next-generation data...


  • San Francisco, California, United States Perplexity AI Full time

    Position OverviewPerplexity AI is on the lookout for a seasoned Search Engineer to revolutionize the search framework that underpins our innovative products. If you are passionate about advancing technology and making a substantial difference, this opportunity is tailored for you.Key ResponsibilitiesDeveloping and architecting extensive infrastructure to...


  • San Francisco, California, United States Perplexity AI Full time

    Position OverviewPerplexity AI is on the lookout for a seasoned Search Engineer to redefine the architecture of our search framework that underpins our offerings. If you are passionate about innovation and making a substantial difference, this position may be the perfect fit for you.Key ResponsibilitiesDeveloping and constructing extensive infrastructure to...


  • San Francisco, California, United States Relyance AI Full time

    About the RoleRelyance AI is seeking a highly skilled Lead DevSecOps Engineer - Cloud Infrastructure Specialist to join our team. As a key member of our engineering team, you will be responsible for designing and implementing secure and scalable cloud infrastructure components that meet the needs of our customers.Key ResponsibilitiesCloud Infrastructure...


  • San Francisco, California, United States Scale AI, Inc. Full time

    About the RoleWe are seeking a highly skilled Cloud Infrastructure Engineer to join our Platform Engineering team at Scale AI, Inc. As a key member of our team, you will be responsible for designing and developing core cloud infrastructure platforms and systems, while supporting orchestration, data abstraction, data pipelines, identity & access management,...


  • San Francisco, California, United States Relyance AI Full time

    About the RoleRelyance AI is seeking a highly skilled Lead DevSecOps Engineer to join our team. As a key member of our engineering team, you will be responsible for leading the development and implementation of cloud infrastructure components with a focus on high availability, scalability, and reliability.Key ResponsibilitiesCloud Infrastructure Leadership:...


  • San Francisco, California, United States Unum AI Full time

    About Unum AIUnum AI is a deep-tech startup revolutionizing data infrastructure for extreme scale and AI applications.Job DescriptionWe are seeking a highly skilled Data Infrastructure Engineer to join our team and contribute to the design and development of next-generation data infrastructure.Key ResponsibilitiesOrchestrate Software Development and Hardware...


  • San Francisco, California, United States Unum AI Full time

    About Unum AIUnum AI is a deep-tech startup revolutionizing data infrastructure for extreme scale and AI applications.Job DescriptionWe are seeking a highly skilled Data Infrastructure Engineer to join our team and contribute to the design and development of next-generation data infrastructure.Key ResponsibilitiesOrchestrate software development and hardware...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled AI Researcher to join our team at Together AI. As an AI Researcher, you will play a key role in pushing the frontier of foundation model research and making them a reality in products.Key ResponsibilitiesDevelop novel architectures, system optimizations, optimization algorithms, and data-centric optimizations...

  • Data Engineer

    1 day ago


    San Francisco, California, United States Acceler8 Talent Full time

    About Acceler8 Talent: We're a pioneering company at the forefront of AI and ML technology, where human-computer collaboration is not just a concept but a reality. Our team is dedicated to revolutionizing user experiences by innovating at every level, from user interfaces down to the most efficient models.Our Mission: We're advancing our mission to enhance...


  • San Francisco, California, United States Unum AI Full time

    **About Unum AI**We are a deep-tech startup revolutionizing the field of data infrastructure to support extreme scale and artificial intelligence. Our mission is to empower a million data-intensive and AI applications with cutting-edge technology.**Job Summary**We are seeking a highly skilled Senior C++ Research Engineer to join our team in designing and...


  • San Francisco, California, United States Argo AI Full time

    Job SummaryWe are seeking a highly skilled Backend Software Engineer to join our team at Argo AI. As a Backend Software Engineer, you will play a critical role in building and maintaining our data infrastructure, enabling our next-generation logistics platform.About the RoleAs a Backend Software Engineer, you will be responsible for designing, developing,...


  • San Francisco, California, United States The Learning Experience #363 Full time

    About The Learning Experience #363 The Learning Experience #363 is dedicated to fostering innovative and effective educational solutions. Our goal is to create environments where learning thrives, ensuring that our systems are not only efficient but also enhance the overall educational experience. Position Overview: As an Infrastructure Engineer specializing...


  • San Francisco, California, United States Untether AI Full time

    Untether AI is looking for a talented AI Applications Engineer to join our Product team to support our customers with SDK for our custom AI accelerator devices. You will be working with data scientists to ensure their AI workloads are ported and running efficiently on Untether AI products. Must be a US citizen to apply.Ideal candidate profileYou have...


  • San Francisco, California, United States Fractional AI Full time

    About Fractional AIWe are a cutting-edge technology company specializing in applied AI solutions. Our team of experts helps large enterprises automate complex workflows, leveraging the power of generative AI to drive innovation and efficiency.Our mission is to empower businesses to unlock the full potential of AI, streamlining processes and driving growth....


  • San Francisco, California, United States Magic Inc Full time

    Join Our Team at Magic IncBecome a pivotal part of our mission to construct and securely implement cutting-edge, superhuman AI technologies. We are developing an AI companion for programmers that operates seamlessly within their systems—intelligent, engaging, and dependable across various fields.Role Overview: As a Senior Software Engineer, you will be...


  • San Francisco, California, United States Fractional AI Full time

    About Fractional AIFractional AI is a premier development firm focused on practical AI applications. We tackle complex AI challenges that our clients lack the resources or expertise to address independently, moving beyond technical jargon and flashy presentations to implement AI solutions efficiently.We are convinced that the transformative potential of...