Senior Infrastructure Engineer, GPU Platform

3 weeks ago


San Francisco, California, United States OpenAI Full time
About the Role

The Applied Engineering team at OpenAI is responsible for running the infrastructure that supports the models backing ChatGPT and the API. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely.

This role is part of the inference compute team, which builds and maintains infrastructure abstractions allowing OpenAI to run models at scale. The team is based in San Francisco, CA, and uses a hybrid work model of 3 days in the office per week.

As a Senior Infrastructure Engineer, you will design and build the inference infrastructure that powers our products, enabling reliability and performance. You will also ensure our infrastructure can scale to the next order of magnitude and help create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think.

The ideal candidate has 10+ years of experience building core infrastructure, experience running GPU clusters at scale, and experience operating orchestration systems such as Kubernetes at scale. They should take pride in building and operating scalable, reliable, secure systems and be comfortable with ambiguity and rapid change.

This role is exclusively based in our San Francisco HQ and we offer relocation assistance to new employees.

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link. OpenAI Global Applicant Privacy Policy

  • San Francisco, California, United States Openai Full time

    About the TeamThe Applied Engineering team at OpenAI works across research, engineering, product, and design to bring the company's technology to consumers and businesses.You'll join the team responsible for running the infrastructure that supports the models backing ChatGPT and the API.The systems we support include inference kubernetes clusters, GPU...


  • San Diego, California, United States MediaTek Full time

    Job Title: Senior GPU Driver EngineerMediaTek is a global leader in developing innovative systems-on-chip (SoC) for mobile devices, home entertainment, connectivity, and IoT products. We are seeking a highly skilled Senior GPU Driver Engineer to join our team in San Diego/San Jose office.Responsibilities:Design and implement interfaces between GPU driver and...


  • San Jose, California, United States Adobe Full time

    Job DescriptionWe are seeking a highly skilled Senior AI Engineer to join our team at Adobe. As a key member of our platform engineering team, you will be responsible for designing, developing, and maintaining robust AI/ML infrastructure solutions to support the training and deployment of large-scale AI models.Key Responsibilities:Design and develop scalable...


  • San Jose, California, United States Tik Tok Full time

    Job Title: Senior Software Engineer - Generative AI InfrastructureJob SummaryWe are seeking a highly skilled Senior Software Engineer to join our Generative AI team at TikTok. As a key member of our infrastructure team, you will be responsible for designing, developing, and deploying scalable and reliable software infrastructure to support our Generative AI...

  • Platform Engineer

    4 weeks ago


    San Francisco, California, United States Voltage Park Inc Full time

    About Voltage Park On-DemandVoltage Park's mission is to make AI infrastructure accessible to all. Today, we own 24,000+ H100s and operate 7+ data-centers across the US. We serve customers of all sizes, from small research labs to large enterprises.We're looking for a skilled Platform Engineer to join our On-Demand team, where you'll help us build a platform...

  • Platform Engineer

    4 weeks ago


    San Francisco, California, United States Voltage Park Inc Full time

    About Voltage Park On-DemandVoltage Park's mission is to make AI infrastructure accessible to all. As a leading provider of AI infrastructure, we own 24,000+ H100s and operate 7+ data-centers across the US, serving customers of all sizes.Role OverviewWe're seeking a highly skilled Platform Engineer to join our On-Demand team, where you'll help us build a...

  • Senior Cloud Engineer

    4 weeks ago


    San Francisco, California, United States Crusoe Full time

    Job OverviewCrusoe Energy is revolutionizing the way we harness energy from stranded resources. As a Senior Cloud Engineer on the AI Infrastructure team, you'll play a pivotal role in designing and implementing the next-generation AI inference platform.This is a unique opportunity to build a high-performance AI product that will be central to Crusoe's...


  • San Francisco, California, United States Fieldguide Full time

    Job OverviewFieldguide is a pioneering company revolutionizing trust in global commerce and capital markets through innovative automation and streamlining of assurance and audit practices, particularly in cybersecurity, privacy, and ESG. We're seeking a seasoned Senior Platform Engineer to spearhead our Infrastructure Platform team, ensuring the stability,...


  • San Francisco, California, United States Unreal Gigs Full time

    Design and Build AI InfrastructureAs an AI Infrastructure Architect at Unreal Gigs, you will play a critical role in building the platforms that support machine learning and AI development across the organization. You will work closely with data scientists, software engineers, and DevOps teams to ensure that AI systems run efficiently, securely, and at...


  • San Francisco, California, United States Apple Full time

    Role OverviewThe Machine Learning Platform & Technology (MLPT) team in the AIML organization at Apple is seeking a Senior Engineering Program Manager for its ML Compute Platform. This platform provides services to all internal Apple developers focused on providing efficient and scalable compute and processing for machine learning lifecycle from model...


  • San Diego, California, United States Hillbot Full time

    About Us:Hillbot is a pioneering start-up that combines cutting-edge Generative AI with advanced robotics technologies. Our mission is to develop comprehensive robot foundation models that revolutionize the field and set new industry standards. We are seeking a highly skilled Senior Infrastructure Engineer who is passionate about infrastructure, data, and...


  • San Francisco, California, United States Fieldguide Full time

    About Us:Fieldguide is a company that aims to establish trust in global commerce and capital markets by automating and streamlining the work of assurance and audit practitioners, specifically in cybersecurity, privacy, and ESG. We build software for the people who enable trust between businesses.We're a remote-first company based in San Francisco, CA, but we...


  • San Francisco, California, United States Crusoe Full time

    About the RoleAs a Senior/Staff Software Engineer on the Managed AI team at Crusoe, you'll have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform.You will lead the design and implementation of core systems for our AI services, including resilient fault-tolerant queues, model catalogs, and scheduling...


  • San Francisco, California, United States Genmo Full time

    Role OverviewWe are seeking a senior software engineer to join our inference team at Genmo, a research lab dedicated to building open, state-of-the-art models for video generation. The successful candidate will be responsible for designing and scaling our inference systems to support millions of users across multiple data centers.Key ResponsibilitiesDevelop...


  • San Jose, California, United States Samsung Electronics Full time

    Job SummarySamsung Electronics is seeking a highly skilled Senior GPU Performance Engineer to join our Xclipse GPU software team. As a key member of our team, you will be responsible for developing and optimizing GPU IP from architectural planning to productization.Key Responsibilities:Optimize and fine-tune GPU-based systems and applications for maximum...


  • San Francisco, California, United States Together AI Full time

    Job ResponsibilitiesInfrastructure Development:Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable AI/ML solutions.AI/ML Solutions:Develop advanced AI/ML infrastructure solutions to enhance the efficiency of our ML teams, leveraging expertise in distributed systems and large-scale data processing.System Design:Design and...


  • San Jose, California, United States Adobe Full time

    Job SummaryWe are seeking a highly skilled Senior AI Engineer to join our team at Adobe. As a key member of our platform, you will be responsible for designing, developing, and maintaining robust AI/ML infrastructure solutions to support the training and deployment of large-scale AI models. Key ResponsibilitiesDesign and develop AI/ML infrastructure...


  • San Diego, California, United States Qualcomm Full time

    Job SummaryQualcomm is seeking a highly skilled Senior GPU Machine Learning Engineer to join our team. As a leading technology innovator, we push the boundaries of what's possible to enable next-generation gaming, XR, and AI experiences.The successful candidate will be responsible for architecting, designing, implementing, verifying, and optimizing the...


  • San Francisco, California, United States Sprig Full time

    About SprigSprig is a cutting-edge technology company empowering the fastest-growing and largest companies to build digital products for people, not just data points. Our platform enables product teams to ask any question about their product, observe user interactions, and receive actionable product recommendations to drive success.We're proud to partner...


  • San Francisco, California, United States Brex Full time

    Why Brex?Brex is the AI-powered spend platform that helps companies spend with confidence. Our integrated corporate cards, banking, and global payments, plus intuitive software for travel and expenses, make us a leader in the industry. Tens of thousands of companies from startups to enterprises, including DoorDash, Flexport, and Compass, use Brex to...