Lead AI Systems Engineer

2 weeks ago


San Francisco, California, United States One Full time
Lead AI Systems Engineer - Generative AI Product Development (Remote-Eligible)

At Capital One, our goal is to develop trustworthy, reliable, and human-centric AI systems that transform the banking experience. We have been at the forefront of the industry, leveraging machine learning to create intelligent, automated customer interactions. Our applications of AI & ML are designed to simplify banking and enhance customer service. With our significant investments in public cloud infrastructure and machine learning platforms, we are uniquely equipped to harness the potential of AI. We are dedicated to building exceptional applied science and engineering teams to advance our capabilities in delivering innovative product experiences and scalable, high-performance AI infrastructure.

We are seeking a seasoned Lead Generative AI Engineer to assist in the development and maintenance of APIs and SDKs for training, fine-tuning, and accessing AI models at scale. As a member of our Enterprise AI team, you will create systems that empower users to engage with Large Language Models (LLMs) and Foundation Models (FMs) using our public cloud infrastructure. Collaborating with a team of top-tier AI engineers and researchers, you will design and implement essential API products and services that support real-time customer-facing applications. Key projects you will be involved in include:

  • Architecting, building, and deploying well-managed core APIs and SDKs for accessing LLMs and proprietary FMs, including tasks related to training, fine-tuning, and prompting, along with orchestration SDKs.
  • Designing APIs focused on performance, real-time applications, scalability, user-friendliness, and governance automation.
  • Developing application-specific interfaces that utilize LLMs and FMs to enhance both associate and customer experiences.
  • Empowering users to create new Generative AI capabilities.
  • Creating tools and processes to monitor API access patterns and operational health.
  • Designing and implementing AI safety measures and guardrails within the API layer in close collaboration with researchers.

Capital One is open to hiring a Remote Employee for this opportunity.

Basic Qualifications:
  • Bachelor's degree in Computer Science, Computer Engineering, or a related technical field.
  • A minimum of 4 years of experience in designing, building, and deploying machine learning application platforms.
  • At least 4 years of programming experience in Python, Go, Scala, or Java.
  • A minimum of 1 year of experience in building, scaling, and optimizing training or inferencing systems for deep neural networks.
Preferred Qualifications:
  • Experience in developing large-scale AI products or platforms for NLP, speech, computer vision, or recommendation systems serving millions of users.
  • Ability to thrive in a fast-paced environment with ambiguity and competing priorities.
  • Experience in technology and product-driven companies or startups is preferred.
  • Capability to rapidly iterate with researchers and engineers to enhance product experiences while establishing foundational capabilities.
  • Familiarity with deploying large neural network models in demanding production settings.
  • Experience with API security, observability, cloud access control, and privacy best practices.

At this time, Capital One will not sponsor a new applicant for employment authorization for this position.


  • AI Engineering Lead

    2 weeks ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to spearhead our AI Platform division. This team is responsible for developing cutting-edge software systems that enhance the Snorkel Flow platform. Responsibilities include creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...

  • AI Engineering Lead

    1 week ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking an Engineering Director to spearhead our AI Platform division. This team is responsible for developing cutting-edge software solutions that drive the Snorkel Flow platform. The focus includes creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric methodologies,...

  • AI Engineering Lead

    1 week ago


    San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking an experienced Director of Engineering to spearhead our AI Platform division. This team is responsible for developing cutting-edge software systems that drive the Snorkel Flow platform. Key responsibilities include creating services for training and deploying generative AI and machine learning models utilizing innovative...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to oversee our AI Platform division. This team is responsible for developing cutting-edge software systems that drive the Snorkel Flow platform. Responsibilities include creating services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to oversee our AI Platform division. This team is responsible for developing cutting-edge software systems that enhance the Snorkel Flow platform. The focus includes creating services for training and deploying generative AI and machine learning models utilizing advanced data-centric methodologies,...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    Position OverviewWe are seeking a Director of Engineering to spearhead our AI Platform division. This team is responsible for creating cutting-edge software solutions that enhance the Snorkel Flow platform. Responsibilities include developing services for training and deploying generative AI and machine learning models, utilizing innovative data-centric...


  • San Francisco, California, United States Perplexity AI Full time

    Position OverviewPerplexity AI is on the lookout for a seasoned Search Engineer to revolutionize the search framework that drives our innovative products. If you are passionate about advancing technology and making a substantial difference, this opportunity is tailored for you.Key Responsibilities Architecting and developing extensive infrastructure to...


  • San Francisco, California, United States Fractional AI Full time

    About Fractional AIFractional AI is a premier development firm focused on practical AI applications. We tackle complex AI challenges that our clients lack the resources or expertise to address independently, moving beyond technical jargon and flashy presentations to implement AI solutions efficiently.We are convinced that the transformative potential of...


  • San Francisco, California, United States Snorkel AI, Inc. Full time

    About the RoleWe are seeking an experienced Engineering Manager to lead our AI Platform team at Snorkel AI, Inc. This is a unique opportunity to join a cutting-edge technology company and contribute to the development of innovative AI solutions.Key ResponsibilitiesLead a team of talented engineers to design, develop, and deploy large-scale data-focused AI...

  • AI Solutions Engineer

    2 weeks ago


    San Francisco, California, United States Fractional AI Full time

    About Fractional AIFractional AI is a premier development firm specializing in applied artificial intelligence. We tackle complex AI-driven challenges that our clients lack the resources or expertise to address independently, streamlining the process to implement AI solutions efficiently.We are convinced that the transformative potential of generative AI...

  • AI Solutions Engineer

    2 weeks ago


    San Francisco, California, United States Perplexity AI Full time

    Job OverviewPerplexity AI is at the forefront of developing an innovative answer engine, enabling users to discover information in more effective and engaging ways. We leverage large language models (LLMs) for knowledge retrieval at scale, catering to millions of users globally. To further our mission, we are seeking skilled engineers to design...


  • San Francisco, California, United States Hayden AI Full time

    About Us At Hayden AI, we strive to leverage artificial intelligence and machine learning to revolutionize how governments and enterprises tackle real-world issues. Our cutting-edge mobile perception system is designed to optimize transit operations, enhance street safety, and promote a sustainable future through innovative solutions like bus lane...


  • San Francisco, California, United States Hayden AI Full time

    About Hayden AI At Hayden AI, we strive to leverage artificial intelligence and machine learning to revolutionize how organizations and governments tackle pressing challenges. Our cutting-edge mobile perception system is designed to enhance transit efficiency, improve street safety, and promote sustainable practices through innovative solutions, including...


  • San Francisco, California, United States Fractional AI Full time

    About Fractional AIWe are a cutting-edge technology company specializing in applied AI solutions. Our team of experts helps large enterprises automate complex workflows, leveraging the power of generative AI to drive innovation and efficiency.Our mission is to empower businesses to unlock the full potential of AI, streamlining processes and driving growth....


  • San Diego, California, United States Shield AI Full time

    About Shield AI:Founded in 2015, Shield AI is a venture-backed defense technology firm dedicated to safeguarding service members and civilians through intelligent systems. Our mission is to create the world's leading AI pilot, known as Hivemind, which has successfully operated various aircraft including fighter jets and drones. Position Overview:We are...


  • San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.:Decagon AI, Inc. is at the forefront of developing sophisticated conversational AI systems tailored for enterprise applications. Our innovative solutions have attracted a diverse clientele, enhancing customer interactions through human-like support capabilities.Position Overview:We are seeking an experienced AI Engineer to contribute...


  • San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.:Decagon AI, Inc. is at the forefront of developing state-of-the-art conversational AI solutions tailored for enterprise needs. Our innovative AI agents are designed to deliver a customer support experience that mirrors human interaction, empowering businesses to enhance their customer service capabilities and streamline their...


  • San Francisco, California, United States Decagon AI, Inc. Full time

    About Decagon AI, Inc.:Decagon AI, Inc. is at the forefront of developing sophisticated conversational AI systems tailored for enterprise needs. Our innovative solutions have garnered the trust of numerous clients, enhancing their customer support capabilities and streamlining their operational efficiency.Position Overview:We are seeking an experienced AI...


  • San Francisco, California, United States Untether AI Full time

    Untether AI is looking for a talented AI Applications Engineer to join our Product team to support our customers with SDK for our custom AI accelerator devices. You will be working with data scientists to ensure their AI workloads are ported and running efficiently on Untether AI products. Must be a US citizen to apply.Ideal candidate profileYou have...


  • San Francisco, California, United States Together AI Full time

    About the RoleWe are seeking a highly skilled AI Researcher to join our team at Together AI. As an AI Researcher, you will play a key role in pushing the frontier of foundation model research and making them a reality in products.Key ResponsibilitiesDevelop novel architectures, system optimizations, optimization algorithms, and data-centric optimizations...