Senior Manager, AI Engineer

4 weeks ago


San Francisco, United States Capital One Full time

Center 3 (19075), United States of America, McLean, Virginia

Overview:

At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been an industry leader in using machine learning to create real-time, personalized customer experiences. Our investments in technology infrastructure and world-class talent - along with our deep experience in machine learning - position us to be at the forefront of enterprises leveraging AI. From informing customers about unusual charges to answering their questions in real time, our applications of AI & ML are bringing humanity and simplicity to banking. We are committed to continuing to build world-class applied science and engineering teams to deliver our industry-leading capabilities with breakthrough product experiences and scalable, high-performance AI infrastructure. At Capital One, you will help bring the transformative power of emerging AI capabilities to reimagine how we serve our customers and businesses who have come to love the products and services we build.

Team Description:

The Intelligent Foundations and Experiences (IFX) team is at the center of bringing our vision for AI at Capital One to life. We work hand-in-hand with our partners across the company to advance the state of the art in science and AI engineering, and we build and deploy proprietary solutions that are central to our business and deliver value to millions of customers. Our AI models and platforms empower teams across Capital One to enhance their products with the transformative power of AI, in responsible and scalable ways for the highest-leverage impact.

In this role, you will:
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products that change how our associates work and how our customers interact with Capital One.
  • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
  • Leverage a broad stack of open source and SaaS AI technologies such as AWS UltraClusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more.
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance - scalability, cost, latency, throughput - of large-scale production AI systems.
  • Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One.
The Ideal Candidate:
  • You love to build systems, take pride in the quality of your work, and also share our passion to do the right thing. You want to work on problems that will help change banking for good.
  • You are passionate about staying abreast of the latest research, and you can intuitively understand scientific publications and judiciously apply novel techniques in production.
  • You adapt quickly and thrive on bringing clarity to big, undefined problems. You love asking questions and digging deep to uncover the root of problems, and you can articulate your findings concisely and clearly. You have the courage to share new ideas even when they are unproven.
  • You are deeply technical. You possess a strong foundation in engineering and mathematics, and your expertise in hardware, software, and AI enables you to see and exploit optimization opportunities that others miss.
  • You are a resilient trailblazer who can forge new paths to achieve business goals when the route is unknown.
Basic Qualifications:
  • Bachelor's degree in Computer Science, Engineering, or AI plus at least 6 years of experience developing AI and ML algorithms or technologies, or Master's degree plus at least 4 years of experience developing AI and ML algorithms or technologies.
  • At least 6 years of experience programming with Python, Go, Scala, or Java.
Preferred Qualifications:
  • 7 years of experience deploying scalable and responsible AI solutions on cloud platforms (e.g. AWS, Google Cloud, Azure, or equivalent private cloud).
  • Experience designing, developing, integrating, delivering, and supporting complex AI systems.
  • Demonstrated ability to lead and mentor an engineering team and influence cross-functional stakeholders.
  • Experience developing AI and ML algorithms or technologies (e.g. LLM Inference, Similarity Search and VectorDBs, Guardrails, Memory) using Python, C++, C#, Java, or Golang.
  • Master's degree in Computer Science, Computer Engineering, or relevant technical field.
  • Experience developing and applying state-of-the-art techniques for optimizing training and inference software to improve hardware utilization, latency, throughput, and cost.
  • Passion for staying abreast of the latest AI research and AI systems, and for judiciously applying novel techniques in production.
  • Excellent communication and presentation skills, with the ability to articulate complex AI concepts to peers.