Principal Data Scientist

1 month ago


San Francisco, United States Capital One Full time

Center 2 (19050), United States of America, McLean, Virginia

Principal Data Scientist - Emerging ML

Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer using statistical modeling and the relational database, cutting edge technology in 1988 Fast-forward a few years, and this little innovation and our passion for data has skyrocketed us to a Fortune 200 company and a leader in the world of data-driven decision-making.

As a Data Scientist at Capital One, you'll be part of a team that's leading the next wave of disruption at a whole new scale, using the latest in computing and machine learning technologies and operating across billions of customer records to unlock the big opportunities that help everyday people save money, time and agony in their financial lives.

Team Description

Emerging ML is the data science and machine learning team inside Capital One's Applied Research organization. We focus on research and development of new technologies within the domain of Artificial Intelligence with a focus on Embeddings and Foundation Models. We partner closely with our product and engineering teams to connect emerging technologies with business critical use cases across Capital One's lines of business.

As part of Emerging ML, you will work on things like:

  • Conducting research into self supervised learning, transformer models, and representation learning
  • Building customer behavioral models (using transaction, clickstream, and other data) that identify trends, patterns, and relationships related to product usage
  • Refining integration patterns for encoder and decoder models for downstream use cases to connect Applied Research products and business use cases
Role Description

This is an individual contributor position. In Emerging ML, you will work at all phases of the data science lifecycle, including:

  • Build machine learning models through all phases of development, from design through training, evaluation and validation, and partner with engineering teams to operationalize them in scalable and resilient production systems that serve 50+ million customers.
  • Partner closely with a variety of business and product teams across Capital One to conduct the experiments that guide improvements to customer experiences and business outcomes in domains like marketing, servicing and fraud prevention.
  • Write software (Python, Scala, e.g.) to collect, explore, visualize and analyze numerical and textual data (billions of customer transactions, clicks, payments, etc.) using tools like Spark and AWS.
The Ideal candidate will be:
  • Curious and creative. You thrive on bringing definition to big, undefined problems. You love asking questions, and you love pushing hard to find the answers. You're not afraid to share a new idea. You communicate clearly and effectively to share your findings with non-technical audiences.
  • Technical: You have hands-on experience developing data science solutions from concept to production using open source tools and modern cloud computing platforms. You are not afraid of petabytes of data.
  • Statistically-minded. You have built models, validated them and backtested them. You know how to interpret a confusion matrix or a ROC curve. You have experience with clustering, classification, sentiment analysis, time series analysis and deep learning.
  • Customer and product oriented. You share our passion for changing banking for good.
Basic Qualifications
  • Currently has, or is in the process of obtaining a Bachelor's Degree plus 5 years of experience in data analytics, or currently has, or is in the process of obtaining a Master's Degree plus 3 years in data analytics, or currently has, or is in the process of obtaining PhD, with an expectation that required degree will be obtained on or before the scheduled start date
  • At least 1 year of experience in open source programming languages for large scale data analysis
  • At least 1 year of experience with machine learning
  • At least 1 year of experience with relational databases
Preferred Qualifications:
  • Masters in "STEM" field (Science, Technology, Engineering, or Mathematics) plus 3 years of experience in data analytics
  • Experience building transformer models at scale (>100M parameters)
  • Understanding of self-supervised learning methods
  • Strong foundation in software engineering
  • At least 1 year of experience working with AWS
  • At least 2 years' experience in Python, Scala, or R for large scale data analysis
  • At least 2 years' experience with machine learning
  • At least 2 years' experience with SQL
#J-18808-Ljbffr

  • San Francisco, United States Unreal Gigs Full time

    Introduction:Are you a master of data, with the ability to extract meaningful insights from the most complex datasets? Do you have the expertise to develop advanced models that not only solve challenging problems but also drive strategic decisions? If you’re a visionary data scientist who thrives on turning raw data into actionable intelligence, then our...


  • San Francisco, United States Windfall Data Inc Full time

    At Windfall, we leverage data-driven insights to help organizations achieve their goals, from non-profits boosting their fundraising efforts to commercial companies improving their marketing ROI. We are looking for a seasoned Principal Data Scientist to play a pivotal role in developing and scaling our foundational predictive models, such as household net...


  • San Francisco, United States ZipRecruiter Full time

    Job DescriptionIntroduction:Are you a master of data, with the ability to extract meaningful insights from the most complex datasets? Do you have the expertise to develop advanced models that not only solve challenging problems but also drive strategic decisions? If you’re a visionary data scientist who thrives on turning raw data into actionable...


  • San Francisco, United States Windfall Data, Inc. Full time

    At Windfall, we leverage data-driven insights to help organizations achieve their goals, from non-profits boosting their fundraising efforts to commercial companies improving their marketing ROI. We are looking for a seasoned Principal Data Scientist to play a pivotal role in developing and scaling our foundational predictive models, such as household net...


  • San Mateo, United States Snowflake Computing Full time

    The Product Data Science team is looking for a Principal Full-stack Data Scientist to come aboard and be part of some of the foundational areas at Snowflake. In this role, you will work closely with our Product and Engineering teams on core operations and data privacy, security, and governance domains. You will also work on long-running analytical...

  • Staff Data Scientist

    2 weeks ago


    San Francisco, United States Data Masked Full time

    This range is provided by Harnham. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $200,000.00/yr - $250,000.00/yr Additional compensation types Annual Bonus and RSUs Data Science Focused Recruitment Consultant Title: Staff Data Scientist Location: Company based in SF, CA - willing to...


  • South San Francisco, California, United States Genentech Full time

    Job Title: Principal Machine Learning Scientist, Data ArchitectAbout the Role:Genentech is seeking a highly skilled Principal Machine Learning Scientist, Data Architect to join our Computational Sciences organization. This role will focus on building novel machine-learning methods to enhance drug development and clinical trial design, with a particular...


  • San Francisco, United States Gap Inc. Full time

    Principal Data Scientist - Recommendation SystemQualificationsProven experience (at a senior level) in developing and implementing recommendation algorithms in a retail or e-commerce environmentStrong proficiency in programming languages such as Python or RSolid understanding of machine learning techniques, deep learning, and natural language...


  • San Mateo, United States Snowflake Computing Full time

    Build the future of the AI Data Cloud. Join the Snowflake team. The Product Data Science team is looking for a Principal Full-stack Data Scientist to come aboard and be part of some of the foundational areas at Snowflake. In this role, you will work closely with our Product and Engineering teams on core operations and data privacy, security, and governance...


  • San Jose, California, United States Capital One Full time

    As a Principal Data Scientist, you will play a key role in driving innovation and growth at Capital One. You will have the opportunity to work with cutting-edge technologies and develop solutions that impact millions of customers.**Estimated Salary:** $174,900 - $199,700 per year**Key Skills and Qualifications:**Strong background in machine learning and...


  • San Bruno, United States Walmart Full time

    What you'll do...Position: Principal Data ScientistJob Location: 850 Cherry Avenue, San Bruno, CA 94066Duties: Research, experiment, and design advanced statistical models and cutting-edge machine learning/deep learning algorithms to produce total market trend forecasting and predict future Wal-Mart growth opportunities. Drive machine learning model...


  • san francisco, United States Skills Alliance Full time

    About the Role: As a Bioinformatics Principal Scientist on our Computational Biology team, you will advance our genetic medicine technologies, enabling the discovery and development of new therapeutics. Working with a diverse team of cell biologists, protein and RNA engineers, computational scientists, molecular biologists, and translational biologists, you...


  • san francisco, United States Skills Alliance Full time

    About the Role: As a Bioinformatics Principal Scientist on our Computational Biology team, you will advance our genetic medicine technologies, enabling the discovery and development of new therapeutics. Working with a diverse team of cell biologists, protein and RNA engineers, computational scientists, molecular biologists, and translational biologists, you...


  • San Francisco, United States Windfall Full time

    About Windfall At Windfall, we leverage data-driven insights to help organizations achieve their goals, from non-profits boosting their fundraising efforts to commercial companies improving their marketing ROI. We are looking for a seasoned Principal Data Scientist to play a pivotal role in developing and scaling our foundational predictive models, such as...


  • San Francisco, United States Windfall Full time

    At Windfall, we leverage data-driven insights to help organizations achieve their goals, from non-profits boosting their fundraising efforts to commercial companies improving their marketing ROI. We are looking for a seasoned Principal Data Scientist to play a pivotal role in developing and scaling our foundational predictive models, such as household net...


  • San Francisco, United States Windfall Full time

    At Windfall, we leverage data-driven insights to help organizations achieve their goals, from non-profits boosting their fundraising efforts to commercial companies improving their marketing ROI. We are looking for a seasoned Principal Data Scientist to play a pivotal role in developing and scaling our foundational predictive models, such as household net...


  • San Mateo, United States Snowflake Computing Full time

    Build the future of the AI Data Cloud. Join the Snowflake team. The Product Data Science team is looking for a Principal Full-stack Data Scientist to come aboard and be part of some of the foundational areas at Snowflake. In this role, you will work closely with our Product and Engineering teams on bringing insights to the massive amount of data. You will...


  • San Diego, United States Intuit Full time

    Overview Intuit is hiring a Principal Data Science Architect to join our Intuit AI team. Come join our collaborative and creative group of data scientists and machine learning engineers and design and build AI systems that directly affect hundreds of thousands of our customers. In this role, you will be designing, building, and deploying machine learning...


  • San Francisco, United States BioSpace, Inc. Full time

    Job Details HOW MIGHT YOU DEFY IMAGINATION? If you feel like youre part of something bigger, its because you are. At Amgen, our shared missionto serve patientsdrives all that we do. It is key to our becoming one of the worlds leading biotechnology companies. We are global collaborators who achieve togetherresearching, manufacturing, and delivering...


  • San Francisco, California, United States Unreal Gigs Full time

    AI/ML Data Science LeadershipWelcome to Unreal Gigs, where innovation meets expertise. We're seeking a seasoned AI/ML Data Scientist to lead our team in shaping the future of data science.Key Responsibilities:Technical Leadership: Direct and mentor a team of AI/ML data scientists, providing guidance and support in driving AI/ML innovation.Project Planning...