Data Research Scientist
6 days ago
Data Research Scientist
Foundation Models, AI Research Institute
$200,000 - $350,000 salary + bonus
Join a groundbreaking AI research lab that is set to develop and publish transformative breakthroughs in GenAI, focusing on LLMs and Multimodal AI. Our team offers the chance to work at the forefront of AI research, tackling essential challenges in data handling and model training.
What You'll Do:
- Lead innovative research on data-centric approaches for LLMs, including pretraining corpus design and data valuation.
- Develop robust pipelines to process complex data sources into structured, reproducible training datasets.
- Create and optimize agentic data pipelines that integrate retrieval and self-curation for enhanced training quality.
- Collaborate on exciting research regarding alignment and reasoning training using data-driven methods.
- Prototype and implement evaluation frameworks to assess data quality and its impact on LLM reasoning.
- Publish your findings at prestigious conferences (e.g., NeurIPS, ICLR, ACL, EMNLP) and represent our institute globally.
- Contribute to open-source tools and datasets that propel forward the foundation model research community.
Requirements:
- Master's degree in Computer Science, Data Science, or a related field (PhD preferred).
- Experience in collecting and curating high-quality, multilingual text data.
- Hands-on expertise with large-scale dataset curation and preprocessing aimed at ML/LLM training.
- Demonstrated capability in synthesizing complex datasets, including code, math, and agentic data.
- Familiarity with ML infrastructure for scalable training and evaluation.
- Knowledge of data intersection with post-training methods (RL/SFT).
- Proven capability to independently pursue research questions related to data quality and reasoning.
Preferred Experience:
- Familiarity with retrieval-augmented generation (RAG) and reasoning benchmarks.
- Contributions to speculative decoding or reinforcement learning in data contexts.
- Background in knowledge graphs, semantic search, or indexing systems.
- Strong publication record in key AI conferences.
- Prior contributions to open-source ML data tools or benchmarks are a plus.
- Experience with speculative decoding and LLM serving engines.
- Experience training LLM as a functional judge.
- Expertise in tokenization and training tokenizers.
Why You Should Apply:
- Be part of a new division at the leading edge of AI innovation.
- Attractive salary and benefits package.
- Collaborate with top talents from FAANG and leading AI organizations.
- Comprehensive health insurance included.
- Relocation assistance is available for the right candidate.
San Francisco Bay Area, USA
Interested in applying? Please email your resume to stefani.lukic@storm3.com.
-
Sr. Data Scientist
6 days ago
Fremont, CA, United States Info Way Solutions Full timeWe are seeking 2 Senior Data Scientist to join our team. In this full-time remote role, you will be responsible for designing, developing, and implementing advanced statistical and machine learning models to help our clients make data-driven decisions. You will work closely with cross-functional teams to ensure successful implementation and adoption of...
-
Sr. Data Scientist
2 weeks ago
Fremont, CA, United States Info Way Solutions Full timeWe are seeking 2 Senior Data Scientist to join our team. In this full-time remote role, you will be responsible for designing, developing, and implementing advanced statistical and machine learning models to help our clients make data-driven decisions. You will work closely with cross-functional teams to ensure successful implementation and adoption of...
-
Sr. Data Scientist
3 days ago
Fremont, CA, United States Info Way Solutions Full timeWe are seeking 2 Senior Data Scientist to join our team. In this full-time remote role, you will be responsible for designing, developing, and implementing advanced statistical and machine learning models to help our clients make data-driven decisions. You will work closely with cross-functional teams to ensure successful implementation and adoption of...
-
Sr. Data Scientist
2 hours ago
Fremont, CA, United States Info Way Solutions Full timeWe are seeking 2 Senior Data Scientist to join our team. In this full-time remote role, you will be responsible for designing, developing, and implementing advanced statistical and machine learning models to help our clients make data-driven decisions. You will work closely with cross-functional teams to ensure successful implementation and adoption of...
-
Senior Data Scientist
2 weeks ago
Fremont, CA, United States Quantix Search Full timeSenior Data Scientist | $250K-$300K + Equity Join one of the most innovative AI companies in the world as a Senior Data Scientist! With over $230M in backing from leading investors and a valuation exceeding $1B, this company is shaping the future of conversational AI, serving prominent clients in the tech industry. In this pivotal role, you'll leverage...
-
Senior Data Scientist
6 days ago
Fremont, CA, United States Quantix Search Full timeSenior Data Scientist | $250K-$300K + Equity Join one of the most innovative AI companies in the world as a Senior Data Scientist! With over $230M in backing from leading investors and a valuation exceeding $1B, this company is shaping the future of conversational AI, serving prominent clients in the tech industry. In this pivotal role, you'll leverage...
-
Senior Data Scientist
2 weeks ago
Fremont, CA, United States Quantix Search Full timeSenior Data Scientist | $250K-$300K + Equity Join one of the most innovative AI companies in the world as a Senior Data Scientist! With over $230M in backing from leading investors and a valuation exceeding $1B, this company is shaping the future of conversational AI, serving prominent clients in the tech industry. In this pivotal role, you'll leverage...
-
Senior Data Scientist
2 weeks ago
Fremont, CA, United States Quantix Search Full timeSenior Data Scientist | $250K-$300K + Equity Join one of the most innovative AI companies in the world as a Senior Data Scientist! With over $230M in backing from leading investors and a valuation exceeding $1B, this company is shaping the future of conversational AI, serving prominent clients in the tech industry. In this pivotal role, you'll leverage...
-
(USA) Staff, Data Scientist
1 week ago
Fremont, CA, United States Sam's Club Full timePosition Summary... What you'll do... The Staff Data Scientist in the Data Science team will play a crucial role as an architect, leading AI explorations, research, algorithmic solution creation, and fast prototyping aimed to deliver impactful initiatives within the Sam's Club division of Walmart Inc. The position involves developing the foundational ML/AI...
-
(USA) Staff, Data Scientist
2 weeks ago
Fremont, CA, United States Sam's Club Full timePosition Summary... What you'll do... The Staff Data Scientist in the Data Science team will play a crucial role as an architect, leading AI explorations, research, algorithmic solution creation, and fast prototyping aimed to deliver impactful initiatives within the Sam's Club division of Walmart Inc. The position involves developing the foundational ML/AI...