Machine Learning Researcher
4 days ago
Machine Learning Researcher – Pretraining Systems
This role will require relocation to New York
Location:
United States (preferred)
Team:
Advanced Modeling Research
Discipline:
Pretraining Dynamics & Large-Scale Optimization
The Opportunity
We seek the top 1% of ML researchers. We're building a small research group focused on
frontier-scale pretraining
—where systems design, data mixtures, and optimization dynamics converge. This role sits at the intersection of modeling, distributed training, and empirical science: understanding how scale transforms representation, and how to steer it.
Our client consistently attracts the top 1% of elite ML and DL talent working on some of the largest datasets on the planet...
You'll operate as
both experimentalist and theorist
—running controlled ablations at scale, profiling training behaviors, and distilling the principles that make large models learn efficiently.
A PhD. is valued highly in this regard - however high performing MA may be considered. Typically we require 2-5 years of experience post PhD.
Core Research Areas
- Investigate
pretraining objectives
that enhance generalization, compositional reasoning, and long-horizon coherence. - Design
data mixture experiments
that balance entropy, redundancy, and signal—mapping mixture composition to model scaling efficiency. - Develop instrumentation for
training dynamics
(loss surfaces, gradient flow, activation distributions) to predict inflection points during pretraining. - Collaborate on
distributed systems optimization
—scheduling, sharding, and checkpointing for multi-node, high-throughput pretraining runs. - Explore
representation diagnostics
across model scales—alignment drift, retention, and capability formation. - Build
evaluation harnesses
for emergent behavior tracking—reasoning, tool-use proxy metrics, and temporal consistency tests.
Candidate Profile
We're interested in individuals who've g
one beyond running large models
—those who've
interpreted
them. You may have:
- Designed or scaled pretraining runs (10B+ parameters) or equivalent high-throughput distributed learning systems.
- Authored or contributed to research in
scaling laws, mixture sampling, or self-supervised pretraining
. - Deep familiarity with
distributed optimization frameworks
(FSDP, DeepSpeed, Megatron-LM, JAX/TPU). - Proven skill in
profiling model behavior
—from gradient noise scale to tokenization effects. - Experience in
data-centric experimentation
: filtering, mixing, and quality assessment for large corpora. - Rigor in
numerical reasoning
about efficiency, throughput, and empirical reproducibility.
Technical Stack
- Frameworks:
PyTorch / JAX, custom distributed schedulers, FSDP / DeepSpeed / Megatron - Languages:
Python, C++ (or sim. for profiling and system instrumentation) - Compute Scale:
Multi-node clusters (A100/H100 class GPUs or TPUv4/5) - Data Systems:
Versioned mixtures, tokenizer pipelines, distributed sampling
Research Ethos
This team values results that are:
- Empirical
— grounded in reproducible scaling evidence. - Data-aware
— understanding that data is the architecture. - Systematic
— bridging algorithmic intuition with compute pragmatism. - Quantitative
— every hypothesis testable by metrics that matter: throughput, loss curvature, generalization slope.
Indicators of Fit
- You've seen a 100B-parameter model diverge—and can explain why.
- You can quantify the cost of a tokenization decision.
- You can reduce a 72-hour pretraining run to 48 with the same validation curve.
- You think about scaling laws the way others think about architecture diagrams.
Reward
NOTE: This client is focused on calibre. Reward ranges will be highly competitive and in line with a culture of high performance and high bar to entry.
About Sentiro Partners | Leadership for the Augmentation Era
We continuously engage with the world's elite researchers, engineers, and data quants — the technical leaders shaping the next generation of intelligent systems.
Sentiro Partners works with pioneering organizations across America, Europe, and Asia to identify the minds advancing Data Science, Machine Learning, Quant Research, and AI Engineering. In the Augmentation Era, intelligence is amplified by algorithms and human insight.
Sentiro Partners was founded by Adrian Clarke, a veteran data science headhunter.
-
Robotics Machine Learning Research Scientist
6 days ago
Los Altos, California, United States Toyota Research Institute Full time $176,000 - $264,000At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities to amplify the human experience. To lead this ground-breaking shift in mobility, we've built an extraordinary team in Automated Driving, Energy & Materials, Human-Centered AI, Human-Interactive Driving, and Robotics.The...
-
Machine Learning Intern
4 days ago
Los Angeles, California, United States GenPark Full time $60,000 - $90,000 per yearCompany DescriptionGenPark AI is on a mission to connect emerging Oriental direct-to-consumer brands with Gen Z consumers globally. Using AI-driven personalized marketing, we create shopping experiences that are unique, engaging, and tailored to individual preferences. Our platform bridges cultural gaps and introduces trendsetting brands to a wider audience,...
-
Machine Learning Ops Engineer
1 week ago
Los Angeles, California, United States Keck Medicine of USC Full time $120,000 - $200,000 per yearSummary:Under the direction of Information Services Leadership, the incumbent will be responsible for the full lifecycle management of machine learning models, including design, build, and maintenance of machine learning models. The MLOps Engineer will play an integral role in implementing artificial intelligence solutions across Keck Medicine of USC. The...
-
Senior Machine Learning Engineer
2 days ago
Los Angeles, California, United States Capital Group Full time"I can succeed as a Machine Learning Engineer at Capital Solutions Group Technology (CSGT)"As a Machine Learning Engineer in CSGT, you will design and implement intelligent systems that enhance portfolio construction, investment research, and monitoring. You'll collaborate closely with investment professionals, product managers, and fellow engineers to...
-
Senior Machine Learning Engineer
2 days ago
Los Angeles, California, United States Apple Full timeApple Maps and the thousands of applications it empowers are being used by millions every single day As a fundamental tool for human activity, Maps technology is evolving and new techniques are emerging. We are looking for a Machine Learning Engineer to join and play a big part in the next revolution of Maps; to enable users to find more things in...
-
Machine Learning Software Engineer
7 days ago
Los Angeles, California, United States EVONA Full time $200,000 - $250,000 per yearMachine Learning Engineer – Edge AI & Embedded Vision | US-Based | Remote | Cutting-Edge AutonomyAre you passionate about real-time computer vision, model optimization, and deploying AI in the real world? We're working with a venture-backed startup developing next-generation autonomous systems for defense and industry. They're on the hunt for an...
-
Senior Machine Learning Engineer
1 week ago
Los Angeles, California, United States TechStarsGroup Full time $138,000 - $220,000 per yearOur client is on a mission to transform healthcare using the power of artificial intelligence. We are dedicated to leveraging AI to make a significant impact on healthcare, aiming to enhance patient care by making every patient's journey a cornerstone for healthcare decisions. Our goal is to improve treatment outcomes and accelerate drug discovery by...
-
Machine Learning Engineer, AiDP
4 days ago
Los Angeles, California, United States Apple Full time $120,000 - $250,000 per yearThe people here at Apple don't just build products - we craft the kind of wonder that's revolutionized entire industries. It's the diversity of those people and their ideas that supports the innovation that runs through everything we do, from amazing technology to industry-leading environmental efforts. Join Apple, and help us leave the world better than we...
-
Los Altos, California, United States Toyota Research Institute Full time $45 - $65At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we've built a world-class team in Automated Driving, Energy & Materials, Human-Centered AI, Human Interactive Driving, Large Behavioral Models,...
-
Los Altos, California, United States Toyota Research Institute Full time $128,000 - $192,000 per yearAt Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we've built a world-class team in Automated Driving, Energy & Materials, Human-Centered AI, Human Interactive Driving, Large Behavior Models,...