Staff ML Research Engineer

2 weeks ago


San Francisco, CA, United States Cadre Full time

Staff ML Research Engineer

Staff+ machine learning research engineer with experience fine-tuning and post-training LLMs. We do not require healthcare experience and we value fast growing startup experience. You will push the boundaries of generative AI by translating cutting-edge research into working prototypes and experimental platforms. You'll work closely with fellow researchers, engineers, and product leads to explore novel architectures, fine-tuning methods, evaluation paradigms, and data strategies-helping to define what's possible with frontier AI models in healthcare and medicine.

Job Description

What You'll Do:

  • Prototype and Advance LLM Systems: Build and benchmark LLM-based systems and agents using open-source and proprietary models. Rapidly prototype new capabilities through fine-tuning, adapters, and reinforcement learning approaches.
  • Drive Research-First Experimentation: Translate recent academic papers into reproducible experiments, focusing on fine-tuning (e.g., LoRA, QLoRA, DPO), model alignment, and hallucination mitigation techniques. Design clear experiment plans and share findings across the team.
  • Build and Evolve Evaluation Pipelines: Define evaluation methodologies using human-in-the-loop feedback, synthetic benchmarks, and task-specific metrics. Implement continuous evaluation pipelines to track regressions and breakthroughs.
  • Shape Data and Training Strategy: Curate datasets via synthetic generation, targeted scraping, and annotation pipelines. Establish practices for discovering failure cases and improving model robustness over time.
  • Contribute to a Research-Driven Culture: Write research papers, internal memos, and blog posts. Foster a culture of experimentation, documentation, and knowledge-sharing across research and engineering teams.
Who You Are:

Research Fluent
  • Skilled at interpreting and replicating results from cutting-edge machine learning research.
  • Experienced in designing experiments, running ablation studies, and ensuring reproducibility.
  • 4+ years of experience in machine learning research, experimental AI, or applied AI engineering.
  • Demonstrated ability to replicate, extend, or publish original research.
Deep Expertise in LLM Fine-Tuning
  • Hands-on experience fine-tuning large language models and optimizing prompt and embedding strategies.
  • Proficient with Python and deep learning frameworks such as PyTorch, JAX, and Hugging Face Transformers.
  • Comfortable with distributed training environments and large-scale model experimentation.
Evaluation and Data Obsessed
  • Deep understanding of dataset curation, filtering, and alignment with evaluation goals.
  • Familiar with human annotation pipelines, ranking models (e.g., RM, RLAIF), and interpretability techniques.
  • Experienced in building evaluation frameworks tied to real-world task performance.
Collaborative and Curious
  • Thrives in research-driven environments with a commitment to experimentation, documentation, and cross-functional learning.
  • Excited to prototype, present findings, and build at the frontier of AI advancement.
Effective Interdisciplinary Collaborator
  • Able to work alongside clinicians, product managers, and fellow engineers
  • Strong communicator who can distill complex ML concepts for diverse audiences.
Mission-Aligned
  • Passion for healthcare or other mission-driven industries (e.g., education, climate tech)
  • Thrives in a fast-paced, early-stage environment; takes extreme ownership of deliverables
Nice-to-haves
  • Open-source contributions to ML libraries, datasets, or benchmarks
  • Experience working in AI research labs, frontier model companies, or early-stage AI startups
  • Background in RLHF, alignment research, or AI safety


  • San Francisco, CA, United States Bronco Full time

    About Us Bronco is an applied AI lab helping chipmakers keep Moore's law going. Our mission is to build AI silicon verification agents that find bugs, drive coverage, and help companies ship working chips on time. We're currently deployed alongside verification teams at leading chip companies building next generation hardware (4 nm process node). We are...


  • San Francisco, CA, United States Bronco Full time

    About Us Bronco is an applied AI lab helping chipmakers keep Moore's law going. Our mission is to build AI silicon verification agents that find bugs, drive coverage, and help companies ship working chips on time. We're currently deployed alongside verification teams at leading chip companies building next generation hardware (4 nm process node). We are...


  • San Francisco, CA, United States Bronco Full time

    About Us Bronco is an applied AI lab helping chipmakers keep Moore's law going. Our mission is to build AI silicon verification agents that find bugs, drive coverage, and help companies ship working chips on time. We're currently deployed alongside verification teams at leading chip companies building next generation hardware (4 nm process node). We are...


  • San Francisco, CA, United States Bronco Full time

    About Us Bronco is an applied AI lab helping chipmakers keep Moore's law going. Our mission is to build AI silicon verification agents that find bugs, drive coverage, and help companies ship working chips on time. We're currently deployed alongside verification teams at leading chip companies building next generation hardware (4 nm process node). We are...


  • San Francisco, CA, United States Bronco Full time

    About Us Bronco is an applied AI lab helping chipmakers keep Moore's law going. Our mission is to build AI silicon verification agents that find bugs, drive coverage, and help companies ship working chips on time. We're currently deployed alongside verification teams at leading chip companies building next generation hardware (4 nm process node). We are...


  • San Francisco, CA, United States Apple Full time

    Role Number: 200616697-3401 Summary We're building the next generation of AI evaluation systems — and we're looking for a hands-on engineer who can bridge ML, software, and product to make AI systems more measurable, testable, and trustworthy. We’re part of the AI/ML Evaluation organization, seeking a Senior or Staff-level Applied ML Engineer with strong...


  • San Francisco, CA, United States Apple Full time

    Role Number: 200616697-3401 Summary We're building the next generation of AI evaluation systems — and we're looking for a hands-on engineer who can bridge ML, software, and product to make AI systems more measurable, testable, and trustworthy. We’re part of the AI/ML Evaluation organization, seeking a Senior or Staff-level Applied ML Engineer with strong...


  • San Francisco, CA, United States Transparent Search Group Full time

    Job Description: Staff / Principal ML Engineer Predictive Modelling for Alternative Assets Full-time | Remote (North America) | $240K to $270K + Equity About the Company A fast-growing fintech startup is revolutionizing the valuation and trading of alternative assets. Their proprietary pricing engine powers real-time valuations, lending decisions, and risk...


  • San Francisco, CA, United States Top Engineer Full time

    TOP ENGINEER JOB POST!!! Confidential Search for International Employer Industry: Social Commerce / AI Technology Degree: BS in Computer Science or Mathematics from Top 40 University Experience: 4-8 years in Production ML Systems AI-POWERED SOCIAL COMMERCE REVOLUTION Role: Senior Machine Learning Engineer - Multimodal AI Join a leading partner in social...


  • San Francisco, CA, United States Top Engineer Full time

    TOP ENGINEER JOB POST!!! Confidential Search for International Employer Industry: Social Commerce / AI Technology Degree: BS in Computer Science or Mathematics from Top 40 University Experience: 4-8 years in Production ML Systems AI-POWERED SOCIAL COMMERCE REVOLUTION Role: Senior Machine Learning Engineer - Multimodal AI Join a leading partner in social...