Senior AI/ML Data Scientist – Natural Language Processing

1 month ago


Cambridge, Massachusetts, United States Merk Full time
Job Requirements

This posting has been created to pipeline talent for prospective roles that we anticipate will be needed in the future for our organization. By applying to this Pipeline Advertisement you will be submitting your interest to be contacted for future roles similar to what is described in the Pipeline Advertisement.

The Senior AI/ML Data Scientist – Natural Language Processing (NLP) role involves helping to develop and deploy production-grade NLP products for unstructured and semi-structured data from across our company's research and development pipeline. These models and workflows will help solve real-world problems and contribute to Artificial Intelligence and Machine Learning (AI/ML) in therapeutic research and development. Key focus areas will include the scalable deployment of ML and Generative AI approaches (such as Large Language Models, or LLMs) for surfacing insights from proprietary unstructured research data and biomedical literature, as well as developing fit-for-purpose approaches for the likes of text classification, relation extraction, and entity linking. The position is embedded in a cross-disciplinary team of data scientists, bioinformaticians, and engineers that are all focused on using cutting-edge software, AI/ML, and data science techniques to drive drug discovery and development.

Key responsibilities:

  • Staying updated on the newest methods in NLP, ML, and generative AI
  • Building novel tools that enable the discovery, development, and delivery of new therapeutics to patients in need
  • Understanding real-world challenges and developing automated data solutions for them
  • Opportunities to directly interact with users of your data science, ML, and AI products
  • Evaluating, developing, testing, and deploying new techniques for natural language understanding
  • Freedom to propose projects that interest you and to collaborate cross-functionally on delivery
  • Sharing the approaches you implement and their impact with internal company audiences and externally

Additional job details:

The types of datasets we focus on are both internal (e.g., electronic lab notebooks, safety reports, regulatory documents, clinical results) and external (e.g., public literature and Electronic Medical Records). In addition to new tool development, we often consult with some of our 5,000+ stakeholders (scientists, engineers, regulatory liaisons, data scientists, etc.) on their own projects, as well as additional stakeholders from across our company. We strive to enhance data science, NLP, and AI literacy across these groups. As part of our work, we have opportunities to co-author presentations, reports, manuscripts, and/or public code releases.

Work Experience

Education Requirements:

  • B.S. with 5 years industry experience focused on NLP, data science, AI/ML/LLM engineering, computer science, semantic engineering or a related discipline
  • OR M.S. with 2 years industry experience
  • OR PhD in data science, AI/ML/LLM engineering, computer science, semantic engineering or a related discipline

Minimum Requirements:

  • 1 year experience with Natural Language Processing, Generative AI or related techniques for machine understanding of natural language (i.e., written text, omics data, or similar)
  • 2 years experience with Python, Spark, or related frameworks in AI, machine learning, data science, data engineering or similar context

Preferred skills and experience, not required

  • Fluency in Python programming, version control and collaboration with git, environment management (e.g., poetry, conda, docker), standard Python packages (e.g., pandas, numpy, matplotlib), and at least one ML framework (e.g., pytorch, tensorflow, fairseq)
  • Experience with scalable data engineering frameworks such as Apache Spark and orchestration frameworks such as Airflow, and/or experience with semantic search and retrieval frameworks (e.g., development and benchmarking of embedding models and retrieval approaches in the context of Retrieval Augmented Generation, RAG)
  • Experience with ML model deployment and operations (e.g., DevOps, MLOps, LLMOps), including CI/CD workflows and tooling (e.g., Github actions)
  • Experience with standard operations on non-relational (e.g., Elasticsearch/Opensearch, MongoDB, Neptune), relational databases (e.g., PostgreSQL), and vector databases (e.g., pgvector, Elasticsearch dense vectors) and deployment of APIs and web applications (e.g., flask, fastAPI, django, or dash)
  • Working knowledge of statistical learning, such as supervised, unsupervised, and weakly supervised learning, particularly in NLP contexts
  • Working knowledge of NLP and/or Generative AI libraries (e.g., regular expressions, spacy, langchain), text annotation tools, and/or semantic frameworks (e.g. RDF triplestores, property graphs, ontology management)
  • A demonstrated ability to engage cross-functional teams and stakeholders, including an eagerness to acquire a level of domain knowledge
  • Excellent communication, teamwork, didactic, and leadership skills, including skills for scientific communication (authoring scientific articles and presenting) and guidance and mentorship of junior employees and less experienced collaborators

Requisition ID:P-100850



  • Cambridge, Massachusetts, United States Aurora Flight Sciences Corporation Full time

    Job SummaryAurora Flight Sciences Corporation is seeking a highly skilled AI/ML Research Scientist - Autonomy to join our team. As a key member of our research and development team, you will be responsible for developing and applying cutting-edge AI/ML techniques to address technical challenges in aerospace and defense applications.Key...


  • Cambridge, Massachusetts, United States Aurora Flight Sciences Corporation Full time

    Position OverviewAt Aurora Flight Sciences Corporation, we are at the forefront of designing, constructing, and operating advanced aircraft and their supporting technologies. We are looking for a skilled and driven AI/ML Research Scientist specializing in Autonomy to contribute to the evolution of flight technology. Key ResponsibilitiesResponsibilities will...


  • Cambridge, Massachusetts, United States Aurora Flight Sciences Corporation Full time

    Position OverviewAt Aurora Flight Sciences Corporation, we are at the forefront of designing, constructing, and operating advanced aerial vehicles and enabling technologies, transforming innovative concepts into tangible realities. We are seeking a skilled and driven AI/ML Research Scientist focused on Autonomy to contribute to the evolution of aviation. Key...


  • Cambridge, Massachusetts, United States Merck Full time

    Job DescriptionOur Artificial Intelligence and Machine Learning (AI/ML) capabilities are vital catalysts for our mission to invent new medicines that save and enhance lives. The Data, AI, and Genome Sciences (DAGS) function at our organization adopts an AI/ML-first approach to enhance target and biomarker discovery by driving the understanding of complex...


  • Cambridge, Massachusetts, United States Aurora Flight Sciences Corporation Full time

    Position OverviewAt Aurora Flight Sciences Corporation, we are at the forefront of designing, constructing, and operating advanced aircraft and pioneering technologies that transform concepts into reality. We are seeking a skilled and driven AI/ML Applied Researcher – Autonomy to contribute to the evolution of flight technology. Key Responsibilities:-...


  • Cambridge, Massachusetts, United States Aurora Flight Sciences Corporation Full time

    Position OverviewAt Aurora Flight Sciences Corporation, we are at the forefront of designing, constructing, and operating cutting-edge aircraft and associated technologies, transforming visionary concepts into tangible realities. We are seeking a highly skilled and driven AI/ML Researcher specializing in Autonomy to contribute to the evolution of flight...


  • Cambridge, Massachusetts, United States Cypress HCM Full time

    Position OverviewRole: Lead AI/ML Solutions ArchitectLocation: Boston (5 days onsite)Company Size: 50 Employees | Team Structure: 3 MembersSector: Healthcare/Medical DeviceWe are seeking a highly skilled Lead AI/ML Solutions Architect to enhance our organization's capabilities in artificial intelligence and data science. This pivotal role will involve...


  • Cambridge, Massachusetts, United States Merck Full time

    Job DescriptionOur Artificial Intelligence Machine Learning (AI/ML) capabilities are critical accelerators to our mission to delivering towards inventing new medicines that save and improve lives. Core to the Data, AI, and Genome Sciences (DAGS) function is an AI/ML-first approach to improving target and biomarker discovery, validation and selection and...


  • Cambridge, Massachusetts, United States Apple, Inc. Full time

    Would you like to play a part in building the next generation of generative AI applications at Apple? We're looking for data scientists and engineers to work on ambitious projects that will impact the future of Apple, our products, and the broader world. In this role, you'll have the opportunity to tackle innovative problems in machine learning, particularly...


  • Cambridge, Massachusetts, United States Flare Therapeutics Full time

    About Flare TherapeuticsWe are a biotechnology company pioneering a new therapeutic space with a novel approach to decipher the biology of transcription factors to develop small molecule medicines.Our team has uncovered 'switch sites,' druggable regions that are key targets for transcription factor regulation, to address mutations that cause disease.We have...


  • Cambridge, Massachusetts, United States Motion Recruitment Full time

    Motion Recruitment is seeking a highly skilled Technical Project Manager with a focus on AI initiatives.This role is available for remote or hybrid candidates and offers a contract with comprehensive benefits.As a Senior Technical Project Leader in AI, you will be instrumental in the execution of AI-driven projects by collaborating with diverse business and...


  • Cambridge, Massachusetts, United States Motion Recruitment Full time

    About the Opportunity:Our client, a prestigious educational institution, is seeking a Senior AI Project Leader to join their team on a contractual basis.This role is open to candidates working remotely or in a hybrid model. The position is a W2 contract with full benefits, initially set for a 3-month period with the possibility of extension.As a Senior AI...


  • Cambridge, Massachusetts, United States Flagship Ventures Full time

    Position Overview:We are in search of a dynamic and driven scientist with substantial expertise in computational structural biology, coupled with outstanding AI/ML computational and programming capabilities. This role is pivotal in providing strategic vision, innovative thinking, and critical insights for the advancement of novel protein-based therapies at...

  • Senior Data Scientist

    3 weeks ago


    Cambridge, Massachusetts, United States Hopper Full time

    About the jobAs a Sr Data Scientist, you will be focusing on solving data and engineering problems that allow Hopper to offer products that help our customers navigate common travel headaches (like flight delays, cancellations, or price anxiety), and ultimately get where they want to be. This role will touch all of Hopper's fintech products and be focused on...


  • Cambridge, Massachusetts, United States Sail Biomedicines Full time

    About Sail:Sail Biomedicines is harnessing evolutionary and artificial intelligence to revolutionize programmable medicines. Sail's platform combines first-in-class programmable RNA technology (Endless RNATM or eRNA), and an industry-leading platform of programmable nanoparticles, utilizing natural components, to unlock comprehensive programming of medicines...


  • Cambridge, Massachusetts, United States Apple, Inc. Full time

    Would you like to play a part in building the next generation of generative AI applications at Apple? We're looking for data scientists and engineers to work on ambitious projects that will impact the future of Apple, our products, and the broader world. In this role, you'll have the opportunity to tackle innovative problems in machine learning, particularly...


  • Cambridge, Massachusetts, United States J&J Family of Companies Full time

    Position Title: Postdoctoral Researcher - AI in RadiologyOverview:At Johnson & Johnson Innovative Medicine (JJIM), we are on the forefront of healthcare innovation, seeking a postdoctoral researcher specializing in Computer Vision within the AI/ML Radiology domain. This role can be based in various locations or may allow for remote work depending on company...


  • Cambridge, Massachusetts, United States Amgen Full time

    HOW MIGHT YOU DEFY IMAGINATION?If you feel like you're part of something bigger, it's because you are. At Amgen, our shared mission—to serve patients—drives all that we do. It is key to our becoming one of the world's leading biotechnology companies. We are global collaborators who achieve together—researching, manufacturing, and delivering ever-better...


  • Cambridge, Massachusetts, United States The Society for the Preservation of Natural History Collections. Full time

    About the RoleThe Society for the Preservation of Natural History Collections seeks a visionary leader to oversee the development of a comprehensive strategy for the natural history library collections and services.Key ResponsibilitiesDirect the strategic planning process for the natural history library collections and services.Plan, develop, administer, and...


  • Cambridge, Massachusetts, United States Bristol Myers Squibb Full time

    Join Our Team At Bristol Myers Squibb, we offer a unique opportunity to engage in work that is both challenging and impactful. Our environment fosters innovation and collaboration, where every team member contributes to transformative advancements in patient care. Here, you will find a place to grow your career alongside a diverse group of high-achieving...