Research Engineer, Model Evaluations

4 months ago


San Francisco CA, United States Anthropic Limited Full time

About the role:

We are looking for Research Engineers to build evaluations for our Claude family of large language models. You will design and implement evaluations that allow Anthropic researchers, decision makers, and members of the public to understand Claude’s abilities and personality. As a Research Engineer focused on evaluation, you'll work closely with our research team to design experiments and build evaluation infrastructure. You'll help establish Anthropic as the leader in well-characterized AI systems whose performance is exhaustively measured and validated across a wide range of important tasks, turning ambiguous notions of “intelligence” into clear metrics.

Responsibilities:
  • Designing and running a new evaluation that tests Claude’s reasoning capabilities, and creating a compelling visualization that illustrates the results
  • Running experiments to determine how prompting techniques affect results on industry benchmarks
  • Improving the tooling that researchers use to implement evaluations
  • Explaining our evaluations and their results to internal decision makers and stakeholders
  • Collaborating with a research team to develop a robust evaluation for a new model capability they are developing
You may be a good fit if you:
  • Have significant Python programming or machine learning research experience
  • Are excellent at data visualization
  • Have experience using Large Language Models such as Claude
  • Are results-oriented, with a bias towards flexibility and impact
  • Pick up slack, even if it goes outside your job description
  • Enjoy pair programming (we love to pair)
  • Want to learn more about machine learning research
  • Care about the societal impacts of your work
  • Have clear written and verbal communication
  • Want to design and implement rigorous evaluations to deeply understand the capabilities, personality, and safety of large language models like Claude
  • Are excited to turn fuzzy notions of "AI intelligence" into clear, well-defined metrics that provide insight to researchers, decision makers, and the public
  • Are energized by the challenge of assessing and steering powerful AI to be safe and beneficial
Strong candidates may also have experience with:
  • Building user interfaces for data analysis
  • Developing robust evaluation metrics for language models
  • Handling textual dataset sourcing, curation, and processing tasks at scale
  • Statistics

Deadline to apply: None. Applications will be reviewed on a rolling basis.
