Current jobs related to Research Engineer, Model Evaluations - San Francisco CA - Anthropic Limited

Research Engineer, Language Model Specialist

3 weeks ago

San Francisco, California, United States DALLAS VA RESEARCH CORPORATION Full time

About the Role: We are seeking a highly skilled Research Engineer to join our team at Dallas VA Research Corporation. As a Research Engineer, you will play a key role in advancing the state of the art in large language models. Key Responsibilities: Design and Develop Advanced Language Models: Design methods, tools, and infrastructure to push forward the state of...
Research Scientist, Language Model Development

1 week ago

San Francisco, California, United States Indiana Biosciences Research Institute Full time

About the Role: We are seeking a highly skilled Research Scientist to join our team at the Indiana Biosciences Research Institute. As a Research Scientist, you will play a key role in developing and advancing our large language models, working closely with our team of experts in natural language processing and machine learning. Key Responsibilities: Design and...
Research Scientist, Language Model Development

2 days ago

San Francisco, California, United States Indiana Biosciences Research Institute Full time

About the Role: We are seeking a highly skilled Research Engineer to join our Large Language Model (LLM) Research team. As a key member of our team, you will be responsible for designing and developing state-of-the-art LLMs, which we often open-source. Responsibilities: Design methods, tools, and infrastructure to push forward the state of the art in large...
Research Prompt Engineer

1 day ago

San Francisco, California, United States RI Research Instruments GmbH Full time

About the Role: We are seeking a highly skilled Research Prompt Engineer to join our team at RI Research Instruments GmbH. As a Research Prompt Engineer, you will play a crucial role in developing and refining the prompts that instruct our large language models to deliver high-quality outputs. Key Responsibilities: Develop new and innovative prompting strategies...
Senior Modeling and Simulation Engineer

3 weeks ago

San Antonio, Texas, United States Applied Research Solutions Full time

Position Overview: Applied Research Solutions is actively seeking a full-time Senior Modeling and Simulation Engineer. This role is integral to our commitment to delivering innovative solutions and technical expertise. Why Choose Applied Research Solutions? At Applied Research Solutions (ARS), we pride ourselves on being a premier provider of integrated...
Visual Generative Modeling Research Engineer

2 weeks ago

San Francisco, California, United States Genmo Full time

Job Description. Mission Statement: Genmo aims to democratize high-quality cinematic video content creation, empowering the next billion video creators to tell their stories. The Role: As a Research Engineer, you will develop innovative machine learning techniques in the domain of visual generative modeling. Key Responsibilities: Develop feature data sets...
Senior Modeling and Simulation Engineer

3 weeks ago

San Antonio, Texas, United States Applied Research Solutions Full time

Position Overview: Applied Research Solutions is in search of a dedicated Modeling and Simulation Engineer to enhance our capabilities in simulation studies and system performance analysis. Why Choose Us? At Applied Research Solutions (ARS), we pride ourselves on being a premier provider of integrated technical solutions. Recognized as a Best Places to Work...
Research Engineer, Language

3 weeks ago

San Francisco, United States DALLAS VA RESEARCH CORPORATION Full time

Meta is seeking a Research Engineer to join our Large Language Model (LLM) Research team. We conduct focused research and engineering to build state-of-the-art LLMs, which we often open-source, like our team’s recent Llama 2. We are looking for strong engineers who have a background in generative AI and NLP, with experience in areas like language model...
Research Engineer

3 weeks ago

San Francisco, California, United States Genai Works Full time

Job Description. About Genai Works: Genai Works is a leading innovator in AI-powered search solutions. We are seeking an experienced Research Engineer to join our team and contribute to the development of cutting-edge large-scale language models. Key Responsibilities: Lead Model Development: Design and implement state-of-the-art tradeoffs between model...
Research Engineer, Horizons

3 weeks ago

San Francisco, United States DALLAS VA RESEARCH CORPORATION Full time

As a Research Engineer on the Reinforcement Learning Fundamentals team at Anthropic, the role involves advancing the capabilities and safety of large language models through fundamental AI research in reinforcement learning. The position involves developing reinforcement learning techniques, creating environments for models, and designing experiments to...
Research Scientist, AI Safety

1 day ago

San Francisco, California, United States Indiana Biosciences Research Institute Full time

Job Title: Research Engineer, Horizons. Job Summary: We are seeking a highly skilled Research Engineer to join our team at the Indiana Biosciences Research Institute. As a Research Engineer, you will play a key role in advancing the capabilities and safety of large language models through fundamental AI research in reinforcement learning. Key...
Research Scientist, Post-Training Model Enhancement

5 days ago

San Francisco, California, United States OpenAI Full time

About the Team: Our team is responsible for the post-training or alignment of the models behind ChatGPT and the API. We integrate various improvements from the rest of the company into our RLHF process, ultimately producing the models used by hundreds of millions of users. About the Role: We are seeking a research scientist or senior machine learning engineer to...
Research Scientist, Post-Training Model Enhancement

2 days ago

San Francisco, California, United States OpenAI Full time

About the Team: Our team is responsible for the post-training or alignment of the models behind ChatGPT and the API. We integrate various improvements from the rest of the company into our RLHF process, ultimately producing the models used by hundreds of millions of users. About the Role: We are seeking a research scientist or senior machine learning engineer to...
Research Engineer

3 months ago

San Francisco, United States Genmo Full time

Job Description. Our mission: Genmo makes it easy for anyone to create movies, as if it were magic. Using our web application, any user can create cinematic video using a simple text prompt. We imagine a world where high-quality cinematic video content is as plentiful as water. Our mission is to empower the next billion video creators to tell their...
Research Scientist, Innovation Hub

3 weeks ago

San Francisco, California, United States DALLAS VA RESEARCH CORPORATION Full time

About the Role: We are seeking a highly skilled Research Engineer to join our team at Dallas VA Research Corporation, where you will play a key role in advancing the capabilities and safety of large language models through fundamental AI research in reinforcement learning. Key Responsibilities: Develop and implement reinforcement learning techniques to improve...
Senior Engineer in UUV Modeling and Analysis

3 weeks ago

San Antonio, Texas, United States Applied Research Associates Full time

Position Overview: Applied Research Associates (ARA) is on the lookout for a Senior Modeling and Simulation Engineer specializing in the physics of unmanned underwater vehicles (UUVs). This role is pivotal in supporting U.S. Navy initiatives related to UUV modeling, simulation, and testing. We are searching for a candidate with extensive knowledge of UUV...
Foundation Models ML Engineer

3 weeks ago

San Francisco, California, United States Wispr AI, Inc. Full time

About Wispr AI, Inc.: Wispr AI is pioneering a more intuitive approach to technology interaction through advanced neural interfaces. Our distinguished team comprises top-tier engineers, product designers, and research scientists dedicated to creating transformative solutions. We have successfully secured $25M in funding from leading venture capital firms,...
Foundation Models ML Engineer

3 weeks ago

San Francisco, California, United States Wispr AI, Inc. Full time

About Wispr AI, Inc.: Wispr AI is pioneering a more intuitive method for technology interaction through advanced neural interfaces. Our team comprises top-tier engineers, product designers, and research scientists dedicated to creating innovative solutions. We have successfully secured $25M in funding from prestigious venture capital firms such as NEA and 8VC....
Neuroimaging and Computational Modeling Research Scientist

2 weeks ago

San Francisco, California, United States University of California Full time

Job Summary: We are seeking a highly skilled Post-doctoral Research Fellow to join our team at the University of California. The successful candidate will have a strong background in neuroimaging and computational modeling, with expertise in graph theory, pattern recognition, and computational modeling. Key Responsibilities: Conduct advanced research in...
Staff Modeling and Simulation Engineer

10 hours ago

San Antonio, Texas, United States Applied Research Associates Full time

About the Role: We are seeking a highly skilled Staff Modeling and Simulation Engineer to join our team at Applied Research Associates. As a key member of our UUV Modeling and Simulation team, you will be responsible for conducting modeling and simulation activities that include analysis, simulation development, verification, and validation. Key...

Research Engineer, Model Evaluations

4 months ago

San Francisco CA, United States Anthropic Limited Full time

About the role:

We are looking for Research Engineers to build evaluations for our Claude family of large language models. Your job will be to design and implement evaluations that allow Anthropic researchers, decision makers, and members of the public to understand Claude’s abilities and personality. As a Research Engineer focused on evaluations, you’ll work closely with our research team to design experiments and build evaluation infrastructure. You’ll help establish Anthropic as the leader in extremely well-characterized AI systems, with performance that is exhaustively measured and validated across a wide range of important tasks. Our aim is to turn ambiguous notions of “intelligence” into clear metrics.

Responsibilities:

Designing and running a new evaluation that tests Claude’s reasoning capabilities, and creating a compelling visualization that illustrates the results
Running experiments to determine how prompting techniques affect results on industry benchmarks
Improving the tooling that researchers use to implement evaluations
Explaining our evaluations and their results to internal decision makers and stakeholders
Collaborating with a research team to develop a robust evaluation for a new model capability they are developing

You may be a good fit if you:

Have significant Python programming or machine learning research experience
Are excellent at data visualization
Have experience using large language models such as Claude
Are results-oriented, with a bias towards flexibility and impact
Pick up slack, even if it goes outside your job description
Enjoy pair programming (we love to pair)
Want to learn more about machine learning research
Care about the societal impacts of your work
Have clear written and verbal communication
Want to design and implement rigorous evaluations to deeply understand the capabilities, personality, and safety of large language models like Claude
Are excited to turn fuzzy notions of "AI intelligence" into clear, well-defined metrics that provide insight to researchers, decision-makers, and the public
Are energized by the challenge of assessing and steering powerful AI to be safe and beneficial

Strong candidates may also have experience with:

Building user interfaces for data analysis
Developing robust evaluation metrics for language models
Handling textual dataset sourcing, curation, and processing tasks at scale
Statistics

Deadline to apply: None. Applications will be reviewed on a rolling basis.
