Research Engineer, Post-training Model Capability

3 weeks ago


San Francisco, United States OpenAI Full time

Location: San Francisco

About the Team

Our team is responsible for the “post-training” or alignment of the models behind ChatGPT and the API. We integrate various improvements from the rest of the company into our RLHF process, ultimately producing the models used by hundreds of millions of users.

About the Role

We are looking for research scientists and research engineers to advance the capabilities of large language and multimodal models. This work includes the following areas:

  • Multimodal product research, such as building video + speech → speech model capabilities and training smaller models with state-of-the-art multimodal capabilities.

  • Multilingual research, advancing intelligence and cultural relevance for non-English languages.

  • Developing a broad set of tool-use capabilities on mobile and desktop.

  • Building a data flywheel to improve model capabilities.

  • Conducting research to identify new post-training methods and collaborating with our applied organization to enable our customers to optimize their own models.

You might thrive in this role if you:

  • Have a deep understanding of machine learning and machine learning applications.

  • Have working knowledge of, and experience with, tuning large language models (including multimodal models) and building evaluations.

  • Are willing to dive into large ML codebases to debug.

  • Thrive in a dynamic and technically complex environment.

  • Have a track record of delivering novel, outside-the-box solutions to real-world constraints.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status.

Compensation: $360K + Offers Equity


  • San Francisco, United States OpenAI Full time

    Research Engineer, Post-training Instruction Following Post-training - San Francisco About the Team Our post-training team are the chefs behind GPT-4 and o1-preview, cooking up the raw ingredients of base models into something nutritious, tasty, and non-toxic for consumers. If you care about impact, this could be a good team for you. Your daily work will push...

  • Research Scientist

    4 weeks ago


    San Francisco, United States Genmo Inc. Full time

    We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation. Role overview: We are seeking an exceptional Research Scientist to join our team, focusing on alignment and...


  • San Francisco, United States OpenAI Full time

    About the Team Our post-training team are the chefs behind GPT-4 and o1-preview, cooking up the raw ingredients of base models into something nutritious, tasty, and non-toxic for consumers. If you care about impact, this could be a good team for you. Your daily work will push the leading edge of AI and make a real difference to hundreds of millions of...


  • San Francisco, United States OpenAI Full time

    About the Team Our post-training team are the chefs behind GPT-4 and o1-preview, cooking up the raw ingredients of base models into something nutritious, tasty, and non-toxic for consumers. If you care about impact, this could be a good team for you. Your daily work will involve pushing the leading edge of technology and making a real difference to...


  • San Francisco, California, United States Perplexity AI Full time

    At Perplexity AI, we're on a mission to revolutionize the way people interact with information. About Us We've experienced rapid growth and adoption since launching our conversational answer engine, amassing 10 million monthly active users and serving over 500 million queries worldwide. We've secured significant funding from top investors, including IVP,...


  • San Francisco, United States OpenAI Full time

    Research Engineer, Pre-training Architecture | OpenAI Foundations - San Francisco About the Team The architecture team is responsible for advancing the neural network architecture of OpenAI’s flagship language models. Our work spans the entire spectrum of architecture development and deployment all the way from the...

  • AI Research Engineer

    4 weeks ago


    San Francisco, United States Perplexity AI Full time

    Job Description Perplexity is seeking experienced AI Research Engineers and Scientists to continue to improve our in-house Online LLMs, the Sonar models. Your job is to take advantage of our rich query/answer dataset to continue to scale our Sonar model performance and provide the SOTA Online LLM experience to our...


  • San Francisco, United States Databricks Full time

    Company Description Founded in late 2020 by a small group of machine learning researchers, Mosaic AI enables companies to create state-of-the-art AI models from scratch on their own data. From a business perspective, Mosaic AI is committed to the belief that a company’s AI models are just as valuable as any other core IP, and that high-quality AI models...

  • Research Engineer

    10 hours ago


    San Francisco, CA, United States Magic AI Full time

    Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL,...


  • San Francisco, United States OpenAI Full time

    About the Team The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference. Our priorities are to maximize training throughput (how quickly we can train a new model)...


  • San Francisco, United States OpenAI Full time

    About the Team The Alignment team at OpenAI is dedicated to ensuring that our AI systems are safe, trustworthy, and consistently aligned with human values, even as they scale in complexity and capability. Our work is at the cutting edge of AI research, focusing on developing methodologies that enable AI to robustly follow human intent across a wide range of...


  • San Francisco, United States OpenAI Full time

    OpenAI's Training team is responsible for producing the large language models that power our research, our products, and ultimately bring us closer to AGI. Achieving this goal requires combining deep research into improving our current architecture, datasets and optimization techniques, along with meaningful innovation in the software systems underlying our...


  • San Francisco, United States OpenAI Full time

    About the Team The Sora team is working on making video a key capability of OpenAI's foundation models. We are a hybrid research and product team that seeks to understand and expand the capabilities of our video models, while ensuring their reliability and safety. We accomplish this both through directly studying and experimenting with the models, as well as...


  • San Francisco, United States Scale AI, Inc. Full time

    Scale works with the industry's leading foundation model labs to provide high quality data and accelerate progress in machine learning research. As a Machine Learning Research Engineer, you will design next generation data pipelines and supervision strategies in close collaboration with our customers to accelerate progress in Generative AI. The ideal...


  • San Francisco, United States Resource Informatics Group Full time

    Role: Principal Engineer/Researcher Location: Mountain View, CA Duration: Long Term Description: PhD in Computer Science, Math, Statistics or related field. Hands-on experience in R&D with proven credentials in patents and publications - 5 years. Strong familiarity with end-to-end data services/domains and technologies. PoC and benchmarking experience in AI or data...


  • San Francisco, United States OpenAI Full time

    About the Team The Platform ML team builds the ML side of our state-of-the-art internal training framework used to train our cutting-edge models. We work on distributed model execution as well as the interfaces and implementation for model code, training, and inference. Our priorities are to maximize training throughput (how quickly we can train a new model)...


  • San Francisco, United States OpenAI Full time

    About the Team The ChatGPT RLHF team is a specialized subteam within the Post-Training organization, focused on aligning ChatGPT models with user needs through Reinforcement Learning with Human Feedback (RLHF) and related approaches. Our mission is to make ChatGPT more helpful and personalized for users, creating a better experience by learning from...