Research Engineer, Multimodal

3 days ago


Menlo Park CA United States Character.AI Full time
About the role

We’re looking for scrappy and self-motivated people who have full-stack machine learning skills: collecting data, training state-of-the-art models, building evaluations, writing efficient inference algorithms, and iterating on user feedback.

In the day-to-day, you will be responsible for developing new multimodal capabilities end-to-end. This means you will need to wear a lot of hats across the full ML stack. You should be comfortable thinking about all parts of the problem, and ready to work on any and all components of it.

Responsibilities
  • Determining the type of training data we need, finding where we can collect it, and writing distributed data gathering pipelines to ingest data

  • Developing new model architectures that push the state-of-the-art in terms of quality, scale, and inference speed

  • Creating new evaluations that capture different aspects of generative outputs

  • Writing fast inference algorithms to serve these models at scale

  • Working with product teams to integrate feedback mechanisms into the product, which we use to improve the model

  • Working with large scale image/audio datasets

Requirements
  • "All Industry Levels": preferably 2+ years of industry experience working deep in the weeds on hard ML problems.

    • Negative example: just stringing together a bunch of pre-existing components together. Need signal that this person can think critically about different parts of the pipeline

  • Have a deep understanding of the “whole stack” when it comes to designing, training, evaluating and deploying machine learning models, especially large language models.

    • Collected a new giant dataset

    • Published research papers

    • Played a critical role in shipping a new ML product that required custom components

    • Writing distributed ML infrastructure

    • Have debugged and fixed hard-to-find bugs in ML models

  • Have a track record of successfully owning projects from start to finish.

  • Have experience with generative models for various modalities.

  • Experience working with proven tools: ML frameworks (Tensorflow, PyTorch, Jax, …), data processing frameworks (Spark, Beam, …).

  • Experience working with diffusion models

About Character.AI

Founded in 2021, Character is a leading AI company offering personalized experiences through customizable AI 'Characters.' As one of the most widely used AI platforms worldwide, Character enables users to interact with AI tailored to their unique needs and preferences.

In just two years, we achieved unicorn status and were named Google Play's AI App of the Year – a testament to our groundbreaking technology and vision.

Ready to shape the future of Consumer AI?

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

#J-18808-Ljbffr

  • Menlo Park, California, United States META Full time

    Job DescriptionMETA is seeking a talented Multimodal Research Engineer to join our Llama Large Language Model (LLM) Research team. This role involves working on cutting-edge vision large language models and developing scalable data curation, model development, and evaluation systems.


  • Redmond, WA, United States Facebook Full time

    Summary: At Reality Labs Research (RL-R), our goal is to explore, innovate, and design novel interfaces and hardware subsystems for the next generation of virtual, augmented, and mixed reality experiences. We are driving research towards a vision of an always-on augmented reality device that can enable high-quality contextually relevant interactions across...


  • Menlo Park, United States META Full time

    Summary: Our team has released the Seamless Communication models at the end of 2023, the very first massively multilingual, streaming and expressive multimodal translation systems. We are looking for a Research Engineer, expert in speech generation to take these models to the next level by making them production ready.Overtime, this project will be...

  • Research Engineer

    4 days ago


    Menlo Park, United States Meta Inc Full time

    Summary: Meta Reality Labs is looking for a Research Engineer to help us unleash human potential by eliminating the bottlenecks between intent and action. To achieve this, we're building a practical neural interface drawing on the rich neuromotor signals that can be measured non-invasively using electromyography (EMG). Our research lies at the intersection...

  • Data Scientist

    1 week ago


    Menlo Park, California, United States Victoryoncology Full time

    At Victoryoncology, we are committed to advancing the field of artificial intelligence by making fundamental advances in technologies to help interact with and understand our world. Our team is seeking individuals passionate about areas such as Computer Vision, Audio and Speech Processing, Natural Language Processing, Machine Learning, Deep Learning, and...


  • Menlo Park, United States META Full time

    Summary: Meta is seeking a Research Engineer to join our Large Language Model (LLM) Research team. We conduct focused research and engineering to build state-of-the-art LLMs, which we often open-source, like our team’s recent Llama 2. We are looking for strong engineers who have a background in generative AI and NLP, with experience in areas like language...


  • Menlo Park, CA, United States Victoryoncology Full time

    Meta is seeking Research Interns to join Fundamental AI Research (FAIR) Multimodal Foundations teams. We are committed to advancing the field of artificial intelligence by making fundamental advances in technologies to help interact with and understand our world. We are seeking individuals passionate in areas such as Computer Vision, Audio and Speech...


  • Menlo Park, CA, United States Character.AI Full time

    Joining us as a Research Engineer on the ML Systems team, you’ll be working on cutting-edge ML training and inference systems, optimizing the performance and efficiency of our GPU clusters, and developing new technologies that fine-tune leading consumer AI models with a data flywheel, and serve 20K+ QPS in production with LLMs. Your work will directly...


  • Menlo Park, California, United States META Full time

    At Meta, we are looking for a visionary AI Research Scientist to join our Llama Large Language Model (LLM) Research team. As a key member of this team, you will be responsible for pushing the boundaries of multimodal reasoning and generation research. This is an exceptional opportunity to work on ambitious long-term goals while identifying intermediate...


  • Seattle, WA, United States Facebook Full time

    Summary: The GenAI org at Meta builds industry leading LLM and multimodal generative foundation models, which sets the industry benchmark of open source foundation models and enables many Meta products. The team is working on the industrial leading research on multimodal generative foundation models with a focus on the audio modality (including speech,...


  • San Francisco, CA, United States OpenAI Full time

    The Safety Systems team is responsible for various safety work to ensure our best models can be safely deployed to the real world to benefit society and is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency. The Safety Reasoning Research team is poised at...


  • Mountain View, CA, United States Samsung Research America Full time

    The Samsung AI Center Mountain View (SAIC-MV) leads at the forefront of innovation in providing the best on-device user interaction experience to Samsung users. The success of AI will depend on how well devices understand their users – and how well devices empower users. SAIC-MV takes on grand scientific and engineering challenges in machine intelligence...


  • Menlo Park, United States META Full time

    Summary: Meta is seeking a Research Engineer to join our Llama Large Language Model (LLM) Research team. We are looking for recognized experts in VLLMs; with experience in areas like vision encoders, data filtering/curation for pre and post-training, RLHF, responsible AI and model controllability. The ideal candidate will have an interest in producing and...


  • Mountain View, CA, United States Newsbreakdigest Full time

    Machine Learning Engineer, NLP and multimodal Mountain View, California, United States About NewsBreak NewsBreak is redefining the way users interact with local news and their communities. By bridging local users, local content creators, and local businesses, our mission is to foster safer, more vibrant, and authentically connected lives. Through robust...

  • Research Engineer

    2 weeks ago


    Menlo Park, United States Altera AI Full time

    Job Title: Research Engineer As a Research Engineer at Altera.AL, you will play a pivotal role in bringing our cutting-edge research to life. Your work will involve implementing and experimenting with the latest research techniques, and developing tools and infrastructure that streamline the transition of research into viable products. The ideal candidate...


  • Seattle, WA, United States Facebook Full time

    Summary: Meta was built to help people connect and share, and over the last decade our tools have played a critical part in changing how people around the world communicate with one another. With over a billion people using the service and more than fifty offices around the globe, a career at Meta offers countless ways to make an impact in a fast growing...

  • Research Engineer

    3 days ago


    San Francisco, CA, United States RI Research Instruments GmbH Full time

    You want to build large scale ML systems from the ground up. You care about making safe, steerable, trustworthy systems. As a Research Engineer, you'll touch all parts of our code and infrastructure, whether that's making the cluster more reliable for our big jobs, improving throughput and efficiency, running and designing scientific experiments, or...


  • Seattle, WA, United States Tencent Americas Full time

    We are seeking artificial general intelligence research interns who are interested in developing novel audio/speech/language processing techniques and large multimodal models for our Seattle area office located at Bellevue WA for the year 2025. Every research intern will work with researchers on a research project aimed at attacking one of the core problems...


  • Menlo Park, California, United States ANNEA GmbH Full time

    Meta, as part of ANNEA GmbH, is at the forefront of innovation, shaping how people connect and share. Our tools have played a pivotal role in revolutionizing global communication, with over a billion users and numerous offices worldwide. This presents a unique opportunity to make an impact in a rapidly growing organization.We are committed to advancing...


  • Menlo Park, United States META Full time

    Summary: Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to...