Staff Data Engineer

3 weeks ago


Menlo Park, United States CareerBuilder Full time

About us

Character's mission is to empower everyone with AGI. Our vision is to enable people with our technology so that they can use Character.AI any moment of any day.
Character.AI is one of the world's leading personal AI platforms. Founded in 2021 by AI pioneers Noam Shazeer and Daniel De Freitas, Character.AI is a full-stack AI company with a globally scaled direct-to-consumer platform. As of 2023 that platform was #2 in the space in user engagement. Character.AI is uniquely centered around people, letting users personalize their experience by interacting with AI Characters. The company achieved unicorn status in 2023 and was named Google Play's AI App of the Year.
Noam co-invented the key tech powering LLMs and was recently named to TIME100's Most Influential People in AI list. TIME called him one of the most important and impactful people of the space's past, present, and future. Daniel created and led LaMDA, the breakthrough conversational tech project currently powering Bard.
To learn more, please visit beta.character.ai.
About the role

You would be a great fit for this role if you are an experienced engineer who will be instrumental in building the world's best LLMs by collecting and refining the essential training data that powers them. In pursuit of the best language models, your responsibility is twofold:
First, identify and collect data at the scale required to feed our largest models. This involves managing a diverse set of sources, including structured and unstructured content from text and multimedia formats. Your engineering expertise is crucial in crafting the infrastructure and tools necessary to efficiently collect and manage petabytes of data.

Second, you will experiment with various methods of extracting a balanced and comprehensive training dataset from the raw data. You will leverage your expertise in data to build datasets reflecting a hypothesis, train models, and evaluate experimental results. Through this experimentation, you will create the training datasets for our largest models.

These are critical steps in the construction of AI. With petabytes of data and numerous design decisions, each step requires careful attention. Expertise in AI is not necessary, but enthusiasm for the space and a track record of adapting to new domains is important.
Who we're looking for

Required Experience:
5+ years of production software engineering experience

Experience building large-scale data processing pipelines, with tools like PySpark, Beam, or Flink

Familiarity with Machine Learning and NLP and willingness to learn more on the job

Track record of adapting to new domains and a desire to use data to improve products

Additional Desired Experience:
ML experience as an ML engineer, Data Scientist, or another similar role

Experience with cloud platforms like AWS or Azure, or tools such as Kubernetes and Terraform

Passion for Conversational AI or large language models

You will be a good fit if you are proactive and have a get-things-done mindset. Given our current pace of growth and load on our systems, most people have had a significant impact during their first week at the company.
Character is an equal opportunity employer and does not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or any other legally protected status. We value diversity and encourage applicants from a range of backgrounds to apply.



  • Data Engineer

    5 days ago


    Menlo Park, United States Vertisystem Full time

    Onsite and only W2 Candidate required. MUST BE FULLY KNOWLEDGEABLE IN PYTHON, SQL, AND ETL • Ability to work as part of a team, as well as work independently or with minimal direction. • Excellent written, presentation, and verbal communication skills. • Collaborate with data architects, modelers and IT team members on project goals. Summary: The main...

  • Senior Data Engineer

    2 weeks ago


    Menlo Park, United States Vertisystem Full time

    ONLY W2 role. No C2C. Local Candidate required. The main function of the Data Engineer is to develop, evaluate, test and maintain architectures and data solutions within our organization. The typical Data Engineer executes plans, policies, and practices that control, protect, deliver, and enhance the value of the organization’s data assets. Job...


  • Menlo Park, United States Vertisystem Full time

    ONLY W2 role. No C2C. Local Candidate required. The main function of the Data Engineer is to develop, evaluate, test and maintain architectures and data solutions within our organization. The typical Data Engineer executes plans, policies, and practices that control, protect, deliver, and enhance the value of the organization's data assets. Job Responsibilities:...

  • Data Engineer

    4 days ago


    Menlo Park, United States Spectraforce Technologies Full time

    Data Engineer Menlo Park, CA 12 months Job Description: Perform analysis of data extracted from testbeds / test campaigns. Writing/scripting queries to collect data across the testbeds and various test equipment (e.g, traffic generators, cellular), systems (e.g, linux systems) and other types (e.g, telecom, cellular, wifi) Data extracting...

  • Data Engineer

    2 days ago


    Menlo Park, United States SPECTRAFORCE Full time

    Data Engineer Menlo Park, CA 12 months Job Description: Perform analysis of data extracted from testbeds / test campaigns Writing/scripting queries to collect data across the testbeds and various test equipment (e.g, traffic generators, cellular), systems (e.g, linux systems) and other types (e.g, telecom, cellular, wifi) Data extracting results, observations,...

  • Data Engineer

    5 days ago


    Menlo Park, United States SPECTRAFORCE Full time

    Data Engineer Menlo Park, CA 12 months Job Description: Perform analysis of data extracted from testbeds / test campaigns Writing/scripting queries to collect data across the testbeds and various test equipment (e.g, traffic generators, cellular), systems (e.g, linux systems) and other types (e.g, telecom, cellular, wifi) Data extracting results, observations,...

  • Data Engineer

    7 days ago


    Menlo Park, United States Spectraforce Technologies Inc Full time

    Data Engineer Menlo Park, CA 12 months Job Description: ● Perform analysis of data extracted from testbeds / test campaigns. ● Writing/scripting queries to collect data across the testbeds and various test equipment (e.g, traffic generators, cellular), systems (e.g, linux systems) and other types (e.g, telecom, cellular, wifi)...

  • Data Engineer III

    2 weeks ago


    Menlo Park, United States Orangepeople Full time

    Welcome to the forefront of data innovation! The main function of the Data Engineer is to develop, evaluate, test, and maintain architectures and data solutions within our organization. The typical Data Engineer executes plans, policies, and practices that control, protect, deliver, and enhance the value of the organization's data assets. Join us as a Data...

  • Data Engineer

    4 days ago


    Menlo, United States Vertisystem Full time

    Onsite and only W2 Candidate required. MUST BE FULLY KNOWLEDGEABLE IN PYTHON, SQL, AND ETL • Ability to work as part of a team, as well as work independently or with minimal direction. • Excellent written, presentation, and verbal communication skills. • Collaborate with data architects, modelers and IT team members on project goals. Summary: The main...

  • Data Engineer

    4 days ago


    Menlo, United States SPECTRAFORCE Full time

    Data Engineer Menlo Park, CA 12 months Job Description: Perform analysis of data extracted from testbeds / test campaigns Writing/scripting queries to collect data across the testbeds and various test equipment (e.g, traffic generators, cellular), systems (e.g, linux systems) and other types (e.g, telecom, cellular, wifi) Data extracting results,...

  • Research Engineer

    4 weeks ago


    Menlo Park, United States Character.AI Full time

    About us: Character’s mission is to empower everyone with AGI. Our vision is to enable people with our technology so that they can use Character.AI any moment of any day. Character.AI is one of the world’s leading personal AI platforms. Founded in 2021 by AI pioneers Noam Shazeer and Daniel De Freitas, Character.AI is a full-stack AI company with a globally...


  • Menlo Park, United States Cerncourier Full time

    SLAC National Accelerator Laboratory seeks an Application Specific Integrated Circuit (ASIC) design engineer within the Integrated Circuits Department of the Instrumentation Division of the Technology Innovation Directorate. The IC department develops state-of-the-art, low-noise and low-power front-end Application Specific Integrated Circuits (ASICs) to...


  • Menlo Park, United States Yeah! Global Full time

    Job Description: Be a Backend Architect for the Future of AI Interaction! Calling all passionate backend engineers! Do you dream of building the systems that power groundbreaking AI interfaces? This is your chance to join a revolutionary team! We're searching for a Senior or Staff Level Backend Engineer to focus on building the infrastructure behind...


  • Menlo Park, United States SPECTRAFORCE Full time

    Title: Sr. Network Engineer - (InfiniBand expert)/ Data Center Engineer Location: On site in Menlo Park CA Duration: 12 months to start (will be longer term) Main responsibilities: · Network engineering - Designing networks, Configurations etc. · Be an expert in InfiniBand. Must have: · At least 5 YOE in Infiniband (OR Person can have heavy network Eng exp –...


  • Menlo Park, United States OSI Engineering Full time

    We’re looking for an experienced Software Engineer to be a key contributor in developing cloud-based services that will drive the future of the business. You will join our small and dynamic Cloud Services team, using the latest technology and tools to build high-quality, cross-platform solutions that delight our customers. Responsibilities: Staff Software...

