Staff Data Engineer

4 days ago


Menlo Park, California, United States | Character | Full time
About Us

At Character, our mission is to empower individuals with Artificial General Intelligence (AGI). Our vision is to enable people to utilize our technology at any moment, every day.

We are a full-stack AI company with a globally scaled direct-to-consumer platform. As a leading personal AI platform, we are uniquely centered around people, allowing users to personalize their experience by interacting with AI 'Characters.'

We have achieved significant milestones, including being named Google Play's AI App of the Year. Our team is composed of AI pioneers, including Noam Shazeer, who co-invented the key technology powering Large Language Models (LLMs), and Daniel De Freitas, who created and led LaMDA, the breakthrough conversational tech project powering Bard.

About the Role

We are seeking an experienced Data Engineer to join our team. As a key member of our organization, you will be instrumental in building the world's best LLMs by collecting and refining the essential training data that powers them.

Your responsibilities will be twofold:

  • First, you will identify and collect data at the scale required to feed our largest models. This involves managing a diverse set of sources, including structured and unstructured content from text and multimedia formats. Your engineering expertise is crucial in crafting the infrastructure and tools necessary to efficiently collect and manage petabytes of data.
  • Second, you will experiment with various methods of extracting a balanced and comprehensive training dataset from the raw data. You will leverage your expertise in data to build datasets reflecting a hypothesis, train models, and evaluate experimental results. Through this experimentation, you will create the training datasets for our largest models.

These are critical steps in the construction of AI. With petabytes of data and numerous design decisions, each step requires careful attention. Expertise in AI is not necessary, but enthusiasm for the space and a track record of adapting to new domains are important.

Requirements

To be successful in this role, you will need:

  • 5+ years of production software engineering experience
  • Experience building large-scale data processing pipelines, with tools like PySpark, Beam, or Flink
  • Familiarity with Machine Learning and NLP and willingness to learn more on the job
  • Track record of adapting to new domains and a desire to use data to improve products

Additional desired experience includes:

  • ML experience as an ML engineer, Data Scientist, or another similar role
  • Experience with cloud platforms like AWS or Azure, or tools such as Kubernetes and Terraform
  • Passion for Conversational AI or large language models

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability, or any other legally protected status. We value diversity and encourage applicants from a range of backgrounds to apply.


