Research Engineer, Media Data Research
2 weeks ago
Meta is seeking AI research engineers to help us build the data foundation for Meta's most advanced Large Language and Media Models. We're looking for engineers with LLM/LMM expertise to join us on working with data at scale and to push beyond the data ceiling. Our team contributes to data curation across all stages of LLM/LMM development (pre-training, mid-training, post-training) and all domains/modalities (image, video, agent, media perception and generation). We are tackling complex challenges at trillion-scale, including organic data curation, synthetic data generation, agent and interaction data, and frontier paradigms that redefine what is possible. Based in Meta Superintelligence Labs (MSL) within the Fundamental AI Research Organization (FAIR), you'll directly contribute to Meta's frontier models like Llama, while having the chance to collaborate with researchers and engineers across MSL.
Responsibilities
Collaborate with cross-functional teams to develop Meta's next foundational models
• Architect efficient and scalable data curation systems and pipelines
• Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
• Execute on high priority projects in pre-training, mid-training, or post-training data curation
• Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
• Lead complex technical projects end-to-end
Minimum Qualifications
• Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
• 2+ years of industry research experience in LLM/NLP, computer vision, or related AI/ML models
• Experience as a formal technical lead, leading major technical initiatives with cross-functional impact, and/or influencing strategy across multiple teams
• Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
• Demonstrated data infrastructure and software background, and experience building data tooling and services
• Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Preferred Qualifications
• Experience working on frontier-quality/ state-of-the-art Large Language or Large Media Models
• Masters degree or PhD in Computer Science or a related technical field
• Programming experience in Python and hands-on experience with frameworks like PyTorch or Spark, or related distributed computing frameworks (Ray, DataFlow)
• Familiarity with SQL and file formats, such as Hive, Iceberg, Parquet, etc
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.
Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
-
Research Engineer, Media Data Research
1 week ago
Menlo Park, CA, United States Meta Inc Full timeMeta is seeking AI research engineers to help us build the data foundation for Meta's most advanced Large Language and Media Models. We're looking for engineers with LLM/LMM expertise to join us on working with data at scale and to push beyond the data ceiling. Our team contributes to data curation across all stages of LLM/LMM development (pre-training,...
-
PhD Research Intern
4 days ago
Menlo Park, CA, United States Fundamental Research Labs Full timeAbout the Role We're looking for smart, motivated, and curious PhD students to join us next summer or earlier as Research Scientist Interns. This is a unique opportunity to explore new research directions in AI, reinforcement learning, and multi-agent systems while working alongside a small, elite team of researchers and engineers. You will not be...
-
Research Engineer, Text Data Research
2 weeks ago
Menlo Park, CA, United States META Full timeMeta is seeking AI research engineers to help us build the data foundation for Meta's most advanced Large Language Models. We're looking for engineers with LLM expertise to join us on working with data at scale and to push beyond the data ceiling. Our team contributes to data curation across all stages of LLM development (pre-training, mid-training,...
-
Research Engineer, Text Data Research
2 weeks ago
Menlo Park, CA, United States META Full timeMeta is seeking AI research engineers to help us build the data foundation for Meta's most advanced Large Language Models. We're looking for engineers with LLM expertise to join us on working with data at scale and to push beyond the data ceiling. Our team contributes to data curation across all stages of LLM development (pre-training, mid-training,...
-
Research Engineer, Text Data Research
6 days ago
Menlo Park, CA, United States META Full timeMeta is seeking AI research engineers to help us build the data foundation for Meta's most advanced Large Language Models. We're looking for engineers with LLM expertise to join us on working with data at scale and to push beyond the data ceiling. Our team contributes to data curation across all stages of LLM development (pre-training, mid-training,...
-
Research Engineer, Text Data Research
2 days ago
Menlo Park, CA, United States META Full timeMeta is seeking AI research engineers to help us build the data foundation for Meta's most advanced Large Language Models. We're looking for engineers with LLM expertise to join us on working with data at scale and to push beyond the data ceiling. Our team contributes to data curation across all stages of LLM development (pre-training, mid-training,...
-
Research Engineer, Language
1 week ago
Menlo Park, CA, United States META Full timeSummary: Meta is seeking a Research Engineer to join our Large Language Model (LLM) Research team. We conduct focused research and engineering to build state-of-the-art LLMs, which we often open-source, like our team's recent Llama 2. We are looking for strong engineers who have a background in generative AI and NLP, with experience in areas like language...
-
Research Engineer, Language
1 week ago
Menlo Park, CA, United States META Full timeSummary: Meta is seeking a Research Engineer to join our Large Language Model (LLM) Research team. We conduct focused research and engineering to build state-of-the-art LLMs, which we often open-source, like our team's recent Llama 2. We are looking for strong engineers who have a background in generative AI and NLP, with experience in areas like language...
-
Research Engineer, Lab Automation
6 days ago
Menlo Park, CA, United States Periodic Labs Full timeAbout Periodic LabsWe are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identify and solve problems without boundaries or bureaucracy. We eagerly learn new tools and new science to push forward our mission. What to Expect Join a...
-
Research Engineer, Lab Automation
1 week ago
Menlo Park, CA, United States Periodic Labs Full timeAbout Periodic LabsWe are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identify and solve problems without boundaries or bureaucracy. We eagerly learn new tools and new science to push forward our mission. What to Expect Join a...