MTS: Data Acquisition

3 weeks ago


San Francisco, United States essential AI Full time
Job Description

Essential AI’s mission is to deepen the partnership between humans and computers, unlocking collaborative capabilities that far exceed what could be achieved today. We believe that building delightful end-user experiences requires innovating across the stack - from the UX all the way down to models that achieve the best user value per FLOP.

We believe that a small, focused team of motivated individuals can create outsized breakthroughs. We are building a world-class, multidisciplinary team of people who are excited to solve hard real-world AI problems. We are well-capitalized and supported by March Capital and Thrive Capital, with participation from AMD, Franklin Venture Partners, Google, KB Investment, and NVIDIA.

The Role

As the Data Acquisition (Crawler) Engineer, you will design, develop, and maintain the web crawlers and data acquisition systems that collect, store, and process data from various sources. These systems must run efficiently and reliably to support our model training.

What you’ll be working on
  • Architect and build a large-scale distributed web crawler system.

  • Design and implement web crawlers and scrapers to automatically extract data from websites, handling challenges like dynamic content and scaling to large data volumes.

  • Develop data acquisition pipelines to ingest, transform, and store large volumes of data.

  • Develop a highly scalable system and optimize crawler performance.

  • Monitor and troubleshoot crawler activities to detect and resolve issues promptly.

  • Work closely with data infrastructure engineers and data researchers to improve the quality of the data.

What we are looking for
  • Previous large-scale web crawling experience is a must for this role.

  • Minimum of 5 years of experience in data-intensive applications and distributed systems.

  • Proficiency in high-performance programming languages such as Go, Rust, or C++.

  • Strong understanding of containerization and orchestration tools such as Docker and Kubernetes.

  • Experience building on GCP or AWS services.

  • Bonus: You have deep expertise working with headless browsers and Chrome DevTools Protocol.

  • Bonus: You are curious to learn and develop an understanding of how data sources and data quality affect LLM capabilities.

We encourage you to apply for this position even if you don’t meet all of the above requirements but want to spend time pushing on these techniques.

We are based in-person in SF. We offer relocation assistance to new employees.


  • MTS: Back End Data Engineer

    Found in: Appcast Linkedin GBL C2 - 3 weeks ago


    San Francisco, United States Acceler8 Talent Full time

    About the Company: Join a pioneering Google Brain spinout at the forefront of AI and ML technology, where human-computer collaboration is not just a concept but a reality. Our team is dedicated to revolutionizing user experiences by innovating at every level, from user interfaces down to the most efficient models. Our founders have already published one of...

  • MTS: Back End Data Engineer

    Found in: Appcast US C2 - 3 weeks ago


    San Francisco, United States Acceler8 Talent Full time

    About the Company: Join a pioneering Google Brain spinout at the forefront of AI and ML technology, where human-computer collaboration is not just a concept but a reality. Our team is dedicated to revolutionizing user experiences by innovating at every level, from user interfaces down to the most efficient models. Our founders have already published one of...


  • San Francisco, United States Hitachi Data Systems Full time

    * Residency in the Bay Area and complex solution selling experience required. The Team: We represent Hitachi Vantara to clients across various industries, establishing business relationships to understand customer challenges so that we can deliver profitable business for Hitachi products, services and solutions. We collaborate as a team and...


  • San Francisco, United States Eon Systems PBC Full time

    Eon collects large-scale neuroscientific data sets to train machine learning based brain emulations. We believe it is possible to scale this technology in a safe, secure and trustworthy manner in the next decade and empower humanity in unprecedented ways. Role: The data infrastructure engineer is responsible for the setup and maintenance of systems capable of...

  • Senior Software Engineer

    Found in: Appcast US C2 - 2 weeks ago


    San Francisco, United States Lumicity Full time

    You will collaborate with image reconstruction scientists to design, implement, and verify: Algorithms for image reconstruction, image segmentation, classification, denoising, and enhancement to improve image quality and diagnostic accuracy; Techniques in deep learning, signal processing, and optimization to address challenges in medical image reconstruction...

  • Senior Software Engineer

    Found in: Appcast Linkedin GBL C2 - 3 weeks ago


    San Francisco, United States Lumicity Full time

    You will collaborate with image reconstruction scientists to design, implement, and verify: Algorithms for image reconstruction, image segmentation, classification, denoising, and enhancement to improve image quality and diagnostic accuracy; Techniques in deep learning, signal processing, and optimization to address challenges in medical image reconstruction...

  • Senior Software Engineer

    Found in: Jooble US O C2 - 2 weeks ago


    San Francisco, CA, United States Lumicity Full time

    You will collaborate with image reconstruction scientists to design, implement, and verify: Algorithms for image reconstruction, image segmentation, classification, denoising, and enhancement to improve image quality and diagnostic accuracy; Techniques in deep learning, signal processing, and optimization to address challenges in medical image...

  • MTS: Data Researcher

    2 weeks ago


    San Francisco, United States essential AI Full time

    Job Description: Essential AI’s mission is to deepen the partnership between humans and computers, unlocking collaborative capabilities that far exceed what could be achieved today. We believe that building delightful end-user experiences requires innovating across the stack - from the UX all the way down to models that achieve the best user...

  • MTS: Applied Machine Learning Scientist

    Found in: Appcast US C2 - 2 weeks ago


    San Francisco, United States Acceler8 Talent Full time

    About the Company: Join a pioneering Google Brain spinout at the forefront of AI and ML technology, where human-computer collaboration is not just a concept but a reality. Our team is dedicated to revolutionizing user experiences by innovating at every level, from user interfaces down to the most efficient models. Our founders have already published one of...

  • MTS: Applied Machine Learning Scientist

    Found in: Appcast Linkedin GBL C2 - 3 weeks ago


    San Francisco, United States Acceler8 Talent Full time

    About the Company: Join a pioneering Google Brain spinout at the forefront of AI and ML technology, where human-computer collaboration is not just a concept but a reality. Our team is dedicated to revolutionizing user experiences by innovating at every level, from user interfaces down to the most efficient models. Our founders have already published one of...


  • San Francisco, United States Acceler8 Talent Full time

    About the Company: Join a pioneering Google Brain spinout at the forefront of AI and ML technology, where human-computer collaboration is not just a concept but a reality. Our team is dedicated to revolutionizing user experiences by innovating at every level, from user interfaces down to the most efficient models. Our founders have already published one of...

  • Senior Growth Partnership Manager, User Acquisition

    Found in: Talent US C2 - 2 weeks ago


    San Francisco, United States Unity Full time

    Role Description. The opportunity: At Unity, we shape the future of on-device monetization and services. Our team of Growth Partnership Managers is partnering with a diverse range of advertising partners (major app developers, game developers, and mobile agencies) to optimize and drive growth in their User Acquisition strategy. We are actively...

  • Data Analyst

    3 days ago


    San Francisco, United States Aquent Talent Full time

    We are seeking a skilled Data Analyst to join our Ecommerce Experience & Growth team, specializing in analyzing various metrics for our online business. As the data analyst for the team, you will delve into a broad range of data sources to derive actionable insights aimed at driving business growth. Your primary focus will be on understanding and optimizing...

  • Navy Acquisition Support Specialist

    Found in: Talent US C2 - 2 weeks ago


    San Diego, United States LinQuest Full time

    LinQuest is seeking a Navy Acquisition Support Specialist to join our team in San Diego, CA. US Citizenship, and the ability to obtain a clearance. LinQuest is seeking a qualified Mid-Level Acquisition Support Specialist with experience supporting DoD SATCOM acquisition to join our team. The successful candidate will be responsible for coordinating and...

  • Marketing Data Analyst

    Found in: Appcast Linkedin GBL C2 - 2 days ago


    San Francisco, United States Retail Apparel and Fashion Full time

    This is a 9-month contract, 32 hours per week. Pay rate: $33/hr on W2. What You’ll Do: • Analyze a diverse range of metrics across multiple platforms and channels to evaluate the performance of our online business. • Identify trends, patterns, and correlations within the data to uncover insights that inform strategic decision-making. • Develop brand new...

  • AI Data Coordinator

    Found in: beBee jobs US - 3 weeks ago


    San Francisco, California, United States Heretic Full time

    Overview of Role: Heretic's San Francisco-based stealth portfolio company, a consumer-facing AI startup, is seeking an entry-level coordinator to help us gather and organize a wide range of data to train our AI models. This is a contract role with the possibility of converting into a full-time role at some point in the future. This role requires good...

  • Data Analyst-III

    7 days ago


    San Francisco, United States Russell Tobin Full time

    Job Description: Title: Data Analyst II. Duration: 10 months. Location: Remote (PST). Pay: $50-$64/hr on W2, DOE. Description: We are looking for a data scientist to use a quantitative background to dive into large datasets to guide decision-making. We handle many exciting challenges including customer acquisition, brand marketing, growth marketing, campaign...

  • Senior Data Scientist

    Found in: beBee jobs US - 7 days ago


    San Francisco, California, United States Nextdoor Full time

    #TeamNextdoor Nextdoor is where you connect to the neighborhoods that matter to you so you can belong. Our purpose is to cultivate a kinder world where everyone has a neighborhood they can rely on. Neighbors around the world turn to Nextdoor daily to receive trusted information, give and get help, get things done, and build real-world connections with those...

  • Retail Talent Acquisition Specialist

    Found in: beBee jobs US - 2 weeks ago


    San Francisco, California, United States Richemont Full time

    Cartier North America is proud to employ talent from many different backgrounds, experiences, and identities. We believe that when diversity and inclusion are fully embraced and empowered, creativity and knowledge emerge to deliver excellence. We continue to work towards creating a workforce that represents the diversity of our clients and our communities. ...

  • Native GTM Data Analyst

    Found in: beBee S US - 3 weeks ago


    San Francisco, United States Procter & Gamble Full time

    Job Requirements: Analyze and report on key financial/eCommerce/Marketing metrics, and data trends, drawing practical business insights and recommending actions to drive business growth. Analyze marketing channels and customer journey with new attribution methodology to highlight trends and opportunities. Support A/B testing and experimentation methodology....