Senior Data Engineer

3 days ago


Bodega Bay, United States People Data Labs Full time
Job DescriptionJob Description

About Us

At People Data Labs, we're committed to democratizing access to high-quality B2B data and leading the emerging DaaS economy. We empower developers, engineers, and data scientists to create innovative, compliant data products at scale with our clean, easy-to-use datasets of resume, company, location, and education data consumed through our suite of APIs.

PDL is an innovative, fast-growing, global team backed by world-class investors, including Craft Ventures, Flex Capital, and Founders Fund. We scour the world for people hungry to improve, curious about how things work, and willing to challenge the status quo to build something new and better.

Roles & Responsibilities:

  • Build infrastructure for ingestion, transformation, and loading an exponentially increasing volume of data from a variety of sources using Spark, SQL, AWS, and Databricks
  • Building an organic entity resolution framework capable of correctly merging hundreds of billions of individual entities into a number of clean, consumable datasets.
  • Developing CI/CD pipelines and anomaly detection systems capable of continuously improving the quality of data we're pushing into production.
  • Devising solutions to largely-undefined data engineering and data science problems.
  • Work with stakeholders in Engineering and Product to assist with data-related technical issues and support their infrastructure needs

Technical Requirements

  • 5-7+ years industry experience with clear examples of strategic technical problem solving and implementation
  • Strong software development fundamentals.
  • Experience with Python
  • Expertise with Apache Spark (Java, Scala, and/or Python-based)
  • Experience with SQL
  • Experience building scalable data processing systems (e.g., cleaning, transformation) from the ground up.
  • Experience using developer-oriented data pipeline and workflow orchestration (e.g., Airflow (preferred), dbt, dagster or similar)
  • Knowledge of modern data design and storage patterns (e.g., incremental updating, partitioning and segmentation, rebuilds and backfills)
  • Experience working in Databricks (including delta live tables, data lakehouse patterns, etc.)
  • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar)
  • Experience with data warehousing (e.g., Databricks, Snowflake, Redshift, BigQuery, or similar)
  • Understanding of modern data storage formats and tools (e.g., parquet, ORC, Avro, Delta Lake)

Professional Requirements

  • Must thrive in a fast paced environment and be able to work independently
  • Can work effectively remotely (able to be proactive about managing blockers, proactive on reaching out and asking questions, and participating in team activities)
  • Strong written communication skills on Slack/Chat and in documents
  • You are experienced in writing data design docs (pipeline design, dataflow, schema design)
  • You can scope and breakdown projects, communicate and collaborate progress and blockers effectively with your manager, team, and stakeholders

Nice To Haves:

  • Degree in a quantitative discipline such as computer science, mathematics, statistics, or engineering
  • Experience working with entity data (entity resolution / record linkage)
  • Experience working with data acquisition / data integration
  • Expertise with Python and the Python data stack (e.g., numpy, pandas)
  • Experience with streaming platforms (e.g., Kafka)
  • Experience evaluating data quality and maintaining consistently high data standards across new feature releases (e.g., consistency, accuracy, validity, completeness)

Our Benefits

  • Stock
  • Competitive Salaries
  • Unlimited paid time off
  • Medical, dental, & vision insurance
  • Health, fitness, and office stipends
  • The permanent ability to work wherever and however you want

Salary: $190K - $220K

No C2C, 1099, or Contract-to-Hire. Recruiters need not apply.

People Data Labs does not discriminate on the basis of race, sex, color, religion, age, national origin, marital status, disability, veteran status, genetic information, sexual orientation, gender identity or any other reason prohibited by law in provision of employment opportunities and benefits.

Qualified Applicants with arrest or conviction records will be considered for Employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.

Personal Privacy Policy for California Residents
https://privacy.peopledatalabs.com/policies?name=personnel-privacy-policy



  • Bodega, California, United States People Data Labs Full time

    About the RoleWe are seeking a seasoned Data Engineer to lead our team in developing innovative, scalable data processing systems. With your expertise in Apache Spark and cloud computing services, you will build and maintain complex data pipelines that enable us to provide high-quality B2B data solutions.Key ResponsibilitiesDesign and implement data...


  • Bodega, California, United States People Data Labs Full time

    We are looking for a skilled Data Engineering Manager to lead our Web and Customer applications team at People Data Labs. As a key member of our engineering leadership team, you will be responsible for managing a team of Front End and Full Stack software engineers, driving projects and initiatives through the team, and working closely with product and design...


  • Bodega Bay, United States People Data Labs Full time

    Job DescriptionJob DescriptionPeople Data Labs (PDL) is the provider of people and company data. We do the heavy lifting of data collection and standardization so our customers can focus on building and scaling innovative, compliant data solutions. Our sole focus is on building the best data available by integrating thousands of compliantly sourced datasets...


  • Bodega Bay, United States Abnormal Security Full time

    Job DescriptionJob DescriptionAbout The RoleEnterprises of all sizes trust Abnormal Security's cloud products to stop cybercrime. These products are data intensive SaaS applications that depend on reliable, scalable, and secure access to data. This is where our Data Storage Platform team fits in, offering scalable storage systems (Postgresql, OpenSearch,...

  • Head of Data Science

    1 month ago


    Bodega Bay, United States Nextdoor Full time

    Job DescriptionJob Description#TeamNextdoorNextdoor is where you connect to the neighborhoods that matter to you so you can belong. Our purpose is to cultivate a kinder world where everyone has a neighborhood they can rely on.Neighbors around the world turn to Nextdoor daily to receive trusted information, give and get help, get things done, and build...


  • Bodega Bay, United States MindsDB Full time

    Job DescriptionJob DescriptionABOUT USMindsDB is a fast-growing AI startup headquartered in San Francisco, California. As a leading innovator bringing AI and Data together, our passion is empowering companies to easily build AI capabilities that can Think, Understand and Orchestrate: enabling teams to move from prototyping & experimentation to production in...


  • Bodega Bay, United States Guidewheel Full time

    Job DescriptionJob DescriptionAbout UsAt Guidewheel, we're revolutionizing the manufacturing industry with our cutting-edge platform that seamlessly integrates into manufacturing workflows. Our mission is to collect and leverage proprietary data to drive efficiency and innovation. Our team is composed of industry, software, and AI experts dedicated to...

  • Software Engineer

    1 month ago


    Bodega Bay, United States Avela Full time

    Job DescriptionJob DescriptionAvela is a Nobel Prize winning platform for families to navigate their child's educational journey. Parents can find, apply, register, and pay for school and programs for their children, all from a common application system with saved profiles. Avela also powers backend admissions and operational workflows, making it easy...


  • Curtis Bay, Maryland, United States Pantheon Data Full time

    Job OverviewPantheon Data, a private company based in the Washington, DC area, is seeking a highly skilled Marine Mechanical Engineer to join our team. As a valued member of our organization, you will have the opportunity to work on exciting projects and contribute to the success of our clients.About the RoleWe are looking for an experienced Mechanical...

  • Senior UX Engineer

    3 days ago


    Bodega Bay, United States Figure Full time

    Job DescriptionJob DescriptionAbout FigureFigure is revolutionizing financial services with its disruptive technology platform. Our flagship product is the #1 non-bank HELOC in America. We're delivering new consumer lending products and a capital markets ecosystem that maximize efficiency and transparency – by capitalizing on our loan origination...


  • Bodega Bay, United States Taskrabbit Full time

    Job DescriptionJob DescriptionAbout Taskrabbit:Taskrabbit is a marketplace platform that conveniently connects people with Taskers to handle everyday home to-do's, such as furniture assembly, handyman work, moving help, and much more.At Taskrabbit, we want to transform lives one task at a time. As a company we celebrate innovation, inclusion and hard...


  • Bodega, California, United States Abnormal Security Full time

    Company Overview:Abnormal Security is a pioneering cybersecurity firm that protects its clients from evolving threats. Its innovative behavioral-based approach has earned the company recognition as one of the top cybersecurity startups, with a robust AI system trusted to safeguard over 8% of the Fortune 1000.About the Role:We are seeking an experienced...


  • Bodega Bay, United States Abnormal Security Full time

    Job DescriptionJob DescriptionAbout the RoleAbnormal Security is looking for a Senior Software Engineer to join the Message Detection - Attack Detection team. At Abnormal, we protect our customers against nefarious adversaries who are constantly evolving their techniques and tactics to outwit and undermine the traditional approaches to Security. That's...


  • Bodega, California, United States People Data Labs Full time

    At People Data Labs, we're looking for a skilled Full Stack Engineering Manager to lead our Web and Customer applications team. This role involves managing a team of Front End and Full Stack software engineers, working with product and design counterparts to drive the customer experience of peopledatalabs.com and the applications our customers use to...


  • Bodega Bay, United States Linden Lab Full time

    Job DescriptionJob DescriptionCompany SnapshotFounded in 1999, Linden Lab develops platforms that empower people to create, share, and thrive within virtual experiences. In 2003, Linden Lab first launched Second Life, the groundbreaking virtual world enjoyed by millions around the globe, which has since gone on to boast nearly two billion user creations and...


  • Bodega, California, United States Abnormal Security Full time

    About Abnormal SecurityWe are a leading provider of cloud-based security solutions, trusted by enterprises of all sizes to stop cybercrime. Our products are data-intensive SaaS applications that rely on reliable, scalable, and secure access to data.The RoleWe are looking for a highly skilled Data Platform Architect to spearhead key initiatives owned by our...


  • Bodega Bay, United States Baton (A Ryder Technology Lab) Full time

    Job DescriptionJob DescriptionWho We AreBaton is seeking ambitious individuals who desire the autonomy and agility of a startup environment combined with the backing, power, reach, and stability of a highly respected logistics industry giant.Baton is the Silicon Valley-based technology innovation lab for Ryder, a leading logistics company that owns 260k...


  • Bodega Bay, United States Linden Lab Full time

    Job DescriptionJob DescriptionCompany SnapshotFounded in 1999, Linden Lab develops platforms that empower people to create, share, and thrive within virtual experiences. In 2003, Linden Lab first launched Second Life, the groundbreaking virtual world enjoyed by millions around the globe, which has since gone on to boast nearly two billion user creations and...


  • Bodega, California, United States People Data Labs Full time

    Job OverviewAt People Data Labs, we're driven by a singular focus on delivering exceptional customer experiences. We're seeking a seasoned Full Stack Engineering Manager to lead our Web and Customer Applications team in driving innovation and growth.Key ResponsibilitiesManage a high-performing team of 4-8 Front End and Full Stack software engineers,...


  • Bodega, California, United States Avela Full time

    About AvelaAvela is a pioneering edtech startup that has revolutionized the educational journey for families, making it easier to navigate and find the right opportunities for their children. We're not just an application platform; we're a game-changer in the education industry.Job OverviewWe're seeking an experienced Full Stack Software Engineer to join our...