Staff Data Engineer

2 months ago


San Francisco, California, United States Pachama Full time

Who we are.

Pachama is a mission-driven company looking to restore nature to help address climate change. Pachama brings the latest technology in remote sensing and AI to the world of forest carbon in order to enable forest conservation and restoration to scale. Pachama's core technology harnesses satellite imaging with artificial intelligence to measure carbon captured in forests. Through the Pachama marketplace, responsible companies and individuals can connect with carbon credits from projects that are protecting and restoring forests worldwide.

We are backed by mission-aligned investors including Breakthrough Energy Ventures, Amazon Climate Fund, Chris Sacca, Saltwater Ventures, and Paul Graham.

Recent press:

Pachama is #1 most innovative AI company

Jeff Bezos' Last Shareholder Update

Pachama to monitor and manage Mercado Libre forest projects

We are looking for a Staff Data Engineer to lead the development of cutting-edge data systems backing our products for our mission to restore and protect the planet's forests. As a leader on the DMRV (Digital measurement, reporting, and verification) team, you will build, scale and deploy systems for ingesting, storing and computing the data powering our AI and Remote Sensing insights and are responsible for delivering those data insights to our customers to enable them to identify and originate the highest quality nature-based projects.

A typical day includes collaborating across engineering and science teams to understand new dataset ingest pathways for model or algorithm features, writing code to support efficient compute and scalable transformation and algorithms to unlock insights over project data, designing systems for easy data access and experimentation pathways, pair coding with other engineers to raise the standards and bar on our technical work, and roadmapping core improvements to our data, compute or measurement stack.

We're looking for engineers who find joy in the craft of building but live for seeing the end-to-end impact and want to rally engineers around them. Engineers who push forward initiatives by asking great questions, cutting through ambiguity, and organizing to win. Engineers who are relentlessly detail-oriented, methodical in their approach to understanding trade-offs, and place the highest emphasis on building and building quickly.

Location:

This role is remote. However, given the cross-functional communication responsibilities, it is preferred that you be within 3 hours of Pacific time.

What You Will Help Us With:

  • Impact: Empower our interdisciplinary team and customers to derive insights needed to originate high quality nature based projects from our multi-TB datasets by building the ingest pipelines, access and compute supporting our geospatial and remote sensing data powering our products.
  • Technical leadershipand innovation: for cross-functional projects as our data and compute pipelines are core platform assets used across teams. Connect product value across teams with the core design and technologies available to develop strategies and vision for the data systems we need to build and how we build them. You will work with teams to implement this vision.
  • Advocating for and mentoring on best practices: applied to our data pipelines and compute. Mentoring teammates to raise the bar across the engineering teams to enable step-level increases in efficiency.
  • Hands on contributions: coding the systems and tools that enable all engineering and science to produce high-quality insight for forest carbon projects and optimizing methods to run efficiently on large amounts of geospatial and remote sensing data.

Experience & Skills We're Looking For:

  • Experience leading larger cross-team engineering efforts
  • Experience with data engineering including ingest, storage, orchestration and compute at scale with an ability to apply these skills to new domains like forest science and remote sensing.
  • Strong software engineering practices and a background in Python programming, debugging/profiling, and version control and system design.
  • Distributed Compute - familiarity with distributed compute technologies and knowledge of distributed systems concepts (like CPU/GPU interactions/transfers, latency/throughput bottlenecks, pipelining/multiprocessing) Our tech stack includes Dask and Flyte deployed through Kubernetes and GCP.
  • Comfort with fast pace execution and rapid iteration startup environment. Excited by product impact.
  • Passion for environmental sustainability and a desire to make a meaningful impact on the planet.

Preferred (but not Required) Qualifications:

  • Owned and operated distributed compute system - you aren't just familiar with distributed workflows but have been responsible for deploying, scaling, overseeing and maintaining the infrastructure needed to run them.
  • Built Data pipelines and infra ML and Scientific applications- Have worked with ML and/or Science teams previously.
  • Geospatial - familiarity or willingness to get your hands dirty with raster and vector data, and nuances of geospatial data and common geospatial cloud-native data formats (geopackage, flatgeobuf, cloud-optimized geotiff). Our tech stack includes Zarr, Rasterio, Geopandas, and Xarray

Even if you don't meet all these requirements, we encourage you to apply if this job description excites you. We are looking for ambitious people to help make an impact on climate change. That purpose requires us to bring together a diverse set of people with different backgrounds, perspectives, and skills to create solutions that work for all.


  • Staff Data Engineer

    1 month ago


    San Francisco, California, United States SoFi Full time

    Employee Applicant Privacy Notice Who we are:Shape a brighter financial future with us.Together with our members, we're changing the way people think about and interact with personal finance.We're a next-generation financial services company and national bank using innovative, mobile-first technology to help our millions of members reach their goals. The...

  • Staff Data Engineer

    2 months ago


    San Francisco, California, United States Faire Full time

    About FaireFaire is an online wholesale marketplace built on the belief that the future is local — independent retailers around the globe are doing more revenue than Walmart and Amazon combined. At Faire, we're using the power of tech, data, and machine learning to connect this thriving community of entrepreneurs across the globe. Picture your favorite...


  • San Francisco, California, United States Northbeam Full time

    About UsNorthbeam is building the world's most advanced marketing intelligence platform for growth. Our attribution modeling technology and customizable dashboards provide our customers with a unified view of their e-commerce business data. The smartest brands in ecommerce trust Northbeam to accurately attribute their advertising spend, understand the entire...


  • San Francisco, California, United States Tbwa ChiatDay Inc Full time

    Job SummaryWe are seeking a highly skilled Staff Software Engineer to join our Data Infrastructure team at Tbwa Chiat/Day Inc. As a key member of our team, you will be responsible for designing and developing large-scale data solutions that drive business growth and innovation.Key ResponsibilitiesLead the design and development of our data infrastructure...

  • Staff Data Engineer

    2 weeks ago


    San Diego, California, United States Ascent Funding Full time

    Responsibilities:Expand and optimize data and data pipeline architecture as well as optimize data flow and collection for cross-functional teams to support the critical data needs of multiple teams, systems, and products. Create and maintain optimal data pipeline architecture. Assemble large, complex data sets that meet functional/non-functional business...

  • Senior Data Engineer

    1 month ago


    San Francisco, California, United States Unreal Gigs Full time

    Join us and be part of a dynamic team shaping the future of data engineering.If you're a seasoned engineer with expertise in data engineering technologies and a passion for building robust data systems, we want you on our team.Establish and enforce data governance policies and best practices to ensure data quality, security, and compliance with regulatory...

  • Senior Data Engineer

    2 months ago


    San Francisco, California, United States Unreal Gigs Full time

    Join us and be part of a dynamic team shaping the future of data engineering.If you're a seasoned engineer with expertise in data engineering technologies and a passion for building robust data systems, we want you on our team.Data Governance: Establish and enforce data governance policies and best practices to ensure data quality, security, and compliance...

  • Data Engineer

    5 months ago


    San Francisco, California, United States iTvorks Inc Full time

    Required Skills Data engineer main skills Good knowledge of Data warehousing with expertise in data modeling and ETL. Advanced skills in SQL and HQL Should be good at these core tech AWS/EMR/Spark Hadoop Hive Vertica Qlikview Tableau Should have exp in CI/CD


  • San Francisco, California, United States Smule Full time

    The Smule mission is to connect people all over the world through the joy of making music at massive social scale. Music is much more than just listening... It's about creating, sharing, discovering, participating, and connecting with people. It is a social network with the power to break down barriers, touch souls, and bring people together from all over...

  • Data Engineer

    2 months ago


    San Francisco, California, United States Nimble Robotics Full time

    About NimbleNimble is a robotics and AI company building end-to-end autonomous logistics to enable fast, efficient, and sustainable commerce. We're developing generalized robot intelligence and building general-purpose logistics robots, the first in the world capable of performing all core warehouse functions. Our mission is to empower and inspire mankind to...


  • San Francisco, California, United States BRIDGEMED SOLUTIONS INC Full time

    Freemind Solutions is seeking a Founding Data Engineer / Scientist to establish their data infrastructure from scratch. This role involves designing data pipelines, creating models and visualizations, and collaborating with various teams to integrate data insights. The position is located in the SF – Bay Area / Los Angeles and requires extensive data...

  • Senior Data Engineer

    3 months ago


    San Francisco, California, United States Gantri Full time

    CompanyGantri is the world's first digital manufacturer for creative lighting. We help independent designers, studios, and influencers to develop original, sustainably made lighting designs and sell directly to consumers. We manufacture and fulfill all orders on-demand using 3D printing from innovative plant-based materials.Since launching in 2017, we've...

  • Sr. Data Engineer

    3 months ago


    San Francisco, California, United States iTvorks Inc Full time

    Role Sr. Data Engineer (8+ years) Location SFO CA Duration Long TermExperience & Skills Extensive experience with Hadoop (or similar) Ecosystem (Map Reduce HDFS Hive Spark Pig HBase) Proficient in at least one of the SQL languages (MySQL PostgreSQL SqlServer Oracle) Good understanding of SQL Engine and able to conduct advanced performance tuning Strong...

  • Senior Data Engineer

    2 months ago


    San Francisco, California, United States Innovaccer Full time

    Position:Senior Data Engineer (multiple openings)Job Location:InnovAccer, Inc. 101 Mission Street, Suite 1950, San Francisco, CA allows for telecommuting)Job Duties:With a high level of independent decision-making capability and minimum supervision, the Senior Data Engineer will be responsible for performing the following duties:Defining the end-to-end data...


  • San Francisco, California, United States The Athletic Full time

    About UsThe Athletic is a direct-to-consumer digital sports media company committed to helping subscribers experience storytelling in a whole new way. Founded in 2016 and headquartered in San Francisco, The Athletic has more than 575 full-time employees and covers more than 250 professional sports and collegiate teams in the US, Canada and the UK. The...

  • Data Platform Engineer

    2 months ago


    San Francisco, California, United States Robust Intelligence Full time

    Robust Intelligence's mission is to eliminate AI Risk. As the world increasingly adopts AI into automated decision processes, we inherit great risk. Our flagship product is built to be integrated with existing AI systems to enumerate and eliminate risks caused by unintentional and intentional (adversarial) failure modes. With Generative AI becoming...


  • San Francisco, California, United States Autodesk Full time

    Job Requisition ID #24WD79318Position OverviewThe work we do at Autodesk touches nearly every person on the planet. By creating software tools for making buildings, machines, and even the latest movies, we influence and empower some of the most creative people in the world to solve problems that matter.As a Software Engineer at Autodesk Research, you will be...

  • Senior Data Engineer

    3 months ago


    San Francisco, California, United States Nightfall AI Full time

    Nightfall AI ) is the unified platform that prevents data leaks and enables secure collaboration by protecting sensitive data and controlling how it's shared. For decades, legacy data leak prevention (DLP) solutions have failed to adequately protect sensitive information. Traditional DLP is outdated, intrusive, and complex - it wasn't designed for today's...

  • Staff ML Engineer

    4 weeks ago


    San Francisco, California, United States Uber Full time

    About the RoleThere are many different types of users, opening the app in many contexts, and we need to match them to the many services and content we have available. Rider Experience drives and enables the critical trip booking funnel within the Rides app that makes up almost all of the trip transactions and contributes tremendously to business growth. We...


  • San Francisco, California, United States Fintool Full time

    About Fintool is an AI Equity Research Copilot for institutional investors. It's a LLM on top of financial documents, starting with SEC filings. Fintool is engineered to discover financial insights beyond the reach of timely human analysis or search software.We are on the fastest growing LLM vertical applications. Thousands of investors signed up for...