Staff Data Engineer

2 weeks ago


San Francisco, California, United States Pachama Full time

Who we are.

Pachama is a mission-driven company looking to restore nature to help address climate change. Pachama brings the latest technology in remote sensing and AI to the world of forest carbon in order to enable forest conservation and restoration to scale. Pachama's core technology harnesses satellite imaging with artificial intelligence to measure carbon captured in forests. Through the Pachama marketplace, responsible companies and individuals can connect with carbon credits from projects that are protecting and restoring forests worldwide.

We are backed by mission-aligned investors including Breakthrough Energy Ventures, Amazon Climate Fund, Chris Sacca, Saltwater Ventures, and Paul Graham.

Recent press:

Pachama is #1 most innovative AI company

Jeff Bezos' Last Shareholder Update

Pachama to monitor and manage Mercado Libre forest projects

We are looking for a Staff Data Engineer to lead development of cutting-edge data systems backing our products for our mission to restore and protect the planet's forests. As a leader on the DMRV (Digital measurement, reporting, and verification) team, you will build, scale and deploy systems for ingesting, storing and computing the data powering our AI and Remote Sensing insights which delivered it to our customers enable them to identify and originate the highest quality nature based projects.

A typical day includes collaborating across engineering and science teams to understand new dataset ingest pathways for model or algorithm features, writing code to support efficient compute and scalable transformation and algorithms to unlock insights over geospatial data, designing systems for easy data access and experimentation pathways, pair coding with other engineers to raise the standards and bar on our technical work, and roadmapping core improvements to our data, compute or measurement stack.

We're looking for engineers who find joy in the craft of building but live for seeing the end to end impact and want to rally engineers around them. Engineers who push forward initiatives by asking great questions, cutting through ambiguity, and organizing to win. Engineers who are relentlessly detail-oriented, methodical in their approach to understanding trade-offs, place the highest emphasis on building, and building quickly.

Location:

This role is remote. However, being within 3 hours of Pacific time is preferred for this role given cross-functional communication responsibilities.

What You Will Help Us With:

  • Impact: Empower our interdisciplinary team and customers to derive insights needed to originate high quality nature based projects from our multi-TB datasets by building the ingest pipelines, access and compute supporting our geospatial and remote sensing data powering our products.
  • Technical leadershipand innovation: for cross-functional projects as our data and compute pipelines are core platform assets used across teams. Connect product value across teams with the core design and technologies available to develop strategies and vision for the data systems we need to build and how we build them. You will work with teams to implement this vision.
  • Advocating for and mentoring on best practices: applied to our data pipelines and compute. Mentoring teammates to raise the bar across the engineering teams to enable step-level increases in efficiency.
  • Hands on contributions: coding the systems and tools that enable all engineering and science to produce high-quality insight for forest carbon projects and optimizing methods to run efficiently on large amounts of geospatial and remote sensing data.

Experience & Skills We're Looking For:

  • Experience leading larger cross-team engineering efforts
  • Experience with data engineering including ingest, storage, orchestration and compute at scale with an ability to apply these skills to new domains like forest science and remote sensing.
  • Strong software engineering practices and a background in Python programming, debugging/profiling, and version control and system design.
  • Distributed Compute - familiarity with distributed compute technologies and knowledge of distributed systems concepts (like CPU/GPU interactions/transfers, latency/throughput bottlenecks, pipelining/multiprocessing) Our tech stack includes Dask and Flyte
  • Comfort with fast pace execution and rapid iteration startup environment. Excited by product impact.
  • Passion for environmental sustainability and a desire to make a meaningful impact on the planet.

Preferred (but not Required) Qualifications:

  • Geospatial - familiarity with raster and vector data, nuances of geospatial data and common geospatial cloud-native data formats (geopackage, flatgeobuf, cloud-optimized geotiff). Our tech stack includes Zarr, Rasterio, Geopandas, and Xarray
  • Data for ML application- Have worked with ML teams previously.

Even if you don't meet all these requirements, we encourage you to apply if this job description excites you. We are looking for ambitious people to help make an impact on climate change. That purpose requires us to bring together a diverse set of people with different backgrounds, perspectives, and skills to create solutions that work for all.


  • Data Engineer

    4 weeks ago


    San Francisco, California, United States iTvorks Inc Full time

    Required Skills Data engineer main skills Good knowledge of Data warehousing with expertise in data modeling and ETL. Advanced skills in SQL and HQL Should be good at these core tech AWS/EMR/Spark Hadoop Hive Vertica Qlikview Tableau Should have exp in CI/CD

  • Sr. Data Engineer

    3 weeks ago


    San Francisco, California, United States iTvorks Inc Full time

    Role Sr. Data Engineer (8+ years) Location SFO CA Duration Long TermExperience & Skills Extensive experience with Hadoop (or similar) Ecosystem (Map Reduce HDFS Hive Spark Pig HBase) Proficient in at least one of the SQL languages (MySQL PostgreSQL SqlServer Oracle) Good understanding of SQL Engine and able to conduct advanced performance tuning Strong...

  • Senior Data Engineer

    4 weeks ago


    San Francisco, California, United States Scroll Capital Full time

    Lead Data Engineer - Python Expert$45k – $90k • 0.1% – 0.25%About Our CompanyWe are a VC backed Fintech Startup in the Bay Area.The RoleStaff/Lead Software EngineerThis is a core and very important role for the company. You'll have a lot of agency in architecting and implementing a corner store of modern financial infrastructure.You'll be enhancing and...

  • Senior Data Engineer

    4 weeks ago


    San Francisco, California, United States Nightfall AI Full time

    Nightfall makes safeguarding sensitive data for every application simple and seamless. Organizations, from startups to global brands, trust Nightfall's software platform and APIs to discover, classify, and protect sensitive data.We're looking for a Senior Data Engineer to enhance Nightfall's core data models and infrastructure supporting our real-time and...

  • Senior Data Engineer

    4 weeks ago


    San Francisco, California, United States Glow Full time

    About GlowAt Glow, we believe that insurance can provide peace of mind to small business owners so they can pursue their dreams. We are building a digital insurance platform that ensures small businesses have the right coverage for all their insurance needs at the lowest cost, not just when they purchase, but every year. We recently completed a $22M Series A...


  • San Francisco, California, United States Trace3, Inc Full time

    Our major field office locations include Denver, Indianapolis, Grand Rapids, Lexington, Los Angeles, Louisville, Texas, San Francisco.Juice - The "Stuff" it takes to be a Needle MoverWe look forward to the goal, mentallymapping outevery checkpoint on the pathway to success, and visualizing what the final destination looks and feels like.We're looking to add...


  • San Francisco, California, United States Databricks Full time

    While candidates in the listed locations are encouraged for this role, we are open to remote candidates in other locations.As a Data Infrastructure Engineer on the security data infrastructure team you will help build Lakehouse for Security organization. You will build reliable, large-scale, multi-geo data pipelines to support detecting threats(internal and...

  • Staff/Lead Engineer

    4 weeks ago


    San Francisco, California, United States Scroll Capital Full time

    About Our CompanyVC backed Fintech Startup in the Bay Area.The Role - Staff/Lead Software Engineer - LONG TERM CONTRACTThis is a core and very important role for the company. You'll have a lot of agency in architecting and implementing a corner store of modern financial infrastructure.You'll lay out the architecture and build out the initial version of the...


  • San Francisco, California, United States Everyday Agents Full time

    About WovenConsumer Travel is Ripe for DisruptionToday's travelers are seeking personalized, immersive, and authentic experiences that reflect their unique tastes and interests. They want to be inspired, to feel the thrill of discovery, and to seamlessly benefit from technology throughout their journeys. Yet, the current travel landscape is a maze of...


  • San Francisco, California, United States Everyday Agents Full time

    About WovenConsumer Travel is Ripe for DisruptionToday's travelers are seeking personalized, immersive, and authentic experiences that reflect their unique tastes and interests. They want to be inspired, to feel the thrill of discovery, and to seamlessly benefit from technology throughout their journeys. Yet, the current travel landscape is a maze of...


  • San Francisco, California, United States Woven Full time

    About WovenConsumer Travel is Ripe for DisruptionToday's travelers are seeking personalized, immersive, and authentic experiences that reflect their unique tastes and interests. They want to be inspired, to feel the thrill of discovery, and to seamlessly benefit from technology throughout their journeys. Yet, the current travel landscape is a maze of...


  • San Francisco, California, United States Apple Full time

    SummaryImagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Do you love taking on challenges that create a positive impact? If so, then we want to speak with youAs a member of the AIML Data EPM group, you will be responsible for Apple's ability to use data to power...


  • San Francisco, California, United States RIT Solutions Inc Full time

    Required Skills:Currently working within an enterprise-level storage environment Experienced in Pure Storage FlashArray block storage products and/or in Rubrik r6000 series data protection products Experienced in Brocade G series fabric switch products Capability to follow and deliver Product Development Life Cycle type documentation as part of the product...


  • San Francisco, California, United States Imbue Full time

    Summary: Imbue believe that high-quality data is the most important part of creating high-performance machine learning systems, regardless of whether they are simple classifiers or state-of-the-art reasoning agents. Unlike many other organizations, they view this work and this role as one of the most important at the company.In this role, you will work on...


  • San Francisco, California, United States Motion Recruitment Full time

    Our large crypto company is looking for a contract Senior Software Engineer. This is a remote contract position.Contract Duration: 3-MonthsRequired Skills & ExperienceCollaboration & Communication: Strong collaborative skills for working effectively with diverse stakeholders such as HR, IT, and end-users to ensure their needs are understood and met. Change...


  • San Francisco, California, United States Vartana Full time

    Vartana is the revenue acceleration platform for top-tier enterprise technology companies. We empower sales and finance to accelerate the deal cycle without compromising cash flow through our flexible payment options and collaborative sales closing platform. Our platform is trusted by leading enterprise companies, including Motive, Domo, Samsara, and many...


  • San Francisco, California, United States Primer Full time

    Primer exists to make the world a safer place. We do this by providing trusted decision-ready AI to the world's most critical organizations. Our software enables leaders, operators, and analysts to better understand the changing world around us in real time and make informed decisions when the stakes are high. Primer has offices in San Francisco, Pasadena,...


  • San Francisco, California, United States absolute Full time

    We're looking for a Data Analyst to join our Database team. The perfect candidate must have one year of experience found SQL and Excel. The day to day obligations for the Data Analyst prospect has validating big data sets producing reports with Microsoft Excel and operating SQL queries to help you solve issues with the database program. There'll be an...


  • San Francisco, California, United States Zitara Full time

    CompanyIn order to keep climate change to 1.5°C, we'll need 30% of global GDP (all of energy generation and transportation) to run on batteries by 2035.Zitara Technologies (YCombinator S20) builds predictive battery management software for transportation and energy customers with large deployments. Our customers operate >$100M deployments of batteries in...

  • Staff Data Scientist

    2 weeks ago


    San Jose, California, United States AiDash Full time

    Who is AiDash? AiDash is making critical infrastructure industries climate-resilient and sustainable with satellites and AI. Using our full-stack SaaS solutions, customers in electric, gas, and water utilities, transportation, and construction are transforming asset inspection and maintenance - and complying with biodiversity net gain mandates and carbon...