Staff Data Engineer

4 weeks ago


San Francisco, United States Trunk.io Full time

At Trunk, we're on a mission to empower growing software organizations to deliver high-quality software quickly. We understand the challenges of merge conflicts, poor code quality or consistency, flaky tests, and other distractions that can drain productivity and morale. Our unique approach enables engineering teams to stay focused on designing, implementing, and delivering software, leading to the creation of magical, high-quality projects and happier teams.

Our journey began in 2021, with our founders leveraging their experience from some of the world's largest and fastest-growing tech companies: Uber, Google, YouTube, and Microsoft. In 2022, we reached a significant milestone by securing $25M in Series A funding led by Garry Tan at Initialized Capital (currently President of YC) and Peter Levine at a16z. This growth and recognition are a testament to our potential and the value we bring to the software development landscape.

We know the frustration of trying to deliver code while constantly being interrupted by slow CI, flaky tests, and fragile processes. At Trunk, we’re building the tools to bring the joy back to software development. We’re looking for entrepreneurial people who are passionate about solving these problems.

As a founding member of our Data Engineering team, you’ll leverage your technical expertise to build data pipelines for processing and storing the data generated by our customers' CI/CD and automated tests. You’ll also experiment with integrating AI models to drive analytics and insights for our customers. We're tackling challenging problems and need engineers who can operate well in ambiguity and develop great solutions.

As an engineering team, we thrive on our ability to move quickly and adapt as we learn. Quickly delivering value to customers and getting their feedback is critical to our success. Engineers will be able to work closely with customers to understand the nuances of their use cases. We value empathy, hard work, and collaboration.

Our data stack is constantly evolving, but it is built on the foundations of Python, PostgreSQL, Spark, TimescaleDB, AWS, Kubernetes, and AWS Glue.

What you'll do
  • Build fault-tolerant and scalable data pipelines
  • Design efficient data storage, collaborating with product engineers to create fast and reliable data-driven features
  • Debug, profile, and optimize distributed data-intensive applications to improve their latency, accuracy, resource consumption, and throughput
  • Design and build observability of data quality and accuracy
  • Integrate ML models like Llama to analyze data and create features
We're looking for
  • 10-12+ years of experience as a software engineer with a strong understanding of key concepts in distributed systems
  • 10-12+ years of experience in building and deploying data applications, with a track record of regularly shipping new features
  • Fluency in at least two of these languages: Java/Scala/Kotlin, Python, Go, Rust, or C++
  • Good understanding of and practical experience with partitioning, replication, map-reduce, indexing, and the CAP theorem
  • Experience with distributed storage systems (S3, HDFS, Hive, ClickHouse, Elastic, etc.), distributed processing engines (Spark, etc.), and message queues (Kafka, SQS, etc.)
  • Passion for building large-scale ML applications and improving software engineers' productivity
  • Understanding of key concepts in natural language processing, machine learning, or statistical analysis
  • (Nice to have) Some experience with the machine learning stack (pandas, PyTorch, NumPy, scikit-learn, transformers, etc.)
What we offer
  • Unlimited PTO
  • Competitive salary and equity
  • Work-life balance
  • Flexibility to be fully or partly remote
  • Up to $200/month stipend for coworking space for remote folks
  • Few meetings, so you can ship fast and focus on building
  • One Medical membership on us
  • Top-notch medical, dental, vision, short-term disability, long-term disability, and life insurance
  • All insurance is 100% company-paid ($0 premiums) for employees and highly subsidized for dependents
  • FSA, HSA with company contributions, and pre-tax commuter benefits
  • 401(k) plan
  • Paid parental leave (up to 12 weeks)
Our tech stack
  • Frontend: TypeScript, React, Redux, Next.js
  • Backend: TypeScript, Node, AWS, CDK, k8s, gRPC
  • Observability: Prometheus, Grafana, Kiali, Jaeger
  • CI/CD: GitHub Actions
  • CLI/Daemon/LSP: C++20, Bazel
  • VS Code Extension: TypeScript
  • General: GitHub, Slack, Linear, Slite

The salary and equity ranges for this role are $200K - $245K and 0.3% - 0.5%, respectively.

Please note that the compensation range provided is a general guideline only and is subject to change based on location, qualifications, and experience.
