Staff Data Engineer

1 day ago


San Francisco CA United States Trunk.io Full time

At Trunk, we're on a mission to empower growing software organizations to deliver high-quality software quickly. We understand the challenges of merge conflicts, poor code quality or consistency, flaky tests, and other distractions that can drain productivity and morale. Our unique approach enables engineering teams to stay focused on designing, implementing, and delivering software, leading to the creation of magical, high-quality projects and happier teams.

Our journey began in 2021, with our founders leveraging their experience from some of the world's largest and fastest-growing tech companies - Uber, Google, YouTube, and Microsoft. In 2022, we achieved a significant milestone by securing a $25M Series A funding led by Garry Tan at Initialized Capital (currently President of YC) and Peter Levine at a16z. This growth and recognition are a testament to our potential and the value we bring to the software development landscape.

We know the frustration of trying to deliver code while constantly being interrupted by slow CI, flaky tests, and fragile processes. At Trunk, we’re building the tools to bring the joy back to software development. We’re looking for entrepreneurial people who are passionate about solving these problems.

As a founding member of our Data Engineering team, you’ll leverage your technical expertise to build data pipelines for processing and storing the data generated by our customer's CI/CD and automated tests. You’ll also experiment with integrating AI models to drive analytics and insights for our customers. We're tackling challenging problems and need engineers who can operate well in ambiguity and develop great solutions.

As an engineering team, we thrive on our ability to move quickly and adapt as we learn. Quickly delivering value to customers and getting their feedback is critical to our success. Engineers will be able to work closely with customers to understand the nuances of their use cases. We value empathy, hard work, and collaboration.

Our data stack is constantly evolving, but built on the foundations of Python, PostgreSQL, Spark, TimescaleDB, AWS, Kubernetes, and AWS Glue.

What you'll do
  • Build fault-tolerant and scalable data pipelines
  • Design efficient data storage, collaborating with product engineers to create fast and reliable data-driven features
  • Debug, profile, and optimize distributed data-intensive applications to improve their latency, accuracy, resource consumption, and throughput
  • Design and build observability of data quality and accuracy
  • Integrate ML models like Llama to analyze data and create features
We're looking for
  • 10-12+ years of experience as a software engineer with a strong understanding of key concepts in distributed systems
  • 10-12+ years of experience in building and deploying data applications, with a track record of regularly shipping new features
  • Fluency in at least two of these languages: Java/Scala/Kotlin, Python, Go, Rust, or C++
  • Good understanding and practical experience with partitioning, replication, map-reduce, indexing, and CAP theorem
  • Experience with distributed storage systems (S3, HDFS, Hive, ClickHouse, Elastic, etc), distributed processing engines (Spark, etc), and message queues (Kafka, SQS, etc)
  • Passion for building large-scale ML applications and improving software engineers' productivity
  • Understanding of key concepts in natural language processing, machine learning, or statistical analysis

(Nice to have) Some experience with machine learning stack (pandas, PyTorch, numpy, sci-kit, transformers, etc)

What we offer
  • Unlimited PTO
  • Competitive salary and equity
  • Work-life balance
  • Flexibility to be fully or partly remote
  • Up to $200/month stipend for coworking space for remote folks
  • Few meetings, so you can ship fast and focus on building
  • One Medical membership on us
  • Top-notch medical, dental, vision, short-term disability, long-term disability, and life insurance
  • All insurance is 100% company-paid ($0 premiums) for employees and highly subsidized for dependents
  • FSA, HSA with company contributions, and pre-tax commuter benefits
  • 401(k) plan
  • Paid parental leave (up to 12 weeks)
Our tech stack
  • Frontend: Typescript, React, Redux, Next.js
  • Backend: Typescript, Node, AWS, CDK, k8s, gRPC
  • Observability: Prometheus, Grafana, Kiali, Jaeger
  • CI/CD: GitHub Actions
  • CLI/Daemon/LSP: C++20, Bazel
  • VSCode Extension: Typescript
  • General: GitHub, Slack, Linear, Slite

The salary and equity range for this role are: $200K - $245K and .3% - .5%.

Please note that the compensation range provided is a general guideline only and is subject to change based on location, qualifications, and experience.

#J-18808-Ljbffr

  • Atlanta, GA, United States Data Engineer Jobs Full time

    *Please note: This role is not eligible for 100% remote work. Employees must live within a commutable distance of the Atlanta Area and must be willing to be onsite at the client and/or Slalom Atlanta office up to 5 days a week.* Who You'll Work With As a modern technology company, our Slalom Technologists are disrupting the market and bringing to life the...


  • San Francisco, United States Data Masked Full time

    This range is provided by Harnham. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $200,000.00/yr - $250,000.00/yr Additional compensation types Annual Bonus and RSUs Data Science Focused Recruitment Consultant Title: Staff Data Scientist Location: Company based in SF, CA - willing to...


  • San Francisco, CA, United States DoorDash USA Full time

    About the Team Data is at the foundation of DoorDash's success. The Data Engineering team builds database solutions for various use cases including reporting, product analytics, marketing optimization, and financial reporting. The team serves as the foundation for decision-making at DoorDash. About the Role DoorDash is looking for a Staff Software Engineer,...


  • San Francisco, CA, United States Stars Group Full time

    Responsibilities: Leadership & Strategy: Lead and mentor a team of data engineers, driving best practices in data engineering and ensuring alignment with the overall data strategy. Architecture & Development: Architect and develop highly scalable data pipelines and infrastructure using Apache Kafka, Apache Spark or Databricks, Python, Scala, and other...

  • Staff Data Engineer

    2 days ago


    San Francisco, CA, United States Faire Full time

    Staff Software Engineer - Data Platforms Faire is an online wholesale marketplace built on the belief that the future is local — independent retailers around the globe are doing more revenue than Walmart and Amazon combined, but individually, they are small compared to these massive entities. At Faire, we're using the power of tech, data, and machine...


  • San Francisco, CA, United States ADVANCED ENGINEERING GROUP PC Full time

    About the Role: Demandbase is seeking creative, highly motivated, enthusiastic engineer individuals to be part of our product development team. You will work in a fast paced agile environment to build and deliver key features of the application. This is a great opportunity to work with a talented, high energy and creative team focused on building a...


  • San Francisco, CA, United States DoorDash USA Full time

    About the Team DoorDash is a data driven organization and relies on timely, accurate and reliable data to drive many business and product decisions. The Data Platform owns all the infrastructure necessary to run an operationally efficient analytical data stack. The Core Data part of this includes data ingestion (batch and real time), data compute &...

  • Data Engineer

    1 day ago


    San Francisco, CA, United States Abridge AI Inc. Full time

    Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients. Our enterprise-grade technology transforms patient-clinician conversations...


  • San Francisco, CA, United States Ellation, Inc. Full time

    Who We Are We're a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our...


  • San Francisco, United States DoorDash USA Full time

    About the Team Data is at the foundation of DoorDash's success. The Data Engineering team builds database solutions for various use cases including reporting, product analytics, marketing optimization, and financial reporting. The team serves as the foundation for decision-making at DoorDash. About the Role DoorDash is looking for a Staff Software Engineer,...

  • Staff Data Engineer

    3 weeks ago


    San Francisco, United States Faire Full time

    Staff Software Engineer - Data PlatformsFaire is an online wholesale marketplace built on the belief that the future is local — independent retailers around the globe are doing more revenue than Walmart and Amazon combined, but individually, they are small compared to these massive entities. At Faire, we're using the power of tech, data, and machine...

  • Staff Data Engineer

    4 weeks ago


    San Francisco, United States Trunk.io Full time

    At Trunk, we're on a mission to empower growing software organizations to deliver high-quality software quickly. We understand the challenges of merge conflicts, poor code quality or consistency, flaky tests, and other distractions that can drain productivity and morale. Our unique approach enables engineering teams to stay focused on designing,...

  • Staff Data Engineer

    3 days ago


    Redwood City, CA, United States Karius Full time

    About KariusKarius is a venture-backed life science startup that is transforming the way pathogens and other microbes are observed throughout the body. By unlocking the information present in microbial cell-free DNA, we're helping doctors quickly solve their most challenging cases, providing industry partners with access to 1000's of biomarkers to...

  • Data Engineer

    2 days ago


    San Francisco, CA, United States Outdefine Full time

    As a skilled professional seeking career growth, you deserve access to the best job opportunities available. Join Outdefine's Trusted community today and apply to premier job openings with leading enterprises globally. Set your own rate, keep all your pay, and enjoy the benefits of a fee-free experience. Data Engineer Outdefine Partner Web3 10-50 ...


  • San Francisco, California, United States Braintrust Data Full time

    About Braintrust DataWe are a cutting-edge developer platform for building world-class AI products. Our innovative approach combines code and datasets, incrementally refining both using frequent evaluations. We provide a rich set of tools to visualize changes and interrogate failures, empowering developers to integrate our platform into their continuous...


  • San Jose, CA, United States Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • San Francisco, United States AirTree Ventures Pty Full time

    The Role As Staff Software Engineer for Linktree’s Data Platform team, you’ll play a crucial role in building a robust, scalable data platform that drives innovative experiences in our core product. Your contributions will unlock new opportunities for our tens of millions of users and our billions of visitors around the world, helping us achieve...

  • Data Engineer

    2 days ago


    San Francisco, CA, United States Woven Full time

    FORE is unlocking dormant operational data with advanced AI to diagnose operational challenges and support better performance. With close advisory from leading academic experts in Information and Operational Theory, we enable leaders across Healthcare, Private Equity, and Enterprise to understand the drivers of inefficiencies and how they can improve. At...

  • Staff Data Engineer

    2 months ago


    Austin, TX, United States Visa Full time

    Company DescriptionVisa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...


  • San Francisco, United States AirTree Ventures Pty Full time

    The RoleAs Staff Software Engineer for Linktree’s Data Platform team, you’ll play a crucial role in building a robust, scalable data platform that drives innovative experiences in our core product. Your contributions will unlock new opportunities for our tens of millions of users and our billions of visitors around the world, helping us achieve...