Software Engineer, Enterprise Data Platform

7 days ago


San Francisco, California, United States Notion Full time

About Us
Notion helps you build beautiful tools for your life's work. In today's world of endless apps and tabs, Notion provides one place for teams to get everything done, seamlessly connecting docs, notes, projects, calendar, and email—with AI built in to find answers and automate work. Millions of users, from individuals to large organizations like Toyota, Figma, and OpenAI, love Notion for its flexibility and choose it because it helps them save time and money.

In-person collaboration is essential to Notion's culture. We require all team members to work from our offices on Mondays and Thursdays, our designated Anchor Days. Certain teams or positions may require additional in-office workdays.

About The Role
Join Notion's Data Platform team as we scale our infrastructure for enterprise customers. You'll help design and build the core data platform that powers Notion's AI, analytics, and search while meeting stringent security, privacy, and compliance requirements. This role focuses on the data platform layer (storage, compute, pipelines, governance) and partners closely with Security, Search Platform, AI, and Data Engineering.

What You'll Do

  • Design and evolve the data lakehouse

Build and operate core lakehouse components (e.g., Iceberg/Hudi/Delta tables, catalogs, schema management) that serve as the source of truth for analytics, AI, and search.

  • Own critical data pipelines and services

Design, implement, and harden batch and streaming pipelines (Spark, Kafka, EMR, etc.) that move and transform data reliably across regions and cells.

  • Advance EKM and encryption-by-design

Work with Security and platform teams to integrate Enterprise Key Management (EKM) into data workflows, including file- and record-level encryption and safe key handling in Spark and storage systems.

  • Improve data access, auditability, and residency

Build primitives for fine-grained access control, auditing, and data residency so customers can see who accessed what, where, and under which guarantees.

  • Drive reliability and observability

Raise the operational bar for our data stack: improve on-call experience, debugging, and alerting for data jobs and services.

  • Optimize large-scale performance and cost

Tackle performance and cost challenges across Kafka, Spark, and storage for very large workspaces (20k+ users, multi-cell deployments), including cluster migrations and workload tuning.

  • Enable ML and search workflows

Build infrastructure to support training and inference pipelines, ranking workflows, and embedding infrastructure on top of the shared data platform.

  • Shape the platform roadmap

Contribute to design docs and evaluations that influence our long-term platform direction and vendor choices.

Skills You'll Need

  • Experience: 5+ years building and operating data platforms or large-scale data infrastructure for SaaS or similar environments.
  • Programming: Strong skills in at least one of Python, Java, or Scala; comfortable working with SQL for analytics and data modeling.
  • Distributed data systems: Hands-on experience with Spark or similar distributed processing systems, including debugging and performance tuning.
  • Streaming & ingestion: Experience with Kafka or equivalent streaming systems; familiarity with CDC/ingestion patterns (e.g., Debezium, Fivetran, custom connectors).
  • Lakehouse / storage: Experience with data lakes and table formats (Iceberg, Hudi, or Delta) and/or data catalogs and schema evolution.
  • Security & governance: Practical understanding of access control, encryption at rest/in transit, and auditing as they apply to data platforms.
  • Cloud infrastructure: Experience with at least one major cloud provider (AWS, GCP, or Azure) and managed data/compute services (e.g., EMR, Dataproc, Kubernetes-based compute).
  • Operations: Comfortable owning services and pipelines in production, including on-call, incident response, and reliability improvements.

Nice To Haves

  • Experience working directly with enterprise customers or on features like data residency, EKM, or compliance-driven auditing.
  • Prior work on Databricks, Unity Catalog, Lake Formation, or similar catalog/governance systems.
  • Background implementing multi-region / multi-cell data architectures.
  • Experience building ML training/eval workflows or model/feature stores on top of a shared data platform.
  • Familiarity with vector databases or search infrastructure, and how they integrate with upstream data systems.
  • Experience designing or improving observability for data platforms (e.g., Honeycomb, OpenTelemetry, metrics/trace-heavy debugging).

Our customers come from all walks of life and so do we. We hire great people from a wide variety of backgrounds, not just because it's the right thing to do, but because it makes our company stronger. If you share our values and our enthusiasm for small businesses, you will find a home at Notion.

Notion is proud to be an equal opportunity employer. We do not discriminate in hiring or any employment decision based on race, color, religion, national origin, age, sex (including pregnancy, childbirth, or related medical conditions), marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, sexual orientation, or other applicable legally protected characteristic. Notion considers qualified applicants with criminal histories, consistent with applicable federal, state and local law. Notion is also committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation made due to a disability, please let your recruiter know.

Notion is committed to providing highly competitive cash compensation, equity, and benefits. The compensation offered for this role will be based on multiple factors such as location, the role's scope and complexity, and the candidate's experience and expertise, and may vary from the range provided below. For roles based in San Francisco, the estimated base salary range for this role is $230,000 - $300,000 per year.

By clicking "Submit Application", I understand and agree that Notion and its affiliates and subsidiaries will collect and process my information in accordance with Notion's Global Recruiting Privacy Policy.



  • San Francisco, California, United States Jobs via Dice Full time

    To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.Job CategorySoftware EngineeringJob DetailsAbout SalesforceSalesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn't...


  • San Francisco, California, United States OpenAI Full time

    About the TeamThe Platform Team at OpenAI owns the foundational data and processing stack powering the OpenAI API and products built on top of it. We innovate on the infrastructure needed to serve the latest models and agentic workflows at an unprecedented pace of growth and scale with high performance and reliability. Join us to build and operate the core...


  • San Francisco, California, United States Hayden AI Full time

    About UsAt Hayden AI, we are on a mission to harness the power of computer vision to transform the way transit systems and other government agencies address real-world challenges.From bus lane and bus stop enforcement to transportation optimization technologies and beyond, our innovative mobile perception system empowers our clients to accelerate transit,...


  • San Francisco, California, United States Glean Full time

    About GleanGlean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and...


  • San Francisco, California, United States Glean Full time

    About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and...


  • San Francisco, California, United States Gusto Full time

    About GustoAt Gusto, we're on a mission to grow the small business economy. We handle the hard stuff—like payroll, health insurance, 401(k)s, and HR—so owners can focus on their craft and customers. With teams in Denver, San Francisco, and New York, we're proud to support more than 400,000 small businesses across the country, and we're building a...


  • San Francisco, California, United States Astranis Full time $120,000 - $150,000

    Astranis builds advanced satellites for high orbits, expanding humanity's reach into the solar system. Today, Astranis satellites provide dedicated, secure networks to highly-sophisticated customers across the globe— large enterprises, sovereign governments, and the US military. With five satellites on orbit and many more set to launch soon, the company is...


  • San Francisco, California, United States Reducto Full time

    About ReductoReducto helps AI teams ingest real world enterprise data with state of the art accuracy.The vast majority of enterprise data — from financial statements to health records — is locked in unstructured file formats like PDFs and spreadsheets. We train vision models to read those documents the way a human would, and make it possible to build...


  • San Francisco, California, United States Oscar Full time

    Oscar is working with a leading AI solution for Semiconductor Manufacturing Process Optimization organization that is looking for an experiencedSr Data Platform Engineerto join their team.As theSr Data Platform Engineer, you will design and build the core data infrastructure that underpins our AI- and analytics-driven semiconductor manufacturing platform....


  • San Francisco, California, United States Glean Full time

    About GleanGlean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With over 100 enterprise SaaS connectors, flexible LLM choice, and...