Staff Data Engineer

2 weeks ago


New York, New York, United States ASAPP Full time

Join our team at ASAPP, where we're developing transformative Vertical AI designed to improve customer experience. Recognized by Forbes AI 50, ASAPP designs generative AI solutions that transform the customer engagement practices of Fortune 500 companies. With our automation and simplified work processes, we empower people to reach their full potential and create exceptional experiences for everyone involved. Work with our team of talented researchers, engineers, scientists, and specialists to help solve some of the biggest and most complex problems the world is facing.

The Data Engineering & Analytics team (DEA) at ASAPP powers the core of our data and analytics products. ASAPP's products are based on natural language processing and serve tens of millions of end-users in real time. We need sophisticated metrics to monitor and continuously improve our systems. We are seeking a Staff Data Engineer to serve as both a technical leader and a core individual contributor, by designing and building analytic data feeds for both our business partners and internal stakeholders.

Applicants with all or some relevant combination of the requirements listed below are encouraged to apply. This is a hybrid role, with a preference for candidates in proximity to either of our NYC or Mountain View offices

What you'll do

  • Lead the batch analytics team by providing the groundwork to modernize our data analytics architecture
  • Design and maintain our data warehouse to facilitate analysis across hundreds of systems events
  • Rethink and influence strategy and roadmap for building efficient data solutions and scalable data warehouses
  • Review code for style and correctness across the entire team
  • Write production-grade Redshift, Athena, Snowflake & Spark SQL queries
  • Manage and maintain Airflow ETL jobs
  • Test query logic against sample scenarios
  • Work across teams to gather requirements and understand reporting needs
  • Investigate metric discrepancies and data anomalies
  • Debug and optimize queries for other business units
  • Review schema changes across various engineering teams
  • Maintain high-quality documentation for our metrics and data feeds
  • Work with stakeholders in Data Infrastructure, Engineering, Product and Customer Strategy to assist with data-related technical issues and build scalable cross platform reporting framework
  • Participate in, and co-manage our on-call rotation to keep production pipelines up and running

What you'll need

  • 7+ years industry experience with clear examples of strategic technical problem solving and implementation
  • Expertise in at least one flavor of SQL. (We use Amazon Redshift, MySQL, Athena and Snowflake)
  • Strong experience with data warehousing (e.g. Snowflake (preferred), Redshift, BigQuery, or similar)
  • Experience with dimensional data modeling and schema design
  • Experience using developer-oriented data pipeline and workflow orchestration (e.g. Airflow (preferred), dbt, dagster or similar)
  • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar)
  • Proficiency in a high-level programming language, especially in terms of reading and comprehending other developers' code and intentions. (We use Python, Scala, and Go)
  • Deep technical knowledge of data exchange and serialization formats such as Protobuf, YAML, JSON, and XML
  • Familiarity with BI & Analytics tools (e.g. Looker, Tableau, Sisense, Sigma computing or similar)
  • Familiarity with streaming data technologies for low-latency data processing (e.g. Apache Spark/Flink, Apache Kafka, Snowpipe or similar)
  • Familiarity with Terraform, Kubernetes and Docker
  • Understanding of modern data storage formats and tools (e.g. parquet, Avro, Delta Lake)
  • Knowledge of modern data design and storage patterns (e.g. incremental updates, partitioning and segmentation, rebuilds and backfills)

What we'd like to see

  • Experience working at a startup preferred
  • Excellent communication skills - (Slack/Email/Documents)
  • Experienced with end user management & communication (cross team as well as external)
  • Must thrive in a fast paced environment and be able to work independently with urgency
  • Can work effectively remotely (able to be proactive about managing blockers, proactive on reaching out and asking questions, and participating in team activities)
  • Experienced in writing technical data design docs (pipeline design, dataflow, schema design)
  • Can scope and breakdown projects, communicate and collaborate progress and blockers effectively with your manager, team, and stakeholders
  • Good at task management & capacity tracking (JIRA (preferred))

ASAPP is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, disability, age, or veteran status. If you have a disability and need assistance with our employment application process, please email us at to obtain assistance. #LI-AG1 #LI-Remote



  • New York, New York, United States Particle Health Full time

    At Particle Health, our goal is to harness the potential of medical records in an innovative platform that shifts focus back to the patient. We dedicate our efforts to connecting diverse medical data sets, making them valuable in various contexts, and creating a seamless method to share this information with chosen organizations. Our clientele includes...

  • Staff Data Scientist

    4 weeks ago


    New York, New York, United States Particle Health Full time

    At Particle Health, our mission is to unlock the power of medical records in an intelligent platform that focuses healthcare back on the patient. Our energy is spent connecting to people's diverse sets of medical data, making that data useful in different settings, and designing an effortless way to share that information with any organization a person...


  • New York, New York, United States Domino Data Lab Full time

    Who we areDomino Data Lab powers model-driven businesses with its leading Enterprise AI platform trusted by over 20% of the Fortune 100. Domino accelerates the development and deployment of data science work while increasing collaboration and governance. With Domino, enterprises worldwide can develop better medicines, grow more productive crops, build better...

  • Data Engineer

    3 weeks ago


    New York, New York, United States Masterworks Full time

    Masterworks Overview:Masterworks is the only platform for investing in blue-chip, multi-million dollar artworks. Prior to Masterworks, the only way to allocate to art was to purchase a multi-million dollar painting, making the asset class inaccessible for most investors. The company has over 900,000 users signed-up purchasing shares in primary offerings...

  • Data Engineer

    3 weeks ago


    New York, New York, United States TopEdge Technology Full time

    Job DescriptionPosition Data Engineer Duration 12 months Location New Jersey Remote Work Permitted Due to COVID-19 the client has agreed to allow the selected candidate to work remotely for the time being. However the selected candidate must be available to report onsite as directed by the client.Position Data EngineerJob DescriptionThe Senior Data Engineer...


  • New York, New York, United States Avero Full time

    Created by hospitality operators for hospitality operators, Avero is the trusted technology partner for the hospitality industry. We empower 40,000+ hospitality professionals with the answers they need to transform their businesses and their lives, getting them out of the back office and into the kitchen with their staff, onto the floor with their guests,...

  • Data Engineer

    4 weeks ago


    New York, New York, United States Esvee Technologies Full time

    Data Engineer (Netezza / Teradata / BigQuery/ GCP) New York City. Job description This role requires reengineering of the existing DW (Netezza/Teradata) and migrate the data to BQ. Whiteboarding and discussing with client leadership explaining architecture options and recommendations for data migration to BQ. 1. Successful background as an architect for...

  • AWS Data Engineer

    3 weeks ago


    New York, New York, United States symplore inc Full time

    JD AWS Data EngineerVisa Status ALL ARE OKAY Location Must Be Local To New Jersey and NYC and the General VicinityContract Length Long Term Years Experience 3+ Years of ExperienceKey Responsibilities Coding Proficiency Mandatory skills in SQL Python and PySpark. Desired expertise in Unix scripting. AWS Data Engineering Hands-on experience in designing and...

  • Senior Data Engineer

    4 weeks ago


    New York, New York, United States The Cypress Group Full time

    Location: Fully Remote (Must live in the US)We are seeking a Senior Data Engineer to join our team and play a crucial role in scaling out our healthcare marketing platform.As a key member of our engineering team, you will be working with Scala, Apache Spark, and Databricks to develop and maintain our data infrastructure.Develop and maintain scalable data...

  • Data Engineer

    4 weeks ago


    New York, New York, United States Strata Infosys Full time

    Requirements Must have 5 years minimum hands-on experience with the following Writes complex SQL queries required to perform Data Acquisition and Ingestion required for Data pipelines Builds Data pipelines and does data engineering activities using technologies like Python Hadoop Spark etc. Ensures the upkeep of the Hadoop Data Lake Platform by monitoring...


  • New York, New York, United States Atechstar Full time

    BASIC QUALIFICATIONS7 years of relevant(data) engineering experience 2 years of people management experience managing data engineers Experience in partnering with product and program management teams Bachelors degree in Engineering Computer Science or a related technical field 2 years of experience owning a program/product/feature scoping requirements...


  • New York, New York, United States RWA Full time

    About is a data provider and analytics dashboard for the next big wave in DeFi: Real-World Assets. DeFi today exists in an insular bubble, but RWAs present the chance for the vast efficiency and transparency benefits of DeFi to impact the outside world. We build tools to help investors effectively navigate this new paradigm.We're backed by top VCs in the...

  • Jr Data Engineer

    1 week ago


    New York, New York, United States codesbright Full time

    Responsibilities Develop stages of a distributed parallel data processing pipelines which includes but is not limited to processes such as configuring data connections data parsing data normalization data mapping and modeling data enrichment and integration with data analytics. Convey data visualization findings through different formats such as graphs...


  • New York, New York, United States Grow Therapy Full time

    About the RoleAs a staff software engineer at Grow Therapy, you'll have a huge amount of scope and autonomy. You get to see a number of different product surface areas (provider portal, marketplace, client experience, infrastructure, and enabling operations) in addition to working full-stack (Flask/Python on the backend, React/TypeScript on the frontend,...

  • Senior Data Engineer

    3 weeks ago


    New York, New York, United States Aura Intel Full time

    We are looking for an engineer to join our team and help us unlock the full potential of our most critical asset: our data. You will have direct impact on our revenue and delivering value to our customers by ensuring that our systems can handle the complex relationship between our entities.We are looking for someone who is constantly looking for ways to...


  • New York, New York, United States Ripple Full time

    Developer Operations (DevOps) at Ripple is responsible for communication, collaboration and integration between the Development and Operations teams so infrastructures, tools and processes are streamlined and effective for faster and automated delivery of products.The Staff Software Engineer, DevOps will contribute in discovery, design and implementation of...


  • New York, New York, United States Simplebet Full time

    Our mission is to power the future of fan engagement. Simplebet is a B2B sports technology company that uses machine learning and real-time solutions to make every moment of every sporting event a betting opportunity. We're reimagining how people enjoy sports with products that are simple, intuitive, and entertaining. Our technology powers micro-betting...


  • New York, New York, United States NBCUniversal Full time

    Company DescriptionWe create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our theme parks and consumer experiences. We own and operate leading entertainment and news brands, including NBC, NBC News, MSNBC, CNBC, NBC Sports, Telemundo, NBC Local Stations, Bravo, USA Network, and...


  • New York, New York, United States domunus Full time

    Senior Data Architect EngineerJob Overview***This position is a partner position and based within a start-up environment and the compensation will initially be on a handsome equity basis. Upon launch, a monthly fee will be paid.*Domunus is a web-based application for private residential real estate. It resolves the tedious transaction process and provides an...


  • New York, New York, United States Atechstar Full time

    Job description 0-2 years of demonstrable experience designing technological solutions to complex data problems developing & testing modular reusable efficient and scalable code to implement those solutions. Ideally this would include work on the following technologies Expert-level proficiency in at-least one of Java C++ or Python (preferred)/Scala knowledge...