Data Reliability Engineer

11 hours ago


Boston, Massachusetts, United States WHOOP Full time
Unlock Human Performance with WHOOP

At WHOOP, we're on a mission to empower individuals to perform at their best. As a Data Reliability Engineer, you'll play a crucial role in ensuring the reliability and accuracy of our data infrastructure, including customer-facing ML models.

Responsibilities:
  • Design and implement monitoring solutions to track data ingestion, processing, and ML model performance.
  • Develop tools for analyzing data quality, enabling internal stakeholders to evaluate metrics, identify anomalies, and implement measures to address data integrity issues.
  • Create alerting mechanisms to proactively detect and respond to data pipeline failures, latency issues, and anomalies.
  • Collaborate with cross-functional teams to understand data pipelines, requirements, and use cases, and implement solutions to enhance data observability.
  • Automate remediation processes to streamline data validation, error handling, and incident response procedures.
  • Perform root cause analysis to investigate and troubleshoot data infrastructure issues, and implement preventive measures to minimize future occurrences.
  • Stay current with industry trends and best practices in data observability, and incorporate relevant innovations into our data engineering processes.
Qualifications:
  • Bachelor's degree in Computer Science, Engineering, or a related field; Master's degree preferred.
  • Proven experience working as a Data Analyst, Data Scientist, or a similar role with a focus on data observability and reliability.
  • Strong proficiency in programming languages such as Python, Java, or Scala.
  • Proficiency with SQL.
  • Familiarity with data storage and processing technologies such as Snowflake, S3, Spark, Kafka, and relational databases.
  • Proven understanding of ML model lifecycles.
  • Expertise in designing and implementing monitoring and alerting solutions using tools like Hex, Grafana, Datadog, or similar.
  • Excellent analytical and problem-solving skills with keen attention to detail.
  • Strong communication and collaboration skills with the ability to work effectively in cross-functional teams.
  • Experience with the AWS cloud platform.

This role is based in the WHOOP office located in Boston, MA. The successful candidate must be prepared to relocate if necessary to work out of the Boston, MA office.

WHOOP is an Equal Opportunity Employer and participates in E-Verify to determine employment eligibility. It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment.



  • Boston, Massachusetts, United States WHOOP Full time

    At WHOOP, we're on a mission to unlock human performance. Our innovative data platforms are the game-changing connective tissue flowing vital resources to teams, applications, and insightful solutions that power real-time AI, cutting-edge science, and bold visionary decision-making. As a Data Reliability Engineer at WHOOP, you will play a crucial role in...


  • Boston, Massachusetts, United States Klaviyo Full time

    About Klaviyo: Klaviyo is a leading provider of email marketing and customer data platforms. We empower creators to own their destiny by making first-party data accessible and actionable like never before. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring...


  • Boston, Massachusetts, United States Klaviyo Full time

    About Klaviyo: Klaviyo is a leading provider of email marketing and customer data platforms. We empower creators to own their destiny by making first-party data accessible and actionable like never before. Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for...

  • Reliability Engineer

    3 weeks ago


    Boston, Massachusetts, United States Cirkul Inc Full time

    About Cirkul Inc: Cirkul Inc is a rapidly growing beverage technology company dedicated to making a healthier world by promoting water consumption. Job Title: Reliability Engineer. We are seeking a skilled Reliability Engineer to join our team. As a Reliability Engineer, you will play a crucial role in developing and implementing strategies to improve the...


  • Boston, Massachusetts, United States Dice Full time

    Revolutionize Data Management with Our Dynamic Startup. We are partnered with a cutting-edge startup poised to disrupt the data management industry, competing with established players. Our client, Motion Recruitment Partners, LLC, is seeking a Senior Site Reliability Engineer to join their growing DevOps team to ensure the reliability and performance of their...


  • Boston, Massachusetts, United States Klaviyo Full time

    About the Role: We're seeking a seasoned Site Reliability Engineering Manager to lead our team in Boston and remotely. As a key member of our engineering organization, you'll be responsible for managing a team of 4-6 Site Reliability Engineers and driving the development of secure software architecture and development. Key Responsibilities: Manage a team of Site...

  • Data Engineer

    4 weeks ago


    Boston, Massachusetts, United States Holistic Industries Full time

    Job Title: Data Engineer. We are seeking a highly skilled Data Engineer to join our dynamic technology team at Holistic Industries. As a key member of our team, you will play a crucial role in designing, implementing, and optimizing data capture processes to ensure smooth data flow for analysis and reporting. Key Responsibilities: Data Capture and Integration:...

  • Data Engineer

    4 weeks ago


    Boston, Massachusetts, United States Gunderson Dettmer Full time

    About Gunderson Dettmer: Gunderson Dettmer is a leading international law firm with a unique focus on the innovation economy. We're at the forefront of legal innovation, actively developing and refining the law firm tech stack of the future. Job Description: We're seeking a talented Data Engineer to join our team and help architect the modern law firm's data...

  • Data Engineer

    3 weeks ago


    Boston, Massachusetts, United States Gunderson Dettmer Full time

    Job Title: Data Engineer. Gunderson Dettmer is a leading international law firm that specializes in the innovation economy. We are seeking a highly skilled Data Engineer to join our team and contribute to the development of our data infrastructure. Responsibilities: Design, build, and manage robust ETL processes with performance monitoring. Create data...

  • Data Engineer

    3 weeks ago


    Boston, Massachusetts, United States Holistic Industries Full time

    Job Opportunity: We are seeking a highly skilled Data Engineer to join our dynamic technology team at Holistic Industries. As a key member of our team, you will play a crucial role in integrating, processing, and managing data to support our business operations. Key Responsibilities: Collaborate with operations and business intelligence teams to design and...


  • Boston, Massachusetts, United States Beacon Engineering Resources Full time

    Beacon Engineering Resources is seeking a skilled Reliability Engineer to provide critical guidance on Design for Reliability, Maintainability, and Supportability for new product introduction teams. The ideal candidate will have a strong background in Failure Modes Effects Analysis, reliability predictions, and probabilistic modeling. Key responsibilities...


  • Boston, Massachusetts, United States CarGurus Full time

    We're seeking a seasoned Data Engineering Principal to join our team at CarGurus. As a key member of our Data Engineering team, you'll be responsible for transforming raw data into clean, reliable, and organized data models that drive informed decision-making across the organization. With a strong background in data modeling and a passion for innovation,...

  • MDM Data Engineer

    4 days ago


    Boston, Massachusetts, United States MassMutual Full time

    Job Title: MDM Data Engineer. MassMutual is seeking a highly skilled MDM Data Engineer to join our Data Platform Data Engineering team. As an MDM Data Engineer, you will design, build, and measure complex Informatica Power Center and MDM processes to master data for different Master Data domains in MassMutual. Key Responsibilities: Design and implement ELT/ETL...


  • Boston, Massachusetts, United States Klaviyo Full time

    About Klaviyo: Klaviyo is a leading provider of email marketing and customer data platforms. We empower creators to own their destiny by making first-party data accessible and actionable like never before. Job Description: We are seeking a highly skilled Site Reliability Engineering Lead to join our team. As a Site Reliability Engineer, you will be responsible...


  • Boston, Massachusetts, United States WEX Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Platform Reliability organization. As a key member of our team, you will be responsible for ensuring the reliability and performance of our internal systems and services. As a Site Reliability Engineer, you will work closely with our development teams to design and implement...

  • Senior Data Engineer

    18 hours ago


    Boston, Massachusetts, United States Motion Recruitment Full time

    Job Title: Senior Data Engineer. We're seeking a highly skilled Senior Data Engineer to join our team at Motion Recruitment. As a key member of our Data Team, you will play a critical role in developing new platforms to store and process massive amounts of customer data, providing actionable insights to improve the customer experience. Responsibilities: * Build...


  • Boston, Massachusetts, United States Oracle Full time

    Job Description: This team will focus on product automation of Infrastructure, sustainability, and troubleshooting for Oracle Health. As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release...


  • Boston, Massachusetts, United States StartUs GmbH Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Spotify. As a Site Reliability Engineer, you will be responsible for designing and implementing scalable and reliable systems to support our production infrastructure. Key Responsibilities: Design and document systems, including writing and...


  • Boston, Massachusetts, United States Insight Global Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our software systems. Key Responsibilities: Design and implement scalable and reliable software systems. Collaborate with cross-functional teams to...