Director, Site Reliability Engineer

3 months ago


Boston, Massachusetts, United States Chewy Full time

Our Opportunity:

We are looking for a Director, Site Reliability Engineer at our facility in Boston, Massachusetts, to establish and manage incident response protocols for SREs, including on-call schedules and post-incident reviews, to minimize downtime and improve system performance.

What You'll Do:

  • Develop and execute a comprehensive SRE strategy that aligns with the company's business objectives and growth plans.
  • Recruit, mentor, and develop SRE team members, fostering their professional growth and skill development.
  • Engage cross-functionally with other engineering teams, manage issues as they arise, and promote reliability and resilience practices throughout the organization.
  • Transform business priorities into technical initiatives and ensure the alignment of SRE efforts with the broader organizational goals.
  • Ensure timely and consistent communication to facilitate a clear understanding of ongoing projects and their prioritization within the organization.
  • Establish strong working relationships at all organizational levels and across functional teams.
  • 15% domestic travel required.

What You'll Need:

  • Bachelor's degree in Computer Science, Computer Systems Engineering, Electrical Engineering, Telecommunication System Management or related field and 10 years of experience.
  • Experience must include 7 years with:
      • Engineering management;
      • ServiceNow ITOM and ITSM modules that focus on incident, problem, and change management;
      • Developing executive-friendly dashboards based on observable metrics in IT systems (KPIs, incident trends, MTTR, MTTD, etc.);
      • Docker and Kubernetes or similar container-based architectures; and
      • Microservices architecture, design patterns, and standard methodologies.
  • Experience must also include:
      • Performance engineering, observability, resiliency, and chaos engineering of large-scale, latency-sensitive enterprise applications;
      • ITSM processes and tools such as Jira and PagerDuty;
      • Standard DevOps tools;
      • Build automation tools (Jenkins), issue tracking tools, and source control systems (GitHub);
      • AWS offerings such as ECS, EC2, Lambda, Fargate, S3, DynamoDB, and API Gateway; and
      • Telemetry tooling and observability systems such as Prometheus, Splunk, Datadog, and Grafana.
  • The position is eligible for the Employee Referral Program.

Chewy is committed to equal opportunity. We value and embrace diversity and inclusion of all Team Members. If you have a disability under the Americans with Disabilities Act or similar law, and you need an accommodation during the application process or to perform these job requirements, or if you need a religious accommodation, please contact

If you have a question regarding your application, please contact

To access Chewy's Customer Privacy Policy, please click here. To access Chewy's California CPRA Job Applicant Privacy Policy, please click here.