Staff Site Reliability Engineer

3 days ago


Denver, Colorado, United States Cribl, Inc Full time

Cribl, Inc is seeking a Staff Site Reliability Engineer to join our mission to unlock the value of all observability data. Our team is committed to shipping high-quality software and enjoying the best of the internet. As a remote-first company, we empower our employees to do their best work, wherever they are. We're growing rapidly and looking for collaborative, curious, and motivated team members who are passionate about putting customers first. Cribl is the data engine for IT and Security, and many of the biggest names in the most demanding industries trust us to solve their most pressing data needs. If you're passionate about reliability and have strong opinions on how to make things better, we want to talk.

As an active member of our team, you will engage with teams to improve service delivery and reliability across the entire lifecycle. You will measure and monitor production systems with an eye toward availability, latency, and overall system health. You will seek out the cause of errors and instability in our production cloud services and drive teams toward better operational excellence. You will engage with product and platform teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability. You will help identify and drive down toil with creative innovation and automation. On-call responsibilities are also part of this role.

We're looking for someone with extensive experience in enterprise-scale continuous delivery environments, 8+ years of experience with a DevOps or SRE job title, and development experience in a Linux/Mac environment. You should also have experience with configuration management tools like Terraform, Puppet, Chef, or Ansible. Additionally, you should have knowledge of cloud platforms and of container and orchestration technologies, as well as APM and observability tools like New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, and Sentry. A background in Linux Systems Engineering and experience with incident response tools like PagerDuty, FireHydrant, and Blameless are also required. If you have a love for high-quality software and a knack for testing, and you're comfortable with a high level of autonomy and working with a distributed team, we want to hear from you.

Preferred qualifications include knowledge of cloud and application security, and strong knowledge of cloud design patterns or scale, data management, resiliency, etc.

We offer a competitive salary range of $144,000 - $278,000, dependent on geographic location, as well as a generous benefits package that includes health, dental, vision, short-term disability, and life insurance; paid holidays and paid time off; a fertility treatment benefit; 401(k); equity; and eligibility for a discretionary company-wide bonus.

We're an equal opportunity employer and welcome diversity in all its forms. We're building a culture where differences are valued and welcomed, and we work together to bring out the best in each other. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable.



  • Denver, Colorado, United States Cribl, Inc Full time

    About Cribl, Inc: Cribl, Inc is a leading provider of observability solutions, empowering organizations to unlock the value of their data. Our mission is to deliver innovative, scalable, and reliable solutions that meet the evolving needs of our customers. Job Summary: We are seeking a highly skilled Staff Site Reliability Engineer to join our team. As a key...


  • Denver, Colorado, United States VIZIO Full time

    About the Role: We are seeking an experienced Site Reliability Staff Engineer to join our dynamic team at VIZIO. As a key player on the Operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based platform. Key Responsibilities: Design, build, and review key SRE metrics to ensure platform availability and...


  • Denver, Colorado, United States VIZIO Full time

    About the Role: We are seeking an experienced Site Reliability Staff Engineer to join our dynamic team at VIZIO. As a key player on the Operations team, you will demonstrate senior-level proficiency in leveraging cloud technologies to ensure the reliability, scalability, and performance of our platform. Key Responsibilities: Design, build, and review key SRE...


  • Denver, Colorado, United States RingCentral Full time

    About the Role: RingCentral is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and availability of our cloud-based services. You will work closely with our development and operations teams to identify and resolve potential issues, and...


  • Denver, Colorado, United States Vertafore Full time

    Job Title: Site Reliability Engineer. Vertafore is a leading technology company that provides innovative software solutions to the insurance industry. We are seeking a talented Site Reliability Engineer to join our team. Job Summary: We are looking for a skilled Site Reliability Engineer to ensure the high availability and stability of our Vertafore products....


  • Denver, Colorado, United States Ping Identity Full time

    About Ping Identity: Ping Identity is a leading provider of cloud identity and access management solutions. We empower organizations to provide secure and seamless digital experiences for their users, without compromise. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible...


  • Denver, Colorado, United States Vertafore Full time

    Job Title: Senior Site Reliability Engineer. Vertafore is a leading technology company that provides innovative software solutions to the insurance industry. We are seeking a talented Senior Site Reliability Engineer to join our team. Job Summary: We are looking for a highly skilled and experienced Senior Site Reliability Engineer to join our team. The successful...


  • Denver, Colorado, United States Zayo Group Full time

    Job Title: Senior Site Reliability Engineer. Zayo Group is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a critical member of our infrastructure team, you will be responsible for ensuring the uptime, performance, and scalability of our critical infrastructure. Key Responsibilities: Incident Management: Lead the incident...


  • Denver, Colorado, United States Fruition Full time

    About the Role: Fruition, a leader in software development, is seeking an experienced Site Reliability Engineer to improve our CI/CD process and enhance observability using Prometheus, Grafana, and Kubernetes. Key Responsibilities: Optimize CI/CD pipelines for various content management systems and web applications. Migrate existing CI/CD workflows from GitLab to...


  • Denver, Colorado, United States Oracle Full time

    About the Role: We are seeking a highly skilled Site Reliability DevOps Engineer to join our team at Oracle. As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with a deep focus on architecture, production operations, capacity planning, performance management, deployment, and release...


  • Denver, Colorado, United States DAT Freight & Analytics Full time

    About DAT Freight & Analytics: DAT Freight & Analytics is a leading provider of transportation supply chain logistics solutions. With 45 years of experience, we continue to innovate and transform the industry by deploying software solutions to millions of customers daily. Our mission is to provide the most relevant data and accurate insights to help customers...


  • Denver, Colorado, United States RingCentral Full time

    Unlock New Opportunities: At RingCentral, we're on a mission to revolutionize the way businesses communicate and collaborate. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, performance, and availability of our cloud-based services. Key Responsibilities: Collaborate with development and operations teams to integrate...


  • Denver, Colorado, United States RingCentral Full time

    Unlock Opportunities with RingCentral: At RingCentral, we're on a mission to revolutionize the way businesses communicate and collaborate. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, performance, and availability of our cloud-based services. Key Responsibilities: Collaborate with development and operations teams to...


  • Denver, Colorado, United States LeoVegas Group Full time

    About the Role: At LeoVegas Group, we're seeking a skilled Site Reliability Engineer to join our team. As a critical part of our platform strategy, this role will focus on providing technical expertise and support to our engineering teams to enable them to deliver high-quality software solutions efficiently. Key Responsibilities: Help domain teams build software...


  • Denver, Colorado, United States LeoVegas Group Full time

    About the Role: As a Site Reliability Engineer at LeoVegas Group, you will play a critical part in our platform strategy, focusing on providing technical expertise and support to our engineering teams to enable them to deliver high-quality software solutions efficiently. This includes helping teams with technical continuous effort, overcoming major technical...


  • Denver, Colorado, United States Ping Identity Full time

    Job Title: Site Reliability Engineer II. About Ping Identity: We're a leading provider of cloud identity and access management solutions, dedicated to making digital experiences secure and seamless for all users. Our mission is to empower individuals and organizations to thrive in a rapidly changing digital landscape. Job Summary: We're seeking a highly skilled...


  • Denver, Colorado, United States Vertafore Full time

    Job Title: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Vertafore. As a key member of our engineering team, you will be responsible for ensuring the high availability and stability of our Vertafore products. Key Responsibilities: Subject Matter Expert on how the applications work, its...


  • Denver, Colorado, United States Saxon Global Full time

    Job Summary: We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Saxon Global. As a key member of our cloud engineering team, you will be responsible for providing primary management, administration, support, and ongoing maintenance of production platforms within a 24x7x365 environment and data center environment. Key...