Staff Cloud Reliability Engineer
2 weeks ago
We are seeking a highly skilled Staff Cloud Reliability Engineer to join our team at Cribl. As a key member of our engineering organization, you will be responsible for envisioning, creating, deploying, testing, and shipping our cloud-based products.
This is a unique opportunity to be part of a company that is fundamentally changing the technology landscape. You will have the chance to work on cutting-edge projects, collaborate with a talented team of engineers, and contribute to the development of our next-generation software.
As a Cloud Site Reliability Engineer, you will be involved in all aspects of our cloud infrastructure, from design and development to deployment and maintenance. You will work closely with our product and platform teams to improve and evolve our systems, ensuring they are reliable, resilient, and observable.
Key responsibilities will include:
- Engaging with teams to improve service delivery and reliability across the entire lifecycle
- Measuring and monitoring production systems for availability, latency, and overall system health
- Seeking out the cause of errors and instability in our production cloud services and driving teams towards better operational excellence
- Engaging with product and platform teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability
- Helping to identify and drive down toil with creative innovation and automation
- On-call responsibilities
Requirements for this role include:
- Extensive experience with enterprise-scale continuous delivery environments
- 8+ years of experience with a DevOps or SRE job title
- Development experience in a Linux/Mac environment
- Experience with Configuration Management Tools like Terraform (preferred) or Puppet, Chef, Ansible
- Experience with sustainable incident response in a blameless environment
- Knowledge of cloud platforms (prefer AWS) and container + orchestration technologies
- Experience with APM and Observability and related tools such as, New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, Sentry etc.
- Background in Linux Systems Engineering
- Experience with incident response related tools for instance, PagerDuty, FireHydrant, Blameless etc.
- Comfortable with a high level of autonomy and working with a distributed team
Preferred qualifications include:
- Knowledge of Cloud and application security
- Strong knowledge of cloud design patterns for scale, data management, resiliency, etc.
- A love for high quality and a knack for testing
- Opinions about dashboards, metrics, and SLO's
Cribl offers a competitive salary range of $144,000 - $278,000, dependent on geographic location, as well as a generous benefits package including health, dental, vision, short-term disability, and life insurance, paid holidays and paid time off, a fertility treatment benefit, 401(k), equity, and eligibility for a discretionary company-wide bonus.
-
Cloud Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Futran Tech Solutions Pvt. Ltd. Full timeRole: Cloud Reliability Engineer Location: Phoenix, AZ Note: Splunk or Signalfx is required for this position. Responsibilities: Implement enterprise capabilities, tools, and innovation to improve availability in a multi-cloud ecosystem by evolving observability, monitoring, logging, and CI/CD integration. Develop and maintain systems, infrastructure, and...
-
Staff Site Reliability Engineer
4 weeks ago
Phoenix, Arizona, United States Cribl Full timeAbout CriblCribl is a serious company that doesn't take itself too seriously. We're a remote-first company that believes in empowering our employees to do their best work, wherever they are. As the data engine for IT and Security, we're trusted by many of the biggest names in the most demanding industries to solve their most pressing data needs.Job...
-
Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States TWO95 International Full timeJob Title: Site Reliability EngineerAt TWO95 International, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement monitoring and alerting...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States CloudBC Labs Full timeJob Title: Site Reliability EngineerJob Summary:CloudBC Labs is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:• Lead onshore and offshore teams to ensure seamless...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Fly LLC Full timeAbout Fly LLCFly LLC is a cloud-based platform that enables users to deploy applications near their users, regardless of their location. Our platform is built on top of Linux, HashiCorp stack, Firecracker, and WireGuard, and we're looking for a skilled Site Reliability Engineer to join our team.Job DescriptionWe're seeking a highly motivated and experienced...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Fly LLC Full timeAbout UsFly LLC is a cloud platform that enables developers to deploy and manage applications with ease. We're a team of passionate engineers who are dedicated to building a scalable and reliable infrastructure.Job DescriptionWe're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Trident Consulting Full timeJob Title: Site Reliability EngineerTrident Consulting is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability and reliability of our production systems.Key Responsibilities:Lead production stability efforts by preventing production issues and improving...
-
Site Reliability Engineer
2 weeks ago
Phoenix, Arizona, United States CloudBC Labs Full timeJob SummaryCloudBC Labs is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure. This is a hybrid role that requires both onshore and offshore experience.Key ResponsibilitiesLead onshore and offshore teams to ensure...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Curve Full timeAbout CurveCurve is a pioneering fintech company that's revolutionizing the way people manage their finances. Our mission is to simplify financial lives, empowering individuals to focus on what truly matters.Job DescriptionWe're seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our engineering organization, you'll play a...
-
Cloud Infrastructure Engineer
3 weeks ago
Phoenix, Arizona, United States CloudBC Labs Full timePosition OverviewCloudBC Labs is seeking a highly skilled Site Reliability Engineer to join our team.Job DescriptionWe are looking for a talented individual with experience leading onshore/offshore teams and a strong background in cloud infrastructure engineering. The ideal candidate will have hands-on experience building and troubleshooting cloud-based...
-
Cloud Engineer
3 weeks ago
Phoenix, Arizona, United States Diverse Lynx Full timeJob Title:SRE EngineerLocation: Remote OpportunityPosition Type: ContractJob Overview:We are seeking a highly skilled SRE Engineer to join our team at Diverse Lynx LLC. As a key member of our technical staff, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement,...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States SMBC Full timeTransformative Banking ExperienceSMBC is revolutionizing the banking industry with a completely new, 100% digital bank that prioritizes customers' best interests. We're seeking a skilled Site Reliability Engineer to join our mission.Key ResponsibilitiesEnsure production applications reliability through proactive monitoring and incident response.Collaborate...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Sumitomo Mitsui Banking Corp Full timeEvolve Banking with UsWe're on a mission to create a 100% digital bank that truly serves customers' best interests. Our team of seasoned financial services professionals is committed to building a bank from scratch, with a focus on technology infrastructure, modern marketing, and customer experience.As a Site Reliability Engineer, you'll play a critical role...
-
Site Reliability Engineer
2 weeks ago
Phoenix, Arizona, United States Curve Full timeJob DescriptionAt Curve, we're on a mission to simplify your finances, so you can focus on what matters most in life. We're developing a ground-breaking product with our customers at the core, and we're looking for a skilled Site Reliability Engineer to join our team.About the RoleWe're searching for a talented individual who is excited by the idea of owning...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Frontend Arts Full timeJob Title: Site Reliability EngineerAt Frontend Arts, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our production environment.Key Responsibilities:Perform root cause analysis and manage communication with stakeholders to resolve...
-
Senior Cloud Engineer
3 weeks ago
Phoenix, Arizona, United States American Express Full timeLead the Way in Cloud EngineeringAt American Express, we're committed to delivering exceptional customer experiences through innovative technology solutions. As a Senior Engineer in our Public Cloud Platform SRE team, you'll play a pivotal role in ensuring the reliability, scalability, and performance of our public cloud infrastructure.Key...
-
Site Reliability Engineer II
3 weeks ago
Phoenix, Arizona, United States Experis Full timeJob Title: Site Reliability Engineer IIWe are seeking a highly skilled Site Reliability Engineer II to join our team at Experis. As a key member of our operations team, you will be responsible for ensuring the reliability and support of our Container Platform across on-prem and external clouds.Responsibilities:Monitor and troubleshoot performance,...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States Futran Tech Solutions Pvt. Ltd. Full timeJob Title: Site Reliability EngineerAbout the Role:Futran Tech Solutions Pvt. Ltd. is seeking a highly skilled Site Reliability Engineer to join our team in Phoenix, AZ. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our cloud-based infrastructure and applications.Key...
-
Site Reliability Engineer
3 weeks ago
Phoenix, Arizona, United States LightEdge Solutions Full timeJoin LightEdge Solutions as a Site Reliability EngineerLightEdge Solutions is a leading provider of IT solutions, dedicated to delivering cutting-edge technology to propel businesses forward. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliable operation of our systems and services.Key Responsibilities:Monitoring and...
-
Senior Software Engineer
1 week ago
Phoenix, Arizona, United States Genie Healthcare Full timeJob Description:As a Senior Software Engineer - Cloud Architecture, you will be responsible for designing and implementing cloud-based systems to meet our business needs. This includes developing and maintaining cloud infrastructure, ensuring scalability, security, and reliability.Key Responsibilities:- Design and implement cloud-based systems- Develop and...