Staff Site Reliability Engineer

3 days ago


San Jose, California, United States | Zscaler | Full time
About Zscaler

Zscaler is a leading cloud security company that serves thousands of enterprise customers worldwide, including 40% of Fortune 500 companies. Founded in 2007, our mission is to make the cloud a safe place to do business and provide a seamless experience for enterprise users.

Our Engineering Team

Our Engineering team has built the world's largest cloud security platform from the ground up, with over 100 patents and a strong focus on innovation. We're committed to enhancing our services and increasing our global footprint, making us a leader in cloud security.

Job Summary

We're seeking an experienced Staff Site Reliability Engineer - Technical Duty Officer to join our Shared Platform Engineering team. In this role, you'll lead and advocate for the transformation to a world-leading SRE organization and promote SRE principles within the Engineering Department.

Key Responsibilities
  • Lead and advocate for the transformation to a world-leading SRE organization
  • Provide expert leadership during critical outages, coordinating multiple teams to ensure streamlined decision-making and quick resolution
  • Promote a customer-focused approach by addressing and mitigating global customer environment issues
  • Develop and implement scalable process frameworks and observability strategies to ensure rapid problem diagnosis, response, and service reliability
  • Collaborate with product teams to thoroughly analyze failures and integrate insights to improve service reliability, scalability, and operational efficiency
Requirements
  • 5+ years of experience as a Site Reliability Engineer, with relevant experience in an Operations or Engineering environment
  • Hands-on experience troubleshooting Linux-based systems
  • Networking knowledge and ability to troubleshoot TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP issues
  • Coding experience (preferably Python) building tools, scripting, or automation
  • Bachelor's degree in Computer Science, a related technical field involving computer systems engineering, or equivalent practical experience
Preferred Qualifications
  • Experience supporting High/Moderate FedRAMP environments
  • Understanding of observability practices and tools such as Grafana, Datadog, and Splunk
  • Experience leading major incidents in large-scale, high-uptime environments
What We Offer

Zscaler offers a comprehensive benefits program, including various health plans, time off plans, parental leave options, retirement options, education reimbursement, and more. The salary range for this role is $136,500-$195,000 USD.

We're an equal opportunity and affirmative action employer, celebrating diversity and committed to creating an inclusive environment for all employees. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, national origin, sexual orientation, gender identity, or any other characteristic protected by federal, state, or local laws.


