Senior Staff Site Reliability Engineer

4 weeks ago


Santa Clara, California, United States Palo Alto Networks Full time
About Us

Palo Alto Networks is a leader in the cybersecurity industry, dedicated to protecting the digital way of life. Our mission is to be the cybersecurity partner of choice, and we're looking for innovators who share our passion for shaping the future of cybersecurity.

We're a company built on disruption, and we're looking for individuals who are comfortable with ambiguity and excited by the prospect of a challenge. Our engineers are at the core of our products, and we're constantly innovating to solve problems that no one has pursued before.

Job Description

We're seeking a Senior Staff Site Reliability Engineer to join our CDL/SLS team, supporting the services running on our large infrastructure. As a key member of our team, you'll contribute to the success of SRE and DevOps, develop expertise in new technologies, and work with developers, researchers, data scientists, and security experts.

Responsibilities include designing, building, and operating reliable, secure cloud infrastructure, ensuring applications are production-ready, scalable, and reliable, and developing tools and automation frameworks. You'll also automate robust deployment of robust services, orchestrate end-to-end monitoring and alerting, and participate in the on-call rotation.

Qualifications include 4+ years of experience as an engineer in infrastructure, operations, DevOps, or system engineering, 3+ years of building high availability, scalable cloud-native applications on AWS or GCP, and a BS or MS in Computer Science or a related field. Expertise in configuration management with a framework such as Ansible, Terraform, or Helm is required, as well as a passion for infrastructure and monitoring as code.

We offer a competitive compensation package, including a starting base salary of $203,500/YR, restricted stock units, and a bonus. Our employee benefits include a FLEXBenefits wellbeing spending account, mental and financial health resources, and personalized learning opportunities.

We're committed to providing reasonable accommodations for all qualified individuals with a disability. Palo Alto Networks is an equal opportunity employer, and we celebrate diversity in our workplace. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.



  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RoleWe are seeking a highly skilled Senior Staff Site Reliability Engineer to join our team at Palo Alto Networks. As a key member of our Cloud Infrastructure team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure.Our ideal candidate will have a strong background in cloud computing, with...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job OverviewPalo Alto Networks is seeking a highly skilled Cloud Infrastructure Engineer to join our CDL/SLS team. As a Senior Staff Site Reliability Engineer, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure.Our team is at the forefront of innovation, constantly pushing the boundaries of what is...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team. As a key member of our infrastructure team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure.Key Responsibilities:Develop expertise in new technologies and contribute to the...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RoleWe are seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team at Palo Alto Networks. As a key member of our team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure.Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab CI/CD, GitOps,...


  • Santa Clara, California, United States NVIDIA Full time

    As a Senior Manager in Site Reliability Engineering (SRE) at NVIDIA, you will lead a team dedicated to the design, construction, and maintenance of expansive production systems, emphasizing high efficiency and availability. This role spans various domains, including software and systems engineering, cloud-scale storage, data management, and services. SRE...


  • Santa Clara, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineeringWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in AI, machine learning, and datacenter acceleration. Our company is expanding its leadership into datacenter networking with ethernet switches, NICs, and DPUs. We have continuously reinvented ourselves over two decades, with our invention of the GPU in 1999 sparking the growth of the PC gaming market, redefining modern computer graphics,...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, maintaining, and scaling production services and server farms within our FedRAMP SASE product portfolio.Key ResponsibilitiesDesign and implement scalable and reliable...


  • Santa Clara, California, United States Nvidia Full time

    Senior Reliability EngineerNVIDIA is seeking a highly skilled Senior Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for planning and implementing the qualifications of new NVIDIA products, including IC chips in AI, Mobile, Automotive, Deep Learning, Graphic Processor, and System on Chip sectors.Key...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining scalable and reliable infrastructure for our cloud-based products.Key Responsibilities:Design and implement scalable and reliable infrastructure for...


  • Santa Clara, California, United States XPENG Motors Full time

    Job Title: Senior Staff AI Infrastructure SREXpeng Motors is a leading smart electric vehicle company that designs, develops, and manufactures cutting-edge EVs with advanced Internet, AI, and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers.About the...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Principal Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure.You will work closely with our development team to ensure that applications are production-ready,...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Cloud Reliability EngineerWe are seeking a highly motivated Senior Cloud Reliability Engineer to join our Embedded organization.This team is responsible for automating, deploying, and maintaining infrastructure for various NVIDIA AI workflows and applications such as Metropolis, ACE, and Riva hosted in the cloud.The Senior Cloud Reliability...


  • Santa Clara, California, United States NVIDIA Full time

    At NVIDIA, we're seeking a highly skilled Senior Cloud Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you'll be responsible for designing, building, and maintaining large-scale production systems with high efficiency and availability.This is a highly specialized discipline that demands knowledge across...


  • Santa Clara, California, United States Anello Photonics Full time

    About Anello Photonics:Anello Photonics is a leading-edge technology company based in Santa Clara, CA. The company has developed integrated photonic system-on-chip technology for next-generation navigation. ANELLO's SIPHOGTM gyroscope is based on its patented photonic integrated circuit technology. The result is a product that is higher performance, much...


  • Santa Barbara, California, United States Invoca Full time

    About Invoca:Invoca is a leading provider of AI-powered Conversation Intelligence solutions. With a strong focus on innovation and customer satisfaction, we're shaping the future of customer engagement. Our team is passionate about delivering exceptional results and making a lasting impact.About the Role:We're seeking a highly skilled Site Reliability...


  • Santa Clara, California, United States NVIDIA Full time

    Reliability Engineer Job DescriptionNVIDIA is a leader in the field of computer graphics, PC gaming, and accelerated computing. We are seeking a highly skilled Reliability Engineer to join our team.Key Responsibilities:Develop, debug, and manage test programs for the HTOL oven.Review and design HTOL board schematics for various ovens.Diagnose signal...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionYour CareerThe Global Customer Operation Team is responsible for building products that protect data, workloads, and infrastructure for some of the largest enterprise customers in the world.We help our customers in their journey to the public cloud by ensuring they have the best in class protection.The public cloud market has been growing at a...


  • Santa Clara, California, United States Palo Alto Networks Full time

    We are revolutionizing the cybersecurity landscape with our cloud-delivered security services, and our cloud infrastructure is rapidly expanding globally.We're seeking experienced SREs and software engineers interested in production engineering to help us scale the world's largest enterprise security cloud infrastructure.Palo Alto Networks has transformed...


  • Santa Clara, California, United States Omega Solutions Full time

    Job Title: Senior Systems Administrator/ Systems EngineerJob Summary:Omega Solutions is seeking a highly skilled Senior Systems Administrator/ Systems Engineer to join our team. The ideal candidate will have expertise in Active Directory and M365 infrastructure, as well as strong leadership and communication skills.Key Responsibilities:Lead the...