Current jobs related to Lead Site Reliability Engineer - Santa Clara, California - Palo Alto Networks


  • Santa Clara, California, United States Cryptoware Technologies Inc Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Cryptoware Technologies Inc. As a Site Reliability Engineer, you will be responsible for leading the effort of global expansion of Huobi globe-spanning infrastructure.Key Responsibilities:Lead the effort of global expansion of Huobi...


  • Santa Clara, California, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement, and maintain cloud infrastructure on AWS,...


  • Santa Clara, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineeringWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using...


  • Santa Clara, California, United States Insight Global Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our Applications Infrastructure organization at NVIDIA. This team is responsible for designing, deploying, and maintaining infrastructure for various NVIDIA AI workflows and applications hosted in the cloud.Key Responsibilities:Develop and integrate new...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Site Reliability EngineerWe are seeking a highly motivated Site Reliability Engineer to join our Applications Infrastructure organization. This team is responsible for automating, deploying, and maintaining infrastructure for various NVIDIA AI workflows and applications such as Metropolis, ACE, and Riva hosted in the cloud.Key...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RolePalo Alto Networks is seeking a highly skilled Principal Site Reliability Engineer to join our team. As a Principal Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure to support our mission-critical platforms.Key ResponsibilitiesDesign and implement scalable and...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Palo Alto Networks. As a key member of our engineering team, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure.About the RoleThis is a unique opportunity to work with a...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RolePalo Alto Networks is seeking a highly skilled Principal Site Reliability Engineer to join our team. As a Principal Site Reliability Engineer, you will be responsible for designing, building, and operating reliable, secure cloud infrastructure. You will work closely with developers, researchers, data scientists, and security experts to ensure...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, maintaining, and scaling production services and server farms within our FedRAMP SASE product portfolio.Key ResponsibilitiesDesign and implement scalable and reliable...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based security solutions.Key ResponsibilitiesDesign, build, and maintain scalable and reliable infrastructure for our...


  • Santa Clara, California, United States NVIDIA Full time

    Unlock the Power of Cloud ServicesWe are seeking a highly motivated Site Reliability Engineer to join our Applications Infrastructure organization.This team is responsible for automating, deploying, and maintaining infrastructure for various NVIDIA AI workflows and applications such as Metropolis, ACE, and Riva hosted in the cloud.The SRE role focuses on...


  • Santa Clara, California, United States Syntricate Technologies Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key ResponsibilitiesDesign, implement, and maintain cloud infrastructure on AWS, including EC2, SSM,...


  • Santa Clara, California, United States Syntricate Technologies Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement, and maintain cloud infrastructure on AWS, including EC2,...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Global Customer Operation Team at Palo Alto Networks. As a Site Reliability Engineer, you will play a critical role in designing, building, maintaining, and scaling production services and server farms within our FedRAMP SASE product portfolio.Key ResponsibilitiesDesign and...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Global Customer Operations team at Palo Alto Networks. As a Site Reliability Engineer, you will play a critical role in designing, building, maintaining, and scaling production services and server farms within our FedRAMP SASE product portfolio.Key ResponsibilitiesDesign and...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RolePalo Alto Networks is seeking a highly skilled Principal Site Reliability Engineer to join our team. As a key member of our infrastructure platform, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure.Key ResponsibilitiesContribute to the success of SRE and DevOps teams by developing expertise...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in designing, building, and maintaining scalable and reliable infrastructure for our FedRAMP SASE product portfolio.Key ResponsibilitiesDesign and implement scalable and reliable...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job DescriptionPalo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our Global Customer Operation Team. As a Site Reliability Engineer, you will play a critical role in designing, building, maintaining, and scaling production services and server farms within our FedRAMP SASE product portfolio.Key ResponsibilitiesDesign and...


  • Santa Clara, California, United States Centrify Corporation Full time

    Cloud Site Reliability EngineerAt Centrify Corporation, we're seeking a skilled Cloud Site Reliability Engineer to join our Cloud DevOps team. As a key member of our operations team, you'll play a critical role in ensuring the uptime and delivery of our cloud-based services.Key Responsibilities:Manage our cloud application using DevOps and Agile practices to...

Lead Site Reliability Engineer

2 months ago


Santa Clara, California, United States Palo Alto Networks Full time
Job Overview

Company Overview

To comply with U.S. federal government requirements, U.S. citizenship is required for this position.

Our Mission

At Palo Alto Networks, our mission is clear:

To be the cybersecurity partner of choice, safeguarding our digital existence.

We envision a world where each day is safer and more secure than the last. Our foundation is built on challenging the status quo and we seek innovators who are dedicated to shaping the future of cybersecurity.

Our Work Philosophy

We prioritize personalization and choice in all our employee programs. We have redefined the traditional perspective that all employees share the same needs and desires. We empower our employees to select what suits them best - from wellness support to professional growth and beyond.

At Palo Alto Networks, we recognize the importance of collaboration and value face-to-face interactions. This is why our employees primarily work from the office, with some flexibility when necessary. This environment encourages informal discussions, problem-solving, and building trust. While specifics may evolve, our aim is to cultivate a space where innovation flourishes, with teams coming together three days a week to collaborate on leading cybersecurity solutions.

Your Career Path

The Global Customer Operation Team is tasked with developing products that secure data, workloads, and infrastructure for some of the largest enterprises globally. We assist our clients in their transition to the public cloud by ensuring they receive top-tier protection. The public cloud sector has been expanding rapidly in recent years, leading to an unquenchable demand for securing workloads. Following our acquisition of two leading firms in this domain, Palo Alto Networks has established itself as the market leader.

We are in search of development-focused Site Reliability Engineers to design, construct, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio. We seek enthusiastic engineers who contribute innovative ideas across all aspects of DevOps. We value leaders who take ownership of their responsibilities and are motivated to address challenges at every level. Collaboration and partnership are central to our culture, and we require engineers who can communicate effectively and work cohesively towards shared objectives.

Your Contributions
  • Develop Terraform code and terragrunt scripts.
  • Create automation workflows using Python or Go.
  • Design BGP and networking monitoring/remediation tools.
  • Engage with clients on escalations to provide solutions.
  • Software Architecture and Scalability
    • Enhance software architecture to boost scalability in networking protocols like BGP and OSPF, ensuring service reliability, capacity, and performance.
    • Collaborate with development teams to align applications with infrastructure requirements, emphasizing scalability and reliability.
  • Automation and Infrastructure Provisioning
    • Write automation scripts for provisioning and managing infrastructure at scale.
    • Partner with Dev/QA teams to establish pipelines and automation for application delivery and deployment.
  • On-call Support and Incident Management
    • Participate in on-call rotations to support infrastructure.
    • Investigate incidents, formulate hypotheses, and identify root causes for prompt resolution.
    • Draft postmortem reviews and provide recommendations for remediation.
  • Cross-Functional Collaboration
    • Offer technical support to Systems Administrators, Systems Engineers, Customer Support, and Professional Services teams regarding the product.
    • Identify product feature gaps and communicate them to Product Management teams.
  • Customer Engagement and Collaboration
    • Collaborate with external parties and clients, participating in Proof of Concept and Proof of Value activities with Systems Engineers and SASE architects.
    • Conduct customer training sessions and technical webinars.
    • Identify gaps and work with Product Managers to enhance feature accessibility for customers.
  • Continuous Improvement
    • Work with Product Managers to define new features and establish a vision for product evolution.
    • Actively seek opportunities to enhance infrastructure, streamline processes, and improve overall system efficiency.
Your Qualifications
  • Bachelor's degree or higher in Computer Science, Engineering, or a related field, or equivalent military experience.
  • CCIE certification in switching and routing.
  • Strong understanding of IPv6, Nat64, and IPv6 subnetting.
  • Proven experience in designing, implementing, and maintaining scalable and reliable infrastructure.
  • Proficient in automation scripting and Infrastructure as Code (IaC).
  • Excellent problem-solving abilities and the capacity to troubleshoot complex issues.
  • Strong communication skills, both written and verbal.
  • Experience in collaborative, cross-functional environments.
  • Demonstrated ability to lead and mentor teams.
  • Proficiency in Python and Go programming.
Additional Requirements
  • Availability for occasional on-call support.
  • Willingness to engage directly with customers and represent the technical aspects of the product.
The Team

As a member of the Site Reliability Engineering team, you will contribute to the development of mission-critical platforms, tools, and processes that ensure the highest levels of availability and reliability for our applications. We seek creative and innovative problem solvers who can collaborate with our application development teams to enhance service usability. Our SRE team is presented with a unique opportunity to build tools, frameworks, and cloud platforms that will support our company's growth in the coming years. If you are a proactive individual eager to implement new ideas to enhance platform stability, security, and functionality, this could be your next career move.

Our Commitment

We are pioneers who think big, take risks, and challenge the norms of cybersecurity. We recognize that we cannot fulfill our mission without diverse teams innovating together.

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary is expected to be between $144,200/yr to $233,200/yr. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

Is this role eligible for Immigration Sponsorship?: No. Please note that we will not sponsor applicants for work visas for this position.