Principal Site Reliability Engineer

3 weeks ago


Santa Clara, United States Palo Alto Networks Full time
Company Description

To comply with U.S. federal government requirements, U.S. citizenship is required for this position

Our Mission

At Palo Alto Networks® everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.

Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Our Approach to Work

We lead with personalization and choice in all of our people programs. We have disrupted the traditional view that all employees have the same needs and wants. We offer our employees the opportunity to choose what works best for them as often as possible - from your wellbeing support to your growth and development, and beyond

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work from the office with some flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. While details may evolve, our goal is to create an environment where innovation thrives, with office-based teams coming together three days a week to collaborate on the industry's best cybersecurity solutions together

Job Description

Your Career

The Global Customer Operation Team is responsible for building products that protect data, workloads, and infrastructure for some of the largest enterprise customers in the world. We help our customers in their journey to the public cloud by ensuring they have the best in class protection. The public cloud market has been growing at a very rapid rate for the last few years. As more and more enterprises leverage public cloud, there is an insatiable demand for securing workloads in public cloud. With the recent acquisition of two leading companies in this space - RedLock and Evident.io, Palo Alto Networks is the market leader in this space.

We are seeking development heavy Site Reliability Engineers to design, build, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio in. We want passionate engineers who bring new ideas in all facets of DevOps. We are looking for leaders who take ownership of their areas of focus and who are driven to solve problems at every level. Collaboration and partnership are at the foundation of our culture and we need engineers who can communicate at a high level and work as a team towards achieving a common goal.

Your Impact
  • Build Terraform code and terragrunt terragrunt
  • Build automation work follow using python or go code
  • Build BGP and networking monitoring/ remediation tools
  • Engage with customers on escalations to provide remediation
  • Software Architecture and Scalability
    • Design and enhance software architecture to improve scalability in networking like BGP, OSPF, service reliability, capacity, and performance
    • Collaborate with development teams to ensure applications align with infrastructure requirements, focusing on scalability and reliability
  • Automation and Infrastructure Provisioning
    • Write automation code for provisioning and operating infrastructure at a massive scale
    • Work with Dev/QA teams to build pipelines and automation for delivering and deploying applications to production
  • On-call Support and Incident Resolution
    • Participate in occasional on-call rotations to support the infrastructure
    • Investigate incidents, formulate hypotheses, and identify root causes to solve issues promptly
    • Write postmortem reviews and provide remediation recommendations
  • Cross-Functional Collaboration
    • Provide technical assistance to Systems Administrators (SA), Systems Engineers (SE), Customer Support (CS), and Professional Services (PS) teams regarding the product
    • Identify missing product features and communicate them to the Product Management (PM) teams
  • Customer Interaction and Collaboration
    • Work with external parties and clients, participating in Proof of Concept (POC) and Proof of Value (POV) activities with SEs and SASE architects for customers
    • Conduct customer training sessions and technical webinars
    • Identify gaps and collaborate with PMs to make features accessible to customers
  • Continuous Improvement
    • Collaborate with PMs to characterize new features and establish a vision for the product's evolution
    • Actively seek ways to enhance the infrastructure, streamline processes, and improve overall system efficiency
Qualifications

Your Experience
  • Bachelor's or higher degree in Computer Science, Engineering, or related field or equivalent military experience required
  • CCIE in switching, routing
  • Strong knowledge of IPv6, and Nat64, IPv6 subnetting
  • Proven experience in designing, implementing, and maintaining scalable and reliable infrastructure
  • Strong proficiency in automation scripting and infrastructure as code (IaC)
  • Excellent problem-solving skills and the ability to troubleshoot complex issues
  • Effective communication skills, both written and verbal
  • Experience working in collaborative, cross-functional environments
  • Demonstrated ability to lead and mentor teams
  • Python/Go programming
Additional Requirements
  • Availability for occasional on-call support
  • Willingness to engage with customers directly and represent the technical aspects of the product


Additional Information

The Team

As a member of the SRE team, you will work on producing mission-critical platforms, tools, and processes that will ensure the highest levels of availability and reliability of all our applications. We need creative and innovative problem solvers who can partner with our Application development teams to make their services more usable. Our SRE team is furnished with a standout opportunity to build tools, frameworks, and cloud platforms that will support our company's growth over the next decade. If you are a self-starter and jump on new ideas to make the platform more stable, secure and feature-rich, this is your new career.

Our Commitment

We're trailblazers that dream big, take risks, and challenge cybersecurity's status quo. It's simple: we can't accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/com-missioned roles) is expected to be between $144,200/yr to $233,200/yr. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

#LI-TD1

Is role eligible for Immigration Sponsorship?: No. Please note that we will not sponsor applicants for work visas for this position.

  • Santa Clara, California, United States Palo Alto Networks Full time

    About the RolePalo Alto Networks is seeking a highly skilled Principal Site Reliability Engineer to join our team. As a key member of our Global Customer Operation Team, you will be responsible for designing, building, maintaining, and scaling production services and server farms within our FedRAMP SASE product portfolio.Key ResponsibilitiesDesign and...


  • Santa Clara, California, United States Diverse Lynx Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications and infrastructure.Key ResponsibilitiesDesign, implement, and maintain cloud infrastructure on...


  • Santa Clara, United States Palo Alto Networks Full time

    Job Description Your Career The Global Customer Operation Team is responsible for building products that protect data, workloads, and infrastructure for some of the largest enterprise customers in the world. We help our customers in their journey to the public cloud by ensuring they have the best in class protection. The public cloud market has been...


  • Santa Clara, United States Palo Alto Networks Full time

    Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking...


  • Santa Clara, United States Veear Full time

    Position: Site Reliability Engineer Location: Remote role Duration: 12+ Months Contract with possible extension Job Description: We seek development-heavy Site Reliability Engineers to design, build, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio. We want passionate engineers who bring new ideas to all...


  • Santa Clara, United States VeeAR Projects Inc. Full time

    Position: Site Reliability EngineerLocation: Remote roleDuration: 12+ Months Contract with possible extensionJob Description: We seek development-heavy Site Reliability Engineers to design, build, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio. We want passionate engineers who bring new ideas to all facets...


  • Santa Clara, United States VeeAR Projects Inc. Full time

    Position: Site Reliability EngineerLocation: Remote roleDuration: 12+ Months Contract with possible extensionJob Description: We seek development-heavy Site Reliability Engineers to design, build, maintain, and scale production services and server farms within our FedRAMP SASE product portfolio. We want passionate engineers who bring new ideas to all facets...


  • Santa Clara, United States NVIDIA Full time

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and outstanding people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers,...


  • Santa Clara, United States Palo Alto Networks Full time

    Job DescriptionJob DescriptionCompany DescriptionTo comply with U.S. federal government requirements, U.S. citizenship is required for this positionOur MissionAt Palo Alto Networks® everything starts and ends with our mission:Being the cybersecurity partner of choice, protecting our digital way of life.Our vision is a world where each day is safer and more...


  • Santa Clara, United States Centrify Corporation Full time

    Our software runs on public clouds with 99.9% or better uptime and is mission critical for our customers. Our cloud operations team is where the rubber meets the road and needs innovative Site Reliability Engineers. Join a professional team of smart and hard-working professionals building enterprise-class cloud-based services in the rapidly growing market of...


  • Santa Clara, United States Palo Alto Networks Full time

    To comply with U.S. federal government requirements, U.S. citizenship is required for this positionOur MissionAt Palo Alto Networks® everything starts and ends with our mission:Being the cybersecurity partner of choice, protecting our digital way of life.Our vision is a world where each day is safer and more secure than the one before. We are a company...


  • Santa Clara, United States Palo Alto Networks Full time

    To comply with U.S. federal government requirements, U.S. citizenship is required for this positionOur MissionAt Palo Alto Networks® everything starts and ends with our mission:Being the cybersecurity partner of choice, protecting our digital way of life.Our vision is a world where each day is safer and more secure than the one before. We are a company...


  • Santa Clara, United States Palo Alto Networks Full time

    We are reshaping the cybersecurity market through our cloud-delivered security services, and our cloud infrastructure is quickly and massively growing with a global footprint. We’re looking for great SREs, as well as software engineers interested in production engineering, to help us scale the largest enterprise security cloud infrastructure in the...


  • Santa Clara, California, United States OMNIVISION Full time

    Job Overview We are seeking a Staff Reliability Engineer to join our team at OMNIVISION. The ideal candidate will possess a strong educational background and relevant experience in the field of reliability engineering. Qualifications: A Bachelor’s degree in Physics, Electrical Engineering, Materials Science, or a related engineering field, with...


  • Santa Clara, California, United States Promote Project Full time

    About Promote Project: Promote Project is a leader in innovative technology solutions, dedicated to pushing the boundaries of what is possible in the realm of artificial intelligence and cloud computing. Our commitment to excellence is reflected in our talented workforce and our pursuit of groundbreaking advancements.Position Overview: We are seeking a...


  • Santa Clara, California, United States Promote Project Full time

    About the Company: Promote Project is at the forefront of innovation, leveraging cutting-edge technology to redefine the landscape of AI and computing. Our mission is to harness the power of advanced computing to create transformative solutions that impact various industries.Position Overview: We are seeking a Manager of Site Reliability Engineering to...


  • Santa Clara, California, United States Veear Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Veear. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based systems.Key ResponsibilitiesCollaboration and PartnershipPartner with cross-functional teams to ensure security...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job OverviewCompany OverviewTo comply with U.S. federal government requirements, U.S. citizenship is required for this position.Our MissionAt Palo Alto Networks, our mission is clear:To be the cybersecurity partner of choice, safeguarding our digital existence.We envision a world where each day is safer and more secure than the last. Our foundation is built...


  • Santa Clara, California, United States OMNIVISION Full time

    Position Overview We are seeking a Staff Reliability Engineer to join our team at OMNIVISION. The ideal candidate will possess a strong educational background and relevant experience in the field of reliability engineering, particularly in semiconductor technologies. Qualifications: A Bachelor’s degree in Physics, Electrical Engineering,...


  • Santa Clara, California, United States ServiceNow Full time

    Company OverviewAt ServiceNow, we harness technology to create a better world for everyone, driven by our talented workforce. We prioritize speed and innovation to meet the demands of our customers and communities.Joining ServiceNow means becoming part of a dynamic team of innovators who possess a relentless curiosity and a commitment to creativity.We...