Staff Service Reliability Engineer

1 week ago


Pittsburgh, PA, USA | Proofpoint | Full time
About the Role

We are seeking a highly skilled Staff Service Reliability Engineer to join our team at Proofpoint. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based security products.

Key Responsibilities
  • Develop a deep understanding of our cloud infrastructure and services
  • Contribute to the architecture and design of scalable and reliable systems
  • Provision, maintain, and scale production services and server farms across a hybrid cloud environment
  • Work cross-functionally to improve automation and orchestration platforms
  • Build long-lasting partnerships with Product, Engineering, and Operations teams
  • Organize and manage multiple simultaneous projects
  • Lead by example and establish credibility with the quality of your technical execution
  • Mentor junior members of the team and foster a service ownership mentality
Requirements
  • Demonstrable skills and 10+ years' experience managing, troubleshooting, and tuning Linux systems
  • Experience working in a high-volume, large deployment, multi-datacenter/multi-cloud environment
  • Experience automating management of systems and applications using common frameworks, platforms, and coding languages
  • Experience with observability technologies such as OpenTelemetry, Prometheus, or similar
  • Deep experience with industry-standard foundation technologies such as TCP/IP, HTTP, DNS, SMTP, and LDAP
  • Experience in management of a large distributed computing environment
  • Experience with common virtualization platforms – KVM, VMware vSphere, ESX, ESXi, and vCenter
  • Experience with multiple cloud providers and technologies including AWS, GCP, and Azure
  • Experience with containers and container orchestration platforms such as Kubernetes
  • Excellent verbal and written communication skills
  • Experience with monitoring and alerting systems
  • Experience with industry-standard operational practices such as change management, incident management, and working in colocation datacenters
  • Extensive experience with configuration management such as Puppet or Chef
  • Experience with load-balancing technologies – F5, NetScaler, or similar
About Proofpoint

Proofpoint is a leading cybersecurity company protecting organizations' greatest assets and biggest risks: vulnerabilities in people. We are committed to creating a diverse, equitable, and inclusive environment where our employees feel valued and empowered to grow.

We are singularly devoted to helping our customers protect what matters most. That's why we're a leader in next-generation cybersecurity—and why more than half of the Fortune 100 trust us as a security partner.

We are an equal opportunity employer, and we hire without regard to race, religion, creed, color, national origin, age, gender, sexual orientation, marital status, veteran status, or disability.


  • Reliability Engineer

    2 weeks ago


    Pittsburgh, PA, USA, United States Philips Full time

    About the Role: We are seeking a highly skilled Reliability Engineer to join our team at Philips. As a Reliability Engineer, you will be responsible for ensuring the reliability and quality of our medical devices. This is a challenging and rewarding role that requires a strong understanding of reliability engineering principles and practices. Key...

  • Reliability Engineer

    2 weeks ago


    Pittsburgh, PA, USA, United States Philips Full time

    About the Role: We are seeking a highly skilled Reliability Engineer to join our team at Philips. As a key member of our reliability team, you will be responsible for ensuring the reliability and quality of our medical technology products. Key Responsibilities: Develop and implement reliability testing strategies to ensure product reliability and...


  • Pittsburgh, PA, USA, United States Aurora Innovation Full time

    Job Title: InfoSec Site Reliability Engineer. Aurora Innovation is seeking a highly skilled InfoSec Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the integrity and availability of our enterprise fleet of Ubuntu, Mac, and Windows laptops, and InfoSec/Enterprise infrastructure services. Key...


  • Pittsburgh, Pennsylvania, United States EZ Texting Full time

    About the Role: We are seeking a highly skilled Staff Site Reliability Engineer to join our team at EZ Texting. As a Staff SRE, you will be responsible for ensuring the reliability and performance of our telecom and SMS infrastructure. This is a critical role that requires a deep understanding of telecom and messaging systems, as well as excellent...


  • Market St #, San Francisco, CA, USA, United States Airbnb Full time

    About the Role: Airbnb is seeking a Staff Software Engineer to join our Site Reliability Engineering team. As a Staff Software Engineer in SRE, you will be responsible for developing and maintaining the tools and systems that enable our engineering teams to operate our services reliably and at scale. Key Responsibilities: Design, implement, and maintain the tools...

  • Reliability Engineer

    2 weeks ago


    Pittsburgh, Pennsylvania, United States Philips Full time

    Job Title: Reliability Engineer. We are seeking a skilled Reliability Engineer to join our team at Philips. As a Reliability Engineer, you will play a critical role in ensuring the quality and reliability of our medical technology products. About the Role: As a Reliability Engineer, you will be responsible for developing and implementing testing strategies to...


  • Pittsburgh, Pennsylvania, United States Philips North America Full time

    Job Title: Reliability Engineer. Join Philips North America as a Reliability Engineer and contribute to the development of innovative medical technology solutions. About the Role: We are seeking a skilled Reliability Engineer to work cross-functionally with our team to develop and improve the reliability program for our products. As a key member of our team, you...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems Full time

    Job Title: Site Reliability Engineer. At General Dynamics Mission Systems, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the uptime and reliability of mission-critical resources. Responsibilities: Ensuring the uptime of critical systems. Automating systems...


  • Peachtree Corners, GA, USA, United States Insight Global Full time

    Responsibilities: Reliability Strategy Development. Develop and implement reliability and maintenance strategies to optimize plant performance. Establish reliability objectives and key performance indicators (KPIs) for critical equipment and systems. Collaborate with other departments to align reliability goals with overall plant objectives. Maintenance and...


  • Pittsburgh, Pennsylvania, United States General Dynamics Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at General Dynamics Mission Systems. As a Site Reliability Engineer, you will be responsible for maintaining the survivability and reliability of mission critical resources. Key Responsibilities: Ensure uptime of critical systems. Automate systems administration...


  • Newton, MA, USA, United States CyberArk Full time

    About CyberArk: CyberArk is the global leader in Identity Security, providing the most comprehensive security offering for any identity - human or machine - across business applications, distributed workforces, hybrid cloud workloads, and throughout the DevOps lifecycle. The world's leading organizations trust CyberArk to help secure their most critical...


  • Pittsburgh, Pennsylvania, United States General Dynamics Corporation Full time

    Job Summary: We are seeking a skilled Site Reliability Engineer to join our team at General Dynamics Mission Systems. As a key member of our cross-functional team, you will be responsible for ensuring the reliability and survivability of our mission-critical resources. Key Responsibilities: Ensure uptime of critical systems and automate systems administration...


  • Pittsburgh, Pennsylvania, United States INEOS Full time

    About the Role: We are seeking a highly skilled Asset Engineer to join our team at INEOS. As an Asset Engineer, you will play a critical role in ensuring the safe and efficient operation of our plant. Key Responsibilities: Develop and implement strategies to improve employee safety and health, environmental impact, and security. Ensure compliance with corporate...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at General Dynamics Mission Systems. As a key member of our cross-functional team, you will be responsible for maintaining the survivability and reliability of mission-critical resources. Key Responsibilities: Ensure uptime of critical systems and infrastructure. Automate...


  • Pittsburgh, Pennsylvania, United States PNC Full time

    Job Summary: PNC is seeking a highly skilled System Reliability Engineer to join our team. As a key member of our IT organization, you will be responsible for ensuring the stability and compliance of our systems and software. Key Responsibilities: Lead and participate in the monitoring, maintenance, support, and modernization of systems and software to ensure...

  • Reliability Engineer

    4 weeks ago


    Pennsylvania, USA, United States Gable Search Group Full time

    Job Title: Reliability Engineer. Job Summary: Gable Search Group is seeking a highly skilled Reliability Engineer to join our team. As a key member of our organization, you will be responsible for ensuring the reliability and sustainability of our equipment and processes. Key Responsibilities: Equipment Reliability: Analyze equipment failure data...


  • Pittsburgh, Pennsylvania, United States The PNC Financial Services Group, Inc Full time

    Job Title: Senior Site Reliability Engineer. At The PNC Financial Services Group, Inc, we are committed to delivering exceptional customer experiences. As a Senior Site Reliability Engineer, you will play a critical role in ensuring the stability and performance of our technology infrastructure. Key Responsibilities: Coordinate responses with the Site...


  • Pittsburgh, Pennsylvania, United States General Dynamics Full time

    Job Summary: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at General Dynamics Mission Systems. As a key member of our cross-functional team, you will be responsible for maintaining the survivability and reliability of mission-critical resources. Key Responsibilities: Ensure uptime of critical systems and infrastructure. Automate...


  • Pittsburgh, Pennsylvania, United States TEKsystems Full time

    Unlock Your Potential as a Site Reliability Engineer. TEKsystems is a leader in IT solutions, partnering with top clients to drive transformation. We're seeking a talented Site Reliability Engineer to join our team and contribute to the success of a top American bank holding company and financial services corporation. About the Role: As a Site Reliability...


  • Pittsburgh, Pennsylvania, United States General Dynamics Corporation Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at General Dynamics Mission Systems. As a Site Reliability Engineer, you will be responsible for ensuring the uptime and reliability of mission-critical resources. Your expertise in system administration, automation, and troubleshooting will be essential in maintaining the...