Current jobs related to Manager of Site Reliability - Pittsburgh - FTSi.Tech


  • Pittsburgh, United States FTSi.Tech Full time

    Manager Site Reliability Engineering Job DescriptionPosition Title: Manager Site Reliability EngineeringReports to: Director of Systems Engineering Position SummaryThis position is responsible managing the overall stability of customer engineering organization, facilitating a team of dedicated engineers while coordinating with stakeholders in development,...


  • Pittsburgh, Pennsylvania, United States PNC Financial Services Group Full time

    Job Profile Position Overview At PNC Financial Services Group, our workforce is our most significant differentiator and competitive edge in the markets we operate. We are united in our commitment to delivering exceptional experiences for our clients. We collaborate daily to cultivate an inclusive workplace culture where all employees feel respected, valued,...


  • Pittsburgh, United States The PNC Financial Services Group, Inc Full time

    Position Overview At PNC, our people are our greatest differentiator and competitive advantage in the markets we serve. We are all united in delivering the best experience for our customers. We work together each day to foster an inclusive workplace culture where all of our employees feel respected, valued and have an opportunity to contribute to the...


  • Pittsburgh, Pennsylvania, United States The PNC Financial Services Group Full time

    Position OverviewAt The PNC Financial Services Group, our workforce is our most significant differentiator and competitive edge in the markets we serve. We are united in our commitment to providing the best experience for our clients.We collaborate daily to cultivate an inclusive workplace culture where all employees feel respected, valued, and empowered to...


  • Pittsburgh, Pennsylvania, United States Edge Case Research Full time

    Join Edge Case Research as a Site Reliability EngineerWe are a cutting-edge company specializing in autonomous system safety. Our team of experts is dedicated to developing products that ensure the safety of autonomous systems in various industries. We are currently expanding our DevOps and Site Reliability Team and looking for a skilled engineer to join...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems Full time

    Reference #: Basic Qualifications: A Bachelor's degree in Computer Science or a related discipline, or equivalent experience, is essential, along with a minimum of 10 years of pertinent experience; alternatively, a Master's degree with 8 years of relevant experience is acceptable.Clearance Requirements: Candidates must have the ability to obtain a Department...


  • Pittsburgh, United States The Bank of New York Mellon Full time

    Senior Vice President, Site Reliability/DevOps Engineer (Dev Infrastructure Platform) (Senior Vice President, Technical Product Specialist and App Delivery) At BNY, our culture empowers you to grow and succeed. As a leading global financial services company at the center of the world's financial system we touch nearly 20% of the world's investible assets....


  • Pittsburgh, Pennsylvania, United States Google Full time

    About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.SRE ensures that Google Cloud's services both our internally critical and our externally-visible systems have reliability, uptime appropriate to customer's needs and a fast rate of...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems, Inc Full time

    Essential Qualifications:Educational Background:A Bachelor's degree in Systems Engineering or a related field in Science, Engineering, or Mathematics is required. Additionally, 10+ years of relevant experience is necessary, or a Master's degree accompanied by 8 years of relevant experience. Experience in Agile methodologies is preferred.Security...

  • Site Manager

    3 months ago


    Pittsburgh, United States Housing Authority of the City of Pittsburgh Full time

    Job DescriptionJob DescriptionSummaryThe primary purpose of this position is to direct all facets of business at HACP-operated sites. The incumbent will manage property management employees, including Assistant Site Manager, ensure that all procedures and units are compliant with HUD, state, local, and HACP regulations, and enforce leasing agreements and...


  • Pittsburgh, United States Google Full time

    About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.SRE ensures that Google Cloud's services both our internally critical and our externally-visible systems have reliability, uptime appropriate to customer's needs and a fast rate of...


  • Pittsburgh, Pennsylvania, United States Build Partners Recruitment Limited Full time

    OverviewAbout Us:Build Partners Recruitment Limited is proud to collaborate with a reputable family-owned General Contractor renowned for its commitment to excellence, integrity, and client satisfaction. They specialize in multi-residential developments and emphasize a nurturing and cooperative work atmosphere, providing clear career advancement...

  • Reliability Engineer

    4 weeks ago


    Pittsburgh, Pennsylvania, United States Philips Full time

    Job TitleReliability EngineerJob DescriptionDevelop the reliability strategy for the Philips Sleep & Respiratory CareBusiness.Please note: Due to the consolidation of our sites, this role will start at the Bakery Square office in Pittsburgh, Pennsylvania and will move to another site in the greater Pittsburgh area by the end of the year. This change will...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems, Inc Full time

    Job SummaryWe are seeking a highly skilled Senior Principal Site Reliability Engineer to join our team at General Dynamics Mission Systems, Inc. As a key member of our cross-functional team, you will be responsible for ensuring the survivability and reliability of mission-critical resources.Key ResponsibilitiesEnsure uptime of critical systems and...

  • Sr. Software Engineer

    1 month ago


    Pittsburgh, United States Comcast Corporation Full time

    FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we’re making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...

  • Sr. Software Engineer

    2 weeks ago


    Pittsburgh, United States Comcast Corporation Full time

    FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we’re making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...


  • Pittsburgh, United States Aurora Innovation Full time

    Who We Are Aurora (Nasdaq: AUR) is delivering the benefits of self-driving technology safely, quickly, and broadly to make transportation safer, increasingly accessible, and more reliable and efficient than ever before. The Aurora Driver is a self-driving system designed to operate multiple vehicle types, from freight-hauling semi-trucks to ride-hailing...

  • Site Supervisor

    7 days ago


    Pittsburgh, Pennsylvania, United States City of Pittsburgh Full time

    Position OverviewThe City of Pittsburgh is seeking a dedicated and experienced Site Supervisor to oversee and manage personnel and teams responsible for the upkeep and maintenance of public spaces. This role is essential in ensuring that streets, parks, and recreational areas are well-maintained and safe for community use.Key ResponsibilitiesSupervise and...


  • Pittsburgh, United States NextGen | GTA: A Kelly Telecom Company Full time

    Small cell/New Build experience preferred. May require some travel to office/customer meetings, typically once per quarter.Responsible for all Site Acquisition and associated activities on given projects. From propagation model and RF design to acceptance of the NTP. They must ensure that all Service Providers are adhering to the processes and procedures,...


  • Pittsburgh, United States NextGen | GTA: A Kelly Telecom Company Full time

    Small cell/New Build experience preferred. May require some travel to office/customer meetings, typically once per quarter.Responsible for all Site Acquisition and associated activities on given projects. From propagation model and RF design to acceptance of the NTP. They must ensure that all Service Providers are adhering to the processes and procedures,...

Manager of Site Reliability

1 month ago


Pittsburgh, United States FTSi.Tech Full time

Manager Site Reliability Engineering Job Description

Position Title: Manager Site Reliability Engineering

Reports to: Director of Systems Engineering

Position Summary

This position is responsible managing the overall stability of customer engineering organization, facilitating a team of dedicated engineers while coordinating with stakeholders in development, infrastructure, product, and leadership. This position is responsible for managing the stability of the website and store fleet on incident occurrence, as well as identifying how we can be better in the future. The manager of the Site Reliability Engineering team has the opportunity to develop processes and technological solutions to address site stability, and will have full control over the direction of the stability roadmap.

Responsibilities

• Manage Site Reliability engineering roadmap, backlog and active triages to ensure team is delivering on both the proactive and reactive stability needs of the customer engineering organization

• Deliver on tactical decisions while maintaining quality of day to day activities through effective management of full time and contract resources.

• Define day to day tasks and projects for team members, track and manage the delivery of work.

• Communicate effectively with leadership, cross functional partners, and individual contributors through verbal and written communication regarding incidents, followup, and team deliverables

• Maintain and enhance stability benchmarks that reflect overall stability of the site through KPIs, SLAs, SLOs, and SLIs and report on these metrics regularly

• Identify opportunities for process, people, technological improvement in the stability organization and formalize plans to execute on these improvements

• Reduce manual tasks through automation, process improvement, training, or elimination of manual need

• Mentor individual contributors to achieve technical maturity and personal growth

• Participate in business critical incident events and facilitate coordination, communication, and resolution as well as incident followup and prevention

• Partner with development team to understand applications and features will impact overall stability of site and introduce or modify monitoring and operational processes to meet these need

• Partner with cross-functional teams to identify and mitigate risks to system reliability and ensure application stability

Qualifications

• Experience as Engineering Lead / Manager (Infrastructure, SRE, Devops, Development, Incident Management)

• Experience in business critical technical incident triage and troubleshooting

• Expertise in monitoring tools and technologies (New Relic, Datadog, Dynatrace, Splunk, Elk, Google Observability) and their usage in triage and problem investigation

• Experience in automation tools (Ansible, Chef, Puppet, Terraform)

• Understanding of cloud platforms (AWS, GCP, Azure)

• Effective verbal/written communication to technical and non technical audiences

• Demonstrated hands-on experience and expertise, understanding of software development, testing, deployments, project management methodologies

• Experience in developing and executing plans, meeting deadlines and operating under tight time constraints

• Demonstrated ability to anticipate, mitigate, and resolve technical challenges across numerous disciplines