Sr. Site Reliability Engineer
4 days ago
Is this the role you are looking for If so read on for more details, and make sure to apply today.
Contract Type: Contract to hire
Location: Hybrid (Dallas Tx / Pittsburgh PA)
Must Have and Metrics Technical Skills:
Years of experience: 7+
Ability to collaborate with cross-functional teams, troubleshoot effectively, and proactively identify areas for improvement in network reliability and performance
Ansible Tower
BigPanda
Configuring, managing, and troubleshooting network performance and latency issues across complex, distributed systems
Dynatrace
Grafana
Network performance tuning and monitoring, with a deep understanding of network protocols and network optimization techniques
ThousandEyes
Extensive experience in network performance tuning and monitoring
Deep understanding of network protocols (e.g., TCP/IP, DNS, HTTP/S) and network optimization techniques.
Proficiency with Dynatrace and BigPanda for real-time monitoring, root cause analysis, and incident response; hands-on experience with these tools is required.
Strong background in configuring, managing, and troubleshooting network performance and latency issues across complex, distributed systems.
Experience with additional monitoring and observability tools like Thousand Eyes and Grafana.
Skilled in Ansible Tower for automation of network and system configurations.
Demonstrated ability to collaborate with cross-functional teams, troubleshoot effectively, and proactively identify areas for improvement in network reliability and performance.
Flex Skills/Nice to Have:
Proven experience in incident/problem management with a good understanding of any of the tools used for this purpose.
- Good understanding of both UNIX and Windows operating systems
- Good understanding of web hosting technologies like Apache / tomcat or other equivalent web/app servers.
- Good understanding of Big Data & cloud concepts.
- Good understanding of database technologies like ORACLE and SQL.
- Good understanding of monitoring tools is an added advantage.
- Solid understanding of the major functionality bundled into a release, both from a technology and business point of view.
- Strong knowledge of relevant applications and development life cycles.
- Experience working with geographically distributed and culturally diverse work-groups.
- Strong desire to learn new technology.
Roles and Responsibilities:
Monitor infrastructure, servers, middleware, databases, and batch jobs.
Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc.
Troubleshoot environment, data control and operational issues.
Create and Maintain documentation to ensure knowledge accessibility.
Automate and streamline process using scripts and scheduling tools.
Liaise with other application support teams and internal/external business and technical partners.
Provide ad hoc and on-demand reports.
Perform timely escalation of critical issues and proactively identify patterns of recurring issues to improve production.
Lead problem resolution and conduct root cause analysis and establish processes that will help incident prevention.
Participates in the Incident and Problem Management processes as a resolver accountable for root cause analysis, resolution and reporting.
Ensures that all production changes are processed according to Change Management policies and procedures.
Ensures that appropriate levels of Quality Assurance have been met for all new and existing products.
Support Sustained Resiliency, Disaster Recovery, and High Availability events.
Help Level 2 operation team with setting up monitoring and bridging the gaps in current monitoring setup.
Play key part in setting up reporting and be a key component in Monitor -> Report -> Improve principle
Coordinate incident management coverage, to ensure appropriate coverage.
Call facilitation, coordination and communications during critical outage situations.
Call documentation, queue management, ticket analysis and interface to impacting lines of business for incident impact analysis via the Production Assurance process.
End to end view of issues for objectivity.
Influence senior technology leads across organizations to ensure timely resolution of incidents
Problem Management:
Participate and ensure RCA (root cause analysis) activities on client impacting incidents are executed and action items are assigned / completed.
Provide expertise and support during critical incidents, interfacing with all impacted groups to better manage the message.
Chronic issue coordination and leadership.
Guidance to all staff involved and vendors in driving a coordinated approach for results.
Hygiene and Capacity Maintenance:
Responsible for data quality of PLM.
Work aggressively to make sure all servers are up to company standards as per uptimes, patch level etc.
Work on Capacity planning for applications, estimating and analyzing growth rates of vital infrastructure components and adding capacity pro-actively as and when required.
Understand application code, work flow and business usage of application.
Understand DB component of application.
Understand the impacts of application based on seasonality of critical applications.
Document known errors and play important role in Knowledge transfer to Level 1 team.
Reduce escalations to Level 3 based on incremental learning about applications.
Intended length of Assignment: 4/5/2025
Reason for open position: SRE/SRC Special Projects
Potential for Contract Extension: N/A
This position is contract with the right to hire if a need becomes available. Manager will only look at candidates that are open to converting to a full time PNC employee. PNC will not sponsor work visas if the decision is made to hire the contingent worker: YES
Initiatives/Projects: SRE / SRC Special Projects
Industry background: Technical
Soft Skills:
- Excellent communication skills, both verbal and written, with the ability to lead/manage large conference calls.
- Comfortable providing clear problem descriptions and guidance to business users in a time critical environment.
- Ability to be proactive with a strong bias for action, naturally inquisitive, and bias for continuous improvement of practices / processes.
- Excellent influence, negotiation and presentation skills.
- Experience in working with cross line of business teams, Outside Service Providers and Partner Organizations.
- Outstanding interpersonal skills and ability to establish strong relationships with all levels of management.
- Ability to work independently as a self-starter, and within a team environment.
Interview Process:
Logistics:
2 step interview
1st round with HM
2nd round panel ITV with engineering managers
-
Sr. Site Reliability Engineer
1 week ago
Pittsburgh, United States Sygna LLC Full timeJob Title: Sr. Site Reliability Engineer Contract Type: Contract to hireLocation: Hybrid (Dallasâ¯Tx / Pittsburghâ¯PA)â¯Must Have and Metrics Technical Skills: Years of experience: 7+
-
Sr. Site Reliability Engineer
1 week ago
Pittsburgh, United States Sygna LLC Full timeJob Title: Sr. Site Reliability Engineer Contract Type: Contract to hire Location: Hybrid (Dallas Tx / Pittsburgh PA) Must Have and Metrics Technical Skills: Years of experience: 7+ Ability to collaborate with cross-functional teams, troubleshoot effectively, and proactively identify areas for improvement in network reliability and performance...
-
Site Reliability Engineer Manager
4 weeks ago
Pittsburgh, Pennsylvania, United States PNC Full timeJob SummaryPNC is seeking a highly skilled Site Reliability Engineer Manager to join our team. As an SRE Group Manager, you will be responsible for leading a team of Site Reliability Engineers to ensure the reliability and performance of our applications and infrastructure.Key ResponsibilitiesLead a team of Site Reliability Engineers to design, implement,...
-
Site Reliability Engineering Group Manager
4 weeks ago
Pittsburgh, Pennsylvania, United States PNC Full timeJob DescriptionPosition OverviewPNC is a leading financial institution that values its people as its greatest differentiator and competitive advantage. We strive to deliver the best experience for our customers by fostering an inclusive workplace culture where all employees feel respected, valued, and empowered to contribute to the company's success.As a...
-
Site Reliability Engineering Group Manager
4 weeks ago
Pittsburgh, PA , USA, United States PNC Full timeJob SummaryPNC is seeking a highly skilled Site Reliability Engineering Group Manager to join our team. As a key member of our Site Reliability team, you will be responsible for managing teams of Site Reliability Engineers across multiple operating sites and applications to improve reliability, quality, and time-to-market of highly complex software...
-
Site Reliability Engineer
2 weeks ago
Pittsburgh, United States ConsultUSA Full timeDescription:Our client has an immediate need for a Site Reliability Engineer, who will be responsible for specializing in improving all aspects of reliability, acting as a conduit between infrastructure and application teams on support issues, and improving tools, automation, processes, and software.Requirements:Bachelor’s degree in Engineering, Computer...
-
Site Reliability Engineer
2 weeks ago
pittsburgh, United States ConsultUSA Full timeDescription:Our client has an immediate need for a Site Reliability Engineer, who will be responsible for specializing in improving all aspects of reliability, acting as a conduit between infrastructure and application teams on support issues, and improving tools, automation, processes, and software.Requirements:Bachelor’s degree in Engineering, Computer...
-
Site Reliability Engineer
2 weeks ago
pittsburgh, United States Rose International Full timeDate Posted: 11/08/2024Hiring Organization: Rose InternationalPosition Number: 474141Job Title: Site Reliability EngineerJob Location: Pittsburgh, PA, USA, 15222Work Model: HybridShift:Hybrid: 3 days in office / 2 remoteHours: 8 am to 5 pm CSTEmployment Type: Temp to HireEstimated Duration (In months): 6Min Hourly Rate($): 65.00Max Hourly Rate($): 70.00Must...
-
Site Reliability Engineer
2 weeks ago
Pittsburgh, United States Rose International Full timeDate Posted: 11/08/2024Hiring Organization: Rose InternationalPosition Number: 474141Job Title: Site Reliability EngineerJob Location: Pittsburgh, PA, USA, 15222Work Model: HybridShift:Hybrid: 3 days in office / 2 remoteHours: 8 am to 5 pm CSTEmployment Type: Temp to HireEstimated Duration (In months): 6Min Hourly Rate($): 65.00Max Hourly Rate($): 70.00Must...
-
FUll Stack Developer
2 months ago
Pittsburgh, United States Stefanini North America and APAC Full timeWe are seeking a talented Full Stack / Site Reliability Engineer to play a key role in developing a comprehensive Internal Developer Platform (IDP) that includes CI/CD pipelines, managed infrastructure, observability, and a developer portal. The primary focus of this role will be on ensuring the stability and scalability of the Internal Developer Platform...
-
Pittsburgh, United States BNY Full timeVice President, Site Reliability/DevOps Engineer (Dev Infrastructure Platform) (Vice President, Technical Product Specialist and App Delivery) At BNY, our culture empowers you to grow and succeed. As a leading global financial services company at the center of the world’s financial system we touch nearly 20% of the world’s investible assets. Every day...
-
Site Reliability Engineer
2 weeks ago
Pittsburgh, PA, United States ConsultUSA Full timeDescription:Our client has an immediate need for a Site Reliability Engineer, who will be responsible for specializing in improving all aspects of reliability, acting as a conduit between infrastructure and application teams on support issues, and improving tools, automation, processes, and software.Requirements:Bachelor’s degree in Engineering, Computer...
-
Site Reliability Engineer
2 weeks ago
Pittsburgh, PA, United States Rose International Full timeDate Posted: 11/08/2024Hiring Organization: Rose InternationalPosition Number: 474141Job Title: Site Reliability EngineerJob Location: Pittsburgh, PA, USA, 15222Work Model: HybridShift:Hybrid: 3 days in office / 2 remoteHours: 8 am to 5 pm CSTEmployment Type: Temp to HireEstimated Duration (In months): 6Min Hourly Rate($): 65.00Max Hourly Rate($): 70.00Must...
-
InfoSec Site Reliability Engineer
4 weeks ago
Pittsburgh, Pennsylvania, United States Aurora Innovation Full timeJob DescriptionAurora Innovation is seeking a highly skilled InfoSec Site Reliability Engineer to join our team. As a key member of our Client Platform Engineering group, you will be responsible for ensuring the integrity and availability of our enterprise fleet of Ubuntu, Mac, and Windows laptops, as well as our InfoSec/Enterprise infrastructure...
-
SR Staff Geotechnical Engineer
3 weeks ago
Pittsburgh, United States System One Full timeSystem One is currently seeking a Sr. Staff Geotechnical Engineer for an industry-leading client in the Pittsburgh, PA area.For a complete understanding of this opportunity, and what will be required to be a successful applicant, read on. Your Role: Directly lead and manage projects, including planning, engineering, and design; serve as Engineer-of-Record...
-
SR Staff Geotechnical Engineer
6 days ago
Pittsburgh, United States System One Full timeSystem One is currently seeking a Sr. Staff Geotechnical Engineer for an industry-leading client in the Pittsburgh, PA area.For a complete understanding of this opportunity, and what will be required to be a successful applicant, read on. Your Role: Directly lead and manage projects, including planning, engineering, and design; serve as Engineer-of-Record...
-
Sr. Engineer
3 weeks ago
Pittsburgh, United States Duquesne Light Company Full timeDuquesne Light Company, headquartered in downtown Pittsburgh, is a leader in providing electric energy and has been in the forefront of the electric energy market, with a history rooted in technological innovation and superior customer service. Today, the company continues its role as a leader in the transmission and distribution of electric energy,...
-
Senior Reliability Engineer
4 weeks ago
Pittsburgh, Pennsylvania, United States Philips Full timeAbout the Role:We are seeking a highly skilled Senior Reliability Engineer to join our team at Philips. As a key member of our Sleep and Respiratory Care business, you will play a critical role in developing and implementing reliability strategies to ensure the success of our products.Your Key Responsibilities:Develop a reliability program plan to document...
-
Sr. Network Engineer
6 days ago
pittsburgh, United States OpenArc, LLC. Full timeOpenArc - Empowering Your Career. As a leading IT staffing firm, we are dedicated to connecting talented professionals with your ideal opportunities. We are currently seeking a qualified Sr. Network Engineer to join our client’s organization and contribute to their ongoing success.Job summaryThis technical position will participate in the analysis,...
-
Sr. Network Engineer
6 days ago
Pittsburgh, United States OpenArc, LLC. Full timeOpenArc - Empowering Your Career. As a leading IT staffing firm, we are dedicated to connecting talented professionals with your ideal opportunities. We are currently seeking a qualified Sr. Network Engineer to join our client’s organization and contribute to their ongoing success.Job summaryThis technical position will participate in the analysis,...