Site Reliability Engineer

3 weeks ago


Pittsburgh, United States ConsultUSA Full time

Description:

Our client has an immediate need for a Site Reliability Engineer, who will be responsible for specializing in improving all aspects of reliability, acting as a conduit between infrastructure and application teams on support issues, and improving tools, automation, processes, and software.


Requirements:

  • Bachelor’s degree in Engineering, Computer Science, or a related field is a plus
  • Extensive experience in network performance tuning and monitoring
  • Deep understanding of network protocols (e.g., TCP/IP, DNS, HTTP/S) and network optimization techniques.
  • Proficiency with Dynatrace and BigPanda for real-time monitoring, root cause analysis, and incident response; hands-on experience with these tools
  • Strong background in configuring, managing, and troubleshooting network performance and latency issues across complex, distributed systems
  • Experience with additional monitoring and observability tools like Thousand Eyes and Grafana
  • Skilled in Ansible Tower for automation of network and system configurations
  • Demonstrated ability to collaborate with cross-functional teams, troubleshoot effectively, and proactively identify areas for improvement in network reliability and performance
  • Proven experience in incident/problem management with a good understanding of any of the tools used for this purpose is a plus
  • - Good understanding of both UNIX and Windows operating systems
  • - Good understanding of web hosting technologies like apache/tomcat or other equivalent web/app servers
  • - Good understanding of Big Data & cloud concepts
  • - Good understanding of database technologies like ORACLE and SQL
  • - Good understanding of monitoring tools is an added advantage
  • - Solid understanding of the major functionality bundled into a release, both from a technology and business point of view
  • - Strong knowledge of relevant applications and development life cycles
  • - Experience working with geographically distributed and culturally diverse work-groups


Responsibilities:

  • Monitor infrastructure, servers, middleware, databases, and batch jobs
  • Aggressively respond to service requests from business partners facing support teams, Operations, Risk/control partners, etc.
  • Troubleshoot environment, data control, and operational issues
  • Create and Maintain documentation to ensure knowledge accessibility
  • Automate and streamline processes using scripts and scheduling tools
  • Liaise with other application support teams and internal/external business and technical partners
  • Provide ad hoc and on-demand reports
  • Perform timely escalation of critical issues and proactively identify patterns of recurring issues to improve production
  • Lead problem resolution conduct root cause analysis and establish processes that will help incident prevention
  • Participates in the Incident and Problem Management processes as a resolver accountable for root cause analysis, resolution, and reporting
  • Ensures that all production changes are processed according to Change Management policies and procedures
  • Ensures that appropriate levels of Quality Assurance have been met for all new and existing products
  • Support Sustained Resiliency, Disaster Recovery, and High Availability events
  • Help the Level 2 operation team with setting up monitoring and bridging the gaps in the current monitoring setup
  • Play a key part in setting up reporting and be a key component in Monitor -> Report -> Improve principle
  • Coordinate incident management coverage, to ensure appropriate coverage
  • Call facilitation, coordination, and communications during critical outage situations
  • Call documentation, queue management, ticket analysis, and interface to impacting lines of business for incident impact analysis via the Production Assurance process
  • End-to-end view of issues for objectivity
  • Influence senior technology leads across organizations to ensure the timely resolution of incidents
  • Participate and ensure RCA (root cause analysis) activities on client-impacting incidents are executed and action items are assigned/completed
  • Provide expertise and support during critical incidents, interfacing with all impacted groups to better manage the message
  • Chronic issue coordination and leadership.
  • • Guidance to all staff involved and vendors in driving a coordinated approach to results
  • Responsible for data quality of PLM
  • Work aggressively to make sure all servers are up to company standards as per uptimes, patch level, etc
  • Work on Capacity planning for applications, estimating and analyzing growth rates of vital infrastructure components, and adding capacity pro-actively as and when required
  • Understand application code, workflow, and business usage of the application
  • Understand DB component of application
  • Understand the impacts of application based on the seasonality of critical applications
  • Document known errors and play an important role in Knowledge transfer to the Level 1 team
  • Reduce escalations to Level 3 based on incremental learning about applications

Why Work for ConsultUSA:

  • ConsultUSA offers competitive salaries, major medical (PPO or HDHP w/ HSA), dental, and vision insurance plans, and 401k plan with immediate eligibility for both salary and hourly employees
  • ConsultUSA hosts several outings and events, holiday and summer parties, and volunteer opportunities throughout the year for employees
  • We will work with you to obtain training for in-demand technologies and prepare you for industry-recognized certification exams
  • ConsultUSA offers Business Analysis and Project Management training through our Project Management Institute (PMI)® award-winning sister company, PMCentersUSA

How to Apply:

To submit your application, please click the “Apply Now” button located at the top and bottom of the page.

ConsultUSA is committed to providing equal employment opportunities (EEO) to all qualified employees and applicants for employment without regard to race, color, religion, gender identity or expression, sexual orientation, national origin, age, disability, genetic information, marital status, pregnancy, ancestry, or status as a covered veteran as well as any other prohibited criteria under any applicable federal, state, and local laws applicable to ConsultUSA.

For a complete listing of all ConsultUSA jobs please visit www.consultusa.com



  • Pittsburgh, Pennsylvania, United States PNC Full time

    Job SummaryPNC is seeking a highly skilled Site Reliability Engineer Manager to join our team. As an SRE Group Manager, you will be responsible for leading a team of Site Reliability Engineers to ensure the reliability and performance of our applications and infrastructure.Key ResponsibilitiesLead a team of Site Reliability Engineers to design, implement,...


  • Pittsburgh, United States Sygna LLC Full time

    Job Title: Sr. Site Reliability Engineer Contract Type: Contract to hireLocation: Hybrid (Dallas Tx / Pittsburgh PA) Must Have and Metrics Technical Skills: Years of experience: 7+


  • Pittsburgh, Pennsylvania, United States PNC Full time

    Job DescriptionPosition OverviewPNC is a leading financial institution that values its people as its greatest differentiator and competitive advantage. We strive to deliver the best experience for our customers by fostering an inclusive workplace culture where all employees feel respected, valued, and empowered to contribute to the company's success.As a...


  • pittsburgh, United States ConsultUSA Full time

    Description:Our client has an immediate need for a Site Reliability Engineer, who will be responsible for specializing in improving all aspects of reliability, acting as a conduit between infrastructure and application teams on support issues, and improving tools, automation, processes, and software.Requirements:Bachelor’s degree in Engineering, Computer...


  • pittsburgh, United States Rose International Full time

    Date Posted: 11/08/2024Hiring Organization: Rose InternationalPosition Number: 474141Job Title: Site Reliability EngineerJob Location: Pittsburgh, PA, USA, 15222Work Model: HybridShift:Hybrid: 3 days in office / 2 remoteHours: 8 am to 5 pm CSTEmployment Type: Temp to HireEstimated Duration (In months): 6Min Hourly Rate($): 65.00Max Hourly Rate($): 70.00Must...


  • Pittsburgh, United States Rose International Full time

    Date Posted: 11/08/2024Hiring Organization: Rose InternationalPosition Number: 474141Job Title: Site Reliability EngineerJob Location: Pittsburgh, PA, USA, 15222Work Model: HybridShift:Hybrid: 3 days in office / 2 remoteHours: 8 am to 5 pm CSTEmployment Type: Temp to HireEstimated Duration (In months): 6Min Hourly Rate($): 65.00Max Hourly Rate($): 70.00Must...


  • Pittsburgh, United States Sygna LLC Full time

    Job Title: Sr. Site Reliability Engineer Contract Type: Contract to hire Location: Hybrid (Dallas Tx / Pittsburgh PA) Must Have and Metrics Technical Skills: Years of experience: 7+ Ability to collaborate with cross-functional teams, troubleshoot effectively, and proactively identify areas for improvement in network reliability and performance...

  • FUll Stack Developer

    2 months ago


    Pittsburgh, United States Stefanini North America and APAC Full time

    We are seeking a talented Full Stack / Site Reliability Engineer to play a key role in developing a comprehensive Internal Developer Platform (IDP) that includes CI/CD pipelines, managed infrastructure, observability, and a developer portal. The primary focus of this role will be on ensuring the stability and scalability of the Internal Developer Platform...


  • Pittsburgh, United States Sygna LLC Full time

    Job Title: Sr. Site Reliability Engineer Is this the role you are looking for If so read on for more details, and make sure to apply today.Contract Type: Contract to hireLocation: Hybrid (Dallas Tx / Pittsburgh PA) Must Have and Metrics Technical Skills: Years of experience: 7+ Ability to collaborate with cross-functional teams, troubleshoot effectively, and...


  • Pittsburgh, United States BNY Full time

    Vice President, Site Reliability/DevOps Engineer (Dev Infrastructure Platform) (Vice President, Technical Product Specialist and App Delivery) At BNY, our culture empowers you to grow and succeed. As a leading global financial services company at the center of the world’s financial system we touch nearly 20% of the world’s investible assets. Every day...


  • Pittsburgh, PA, United States ConsultUSA Full time

    Description:Our client has an immediate need for a Site Reliability Engineer, who will be responsible for specializing in improving all aspects of reliability, acting as a conduit between infrastructure and application teams on support issues, and improving tools, automation, processes, and software.Requirements:Bachelor’s degree in Engineering, Computer...


  • Pittsburgh, PA, United States Rose International Full time

    Date Posted: 11/08/2024Hiring Organization: Rose InternationalPosition Number: 474141Job Title: Site Reliability EngineerJob Location: Pittsburgh, PA, USA, 15222Work Model: HybridShift:Hybrid: 3 days in office / 2 remoteHours: 8 am to 5 pm CSTEmployment Type: Temp to HireEstimated Duration (In months): 6Min Hourly Rate($): 65.00Max Hourly Rate($): 70.00Must...


  • Pittsburgh, Pennsylvania, United States Aurora Innovation Full time

    Job DescriptionAurora Innovation is seeking a highly skilled InfoSec Site Reliability Engineer to join our team. As a key member of our Client Platform Engineering group, you will be responsible for ensuring the integrity and availability of our enterprise fleet of Ubuntu, Mac, and Windows laptops, as well as our InfoSec/Enterprise infrastructure...


  • Pittsburgh, Pennsylvania, United States Philips Full time

    About the Role:We are seeking a highly skilled Senior Reliability Engineer to join our team at Philips. As a key member of our Sleep and Respiratory Care business, you will play a critical role in developing and implementing reliability strategies to ensure the success of our products.Your Key Responsibilities:Develop a reliability program plan to document...


  • UNITED STATES, PA, PITTSBURGH BNY Full time

    Vice President, Site Reliability/DevOps Engineer (Dev Infrastructure Platform) (Vice President, Technical Product Specialist and App Delivery) At BNY, our culture empowers you to grow and succeed. As a leading global financial services company at the center of the world’s financial system we touch nearly 20% of the world’s investible assets. Every day...

  • Site Civil Engineer

    3 weeks ago


    Pittsburgh, United States Bohler Engineering Full time

    Overview At Bohler, we empower the ambitious to become the accomplished. This greater purpose connects us with like-minded professionals, fosters meaningful relationships, and generates the alignment necessary to produce an unrivaled consulting and employment experience. Our Pittsburgh, PA office is looking for a Design Engineer who embodies this purpose. ...

  • Reliability Engineer

    4 weeks ago


    Pittsburgh, Pennsylvania, United States Philips Full time

    Job SummaryWe are seeking a highly skilled Reliability Engineer to join our team at Philips Sleep & Respiratory Care. As a key member of our engineering team, you will be responsible for developing and implementing reliability strategies to ensure the success of our products.Key ResponsibilitiesDevelop reliability program plans to document tasks, methods,...


  • Pittsburgh, Pennsylvania, United States Bohler Engineering Full time

    About the Role:We are seeking a talented Design Engineer to join our team at Bohler Engineering. As a Design Engineer, you will collaborate with our team to work on challenging land development projects in a fast-paced environment.Key Responsibilities:Collaborate with team members to work on challenging land development projectsEnhance your technical site...


  • Pittsburgh, Pennsylvania, United States Bohler Full time

    About the Role:We are seeking a Senior Site Civil Engineer to join our team at Bohler. As a Senior Site Civil Engineer, you will be responsible for designing site plans for some of the most recognizable brand name clients across a wide spectrum of industries.Key Responsibilities:Engage in valuable interactions with external clientsPerfect your technical site...


  • Pittsburgh, United States Colliers Engineering & Design Full time

    Overview: A Civil/Site Senior Engineer is a key member of the Civil/Site design team. A Senior Engineer is responsible for various technical aspects of land development projects including, but not limited to: site feasibility and conceptual planning, detailed grading, drainage designs and profiles, stormwater management, designing Erosion & Sedimentation...