Current jobs related to Site Reliability Engineer III - Pittsburgh - Net Health


  • Pittsburgh, Pennsylvania, United States Edge Case Research Full time

    Join Edge Case Research as a Site Reliability EngineerWe are a cutting-edge company specializing in autonomous system safety. Our team of experts is dedicated to developing products that ensure the safety of autonomous systems in various industries. We are currently expanding our DevOps and Site Reliability Team and looking for a skilled engineer to join...


  • Pittsburgh, United States FTSi.Tech Full time

    Manager Site Reliability Engineering Job DescriptionPosition Title: Manager Site Reliability EngineeringReports to: Director of Systems Engineering Position SummaryThis position is responsible managing the overall stability of customer engineering organization, facilitating a team of dedicated engineers while coordinating with stakeholders in development,...


  • Pittsburgh, Pennsylvania, United States PNC Financial Services Group Full time

    Job Profile Position Overview At PNC Financial Services Group, our workforce is our most significant differentiator and competitive edge in the markets we operate. We are united in our commitment to delivering exceptional experiences for our clients. We collaborate daily to cultivate an inclusive workplace culture where all employees feel respected, valued,...


  • Pittsburgh, United States The PNC Financial Services Group, Inc Full time

    Position Overview At PNC, our people are our greatest differentiator and competitive advantage in the markets we serve. We are all united in delivering the best experience for our customers. We work together each day to foster an inclusive workplace culture where all of our employees feel respected, valued and have an opportunity to contribute to the...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems Full time

    Reference #: Basic Qualifications: A Bachelor's degree in Computer Science or a related discipline, or equivalent experience, is essential, along with a minimum of 10 years of pertinent experience; alternatively, a Master's degree with 8 years of relevant experience is acceptable.Clearance Requirements: Candidates must have the ability to obtain a Department...


  • Pittsburgh, Pennsylvania, United States Google Full time

    About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.SRE ensures that Google Cloud's services both our internally critical and our externally-visible systems have reliability, uptime appropriate to customer's needs and a fast rate of...

  • Reliability Engineer

    4 weeks ago


    Pittsburgh, Pennsylvania, United States Philips Full time

    Job TitleReliability EngineerJob DescriptionDevelop the reliability strategy for the Philips Sleep & Respiratory CareBusiness.Please note: Due to the consolidation of our sites, this role will start at the Bakery Square office in Pittsburgh, Pennsylvania and will move to another site in the greater Pittsburgh area by the end of the year. This change will...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems, Inc Full time

    Essential Qualifications:Educational Background:A Bachelor's degree in Systems Engineering or a related field in Science, Engineering, or Mathematics is required. Additionally, 10+ years of relevant experience is necessary, or a Master's degree accompanied by 8 years of relevant experience. Experience in Agile methodologies is preferred.Security...


  • Pittsburgh, United States Google Full time

    About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems.SRE ensures that Google Cloud's services both our internally critical and our externally-visible systems have reliability, uptime appropriate to customer's needs and a fast rate of...

  • Sr. Software Engineer

    1 month ago


    Pittsburgh, United States Comcast Corporation Full time

    FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we’re making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...

  • Sr. Software Engineer

    2 weeks ago


    Pittsburgh, United States Comcast Corporation Full time

    FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we’re making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems, Inc Full time

    Job SummaryWe are seeking a highly skilled Senior Principal Site Reliability Engineer to join our team at General Dynamics Mission Systems, Inc. As a key member of our cross-functional team, you will be responsible for ensuring the survivability and reliability of mission-critical resources.Key ResponsibilitiesEnsure uptime of critical systems and...


  • Pittsburgh, United States The Bank of New York Mellon Full time

    Senior Vice President, Site Reliability/DevOps Engineer (Dev Infrastructure Platform) (Senior Vice President, Technical Product Specialist and App Delivery) At BNY, our culture empowers you to grow and succeed. As a leading global financial services company at the center of the world's financial system we touch nearly 20% of the world's investible assets....


  • Pittsburgh, Pennsylvania, United States General Dynamics Mission Systems Full time

    Reference #: Basic Qualifications: A Bachelor's degree in Systems Engineering or a related field in Science, Engineering, or Mathematics is required. Additionally, candidates should possess 5+ years of relevant experience, or a Master's degree accompanied by 3 years of pertinent experience. Experience in Agile methodologies is preferred.CLEARANCE...


  • Pittsburgh, Pennsylvania, United States Philips North America Full time

    Job Title: Lead Reliability EngineerJob Description:As a key member of the Philips Sleep & Respiratory Care team, you will be responsible for shaping the reliability framework.Your Responsibilities:Collaborate with project teams to create a comprehensive reliability program plan that outlines essential tasks, methodologies, tools, testing, and analyses...


  • Pittsburgh, Pennsylvania, United States Philips Full time

    Overview:As a Lead Reliability Engineer at Philips, you will be instrumental in shaping the reliability framework for our Sleep & Respiratory Care division.Your Responsibilities:Collaborate with cross-functional project teams to formulate a comprehensive reliability program plan, detailing the necessary tasks, methodologies, tools, testing protocols, and...


  • Pittsburgh, Pennsylvania, United States Philips Full time

    Job TitleSenior Reliability EngineerJob DescriptionDevelop the reliability strategy for the Philips Sleep & Respiratory CareBusiness.Your role:Work with the project teams to develop a reliability program plan to document tasks, methods, tools, tests and analyses that are required for the success of new products. Assist engineering teams to achieve warranty...


  • Pittsburgh, United States Philips Iberica SAU Full time

    Job Title Senior Reliability Engineer Job Description Your challenge Develop the reliability strategy for the business. You are responsible for: Work with the project teams to develop a reliability program plan to document tasks, methods, tools, tests and analyses that are required for the success of new products. Assist engineering teams to achieve warranty...


  • Pittsburgh, Pennsylvania, United States Actalent Full time

    Dynamic Opportunity for a Reliability Engineering Specialist Position Overview The Reliability Engineering Specialist plays a crucial role in ensuring the dependability and longevity of both new and existing machinery. This position entails spearheading initiatives aimed at enhancing processes and products, serving as a vital technical resource for the...


  • Pittsburgh, United States Aurora Innovation Full time

    Who We Are Aurora (Nasdaq: AUR) is delivering the benefits of self-driving technology safely, quickly, and broadly to make transportation safer, increasingly accessible, and more reliable and efficient than ever before. The Aurora Driver is a self-driving system designed to operate multiple vehicle types, from freight-hauling semi-trucks to ride-hailing...

Site Reliability Engineer III

2 months ago


Pittsburgh, United States Net Health Full time

JOB OVERVIEW

As a Site Reliability Engineer III, you will collaboratively manage the performance, stability, and redundancy of all Platform systems and infrastructure. You will be part of a team responsible for remediating system instability and slowness through monitoring, fault tolerance, tooling, capacity management, and automation. Proactive and relentless pursuit of the identification and implementation of infrastructure solutions to ensure high degrees of observability, availability, and reliability will be at the core of this role. Partnership with development teams in ensuring NH Platforms are performant, scalable, fault tolerant, and HIPAA compliant is critical.

RESPONSIBILITIES AND DUTIES

Leading emergency response efforts in conjunction with Engineering, Infrastructure, and Database teams to establish root cause Leading the efforts to build robust monitoring solutions while expanding our current monitoring and alerting footprint Participate in the design of solutions increasing the holistic stability of NH Platforms and identifying potential risks Conduct Blameless Postmortems and Anomaly Investigations after incidents to further analyze root cause and create permanent solutions to improve serviceability and prevent future outages Establish a Don't Repeat Incidents (DRI) culture by learning from past issues and always looking to improve monitoring and dashboarding capabilities Ensuring applications are performing efficiently, collaborating with development teams and architecture to resolve application performance issues Consults with management in the analysis of short- and long-range business requirements and recommends innovations Championing automation efforts to reduce or eliminate repetitive, manual processes Partner with project management to define Service Level Objectives (SLO) and identify and implement Service Level Indicators (SLI) to track compliance Championing capacity management and disaster recovery testing efforts QUALIFICATIONS Bachelor's degree in computer science OR equivalent 6+ years' progressive experience in IT Operations and/or systems management 6+ years direct experience in a technical role dealing with complex enterprise software landscapes (DevOps focused development) 6+ years' experience with scripting and automating technical activities Experience with best-in-class application monitoring (APM) tooling (New Relic, Dynatrace, AppDynamics) Direct, hands-on experience with automated software and system management. Strong knowledge of change control best practices and methodologies Experience with Ansible, Terraform, Python, or Docker (or similar) is a plus Experience with Agile development methodology and/or ITIL ITSM is a plus REQUIRED HARDWARE EXPERIENCE Servers, Workstations, Load Balancers, Switches, Routers, Firewalls, SAN, NAS and other storage hardware REQUIRED SOFTWARE EXPERIENCE PowerShell scripting, and coding standards Best-in-class application monitoring (APM) tooling (New Relic, Dynatrace, AppDynamics) Azure and/or AWS PaaS/IaaS Linux OS and Apache (e.g. SALT, etc.) Direct, hands-on experience with automated software delivery and system management. Agile development methodology Working understanding of Platform Engineering work model in a software development environment Proven project management skills and/or substantial exposure to project-based work structures, project lifecycle models, etc Proven experience in architecting and overseeing the direction, development, and implementation of technology solutions O/S - Windows and Linux, VMWare, Powershell, Azure Administration, PRTG and other systems monitoring software, DNS Management, IIS, TomCat, Docker, APM Monitoring, ITSM tools, SSL/TLS certificates, JavaScript, Json, Python, Ansible, Terraform, Vsphere, Kubernetes, Service Fabric, Azure Management, Elastic, Citrix, JIRA, New Relic, Project Management Tools, ADO, DUO, Secret Server, Qualys, Pager Duty Application, Couchbase, Redis, API gateways, DNS, Security, IP Routing, SSH, FTP, LDAP, HTTP/HTTPS, Email Routing, Jenkins, GitHub, AWS , Cloud development pipelines using CI/CD tooling, Bash scripting

#J-18808-Ljbffr