Site Reliability Engineer III - Pittsburgh - Net Health
2 months ago
JOB OVERVIEW
As a Site Reliability Engineer III, you will collaboratively manage the performance, stability, and redundancy of all Platform systems and infrastructure. You will be part of a team responsible for remediating system instability and slowness through monitoring, fault tolerance, tooling, capacity management, and automation. At the core of this role is the proactive, relentless identification and implementation of infrastructure solutions that ensure a high degree of observability, availability, and reliability. Partnering with development teams to ensure NH Platforms are performant, scalable, fault tolerant, and HIPAA compliant is critical.
RESPONSIBILITIES AND DUTIES
Lead emergency response efforts in conjunction with Engineering, Infrastructure, and Database teams to establish root cause
Lead the efforts to build robust monitoring solutions while expanding our current monitoring and alerting footprint
Participate in the design of solutions that increase the holistic stability of NH Platforms and identify potential risks
Conduct Blameless Postmortems and Anomaly Investigations after incidents to further analyze root cause and create permanent solutions that improve serviceability and prevent future outages
Establish a Don't Repeat Incidents (DRI) culture by learning from past issues and always looking to improve monitoring and dashboarding capabilities
Ensure applications perform efficiently, collaborating with development teams and architecture to resolve application performance issues
Consult with management in the analysis of short- and long-range business requirements and recommend innovations
Champion automation efforts to reduce or eliminate repetitive, manual processes
Partner with project management to define Service Level Objectives (SLOs) and identify and implement Service Level Indicators (SLIs) to track compliance
Champion capacity management and disaster recovery testing efforts
QUALIFICATIONS
Bachelor's degree in computer science, or an equivalent 6+ years of progressive experience in IT Operations and/or systems management
6+ years' direct experience in a technical role dealing with complex enterprise software landscapes (DevOps-focused development)
6+ years' experience with scripting and automating technical activities
Experience with best-in-class application performance monitoring (APM) tooling (New Relic, Dynatrace, AppDynamics)
Direct, hands-on experience with automated software and system management
Strong knowledge of change control best practices and methodologies
Experience with Ansible, Terraform, Python, or Docker (or similar) is a plus
Experience with Agile development methodology and/or ITIL ITSM is a plus
REQUIRED HARDWARE EXPERIENCE
Servers, Workstations, Load Balancers, Switches, Routers, Firewalls, SAN, NAS and other storage hardware
REQUIRED SOFTWARE EXPERIENCE
PowerShell scripting and coding standards
Best-in-class application monitoring (APM) tooling (New Relic, Dynatrace, AppDynamics)
Azure and/or AWS PaaS/IaaS
Linux OS and Apache (e.g. SALT, etc.)
Direct, hands-on experience with automated software delivery and system management
Agile development methodology
Working understanding of Platform Engineering work model in a software development environment
Proven project management skills and/or substantial exposure to project-based work structures, project lifecycle models, etc.
Proven experience in architecting and overseeing the direction, development, and implementation of technology solutions
O/S - Windows and Linux, VMware, PowerShell, Azure Administration, PRTG and other systems monitoring software, DNS Management, IIS, Tomcat, Docker, APM Monitoring, ITSM tools, SSL/TLS certificates, JavaScript, JSON, Python, Ansible, Terraform, vSphere, Kubernetes, Service Fabric, Azure Management, Elastic, Citrix, JIRA, New Relic, Project Management Tools, ADO, DUO, Secret Server, Qualys, PagerDuty, Couchbase, Redis, API gateways, DNS, Security, IP Routing, SSH, FTP, LDAP, HTTP/HTTPS, Email Routing, Jenkins, GitHub, AWS, Cloud development pipelines using CI/CD tooling, Bash scripting