Senior SRE Monitoring Engineer
8 hours ago
Senior SRE Monitoring Engineer We are looking for candidates with SRE monitoring experience who have the Pharma Industry experience. As a Senior Production Engineer, you will serve as a technical leader responsible for supporting architecture, securing, and sustaining the production infrastructure supporting our regulated digital health and medical software platforms. You will ensure reliability, scalability, and compliance of critical systems in alignment with FDA GxP guidelines and HITRUST standards for healthcare data protection. You will lead initiatives in incident response, deployment automation, observability, and capacity planning—leveraging modern DevOps/SRE methodologies, cloud-native technologies, and advanced tooling. Collaborating across engineering, quality, and compliance teams, you will ensure our solutions remain both safe and effective for patient care, while meeting stringent regulatory requirements. Key Responsibilities: • Support the design, implementation, and sustainment of CI/CD pipelines with embedded with auditable deployment processes. • Promote infrastructure-as-code using Terraform, Helm, and Ansible, incorporating HITRUST and GxP controls into reusable modules. • Architect and maintain highly available, scalable, and compliant systems leveraging Kubernetes and cloud platforms (AWS, GCP, Azure). • Apply SRE principles—defining, measuring, and improving reliability metrics (SLIs/SLOs/SLAs) in regulated healthcare environments. • Lead capacity planning, performance tuning, and infrastructure optimization initiatives focused on regulatory and privacy requirements. • Manage the full incident lifecycle (detection, triage, resolution, postmortem), documenting as required for FDA compliance and audit readiness. • Develop and maintain incident response playbooks, including IT and regulatory escalation protocols. • Implement and manage monitoring solutions (Datadog, Prometheus, Grafana, Elastic Search) to support rapid issue identification in compliance with healthcare mandates. • Integrate and manage SIEM tools (Splunk, Datadog Security, Elastic Security) for log aggregation, threat detection, and support of regulatory audits (HITRUST, GxP). • Collaborate with security, quality assurance, and regulatory teams to monitor and respond to production security incidents. • Ensure logging, auditing, and reporting meet FDA, HITRUST, ISO 27001 and healthcare industry standards—including data retention, traceability, and privacy safeguards. • Document and communicate infrastructure processes clearly to facilitate internal knowledge transfer and external audit readiness. • Plan and manage resource utilization to meet both performance goals and regulatory efficiency standards. • Troubleshoot and support cloud/network issues, ensuring secure handling of protected health information (PHI) and device data. Qualifications: • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field. • 7+ years in Production Engineering, DevOps, or SRE roles within healthcare, medical device, or life sciences industries. • Expertise in containerization (Kubernetes, Docker), cloud platforms, and infrastructure-as-code. • Direct experience supporting systems subject to FDA GxP and HITRUST compliance; familiarity with HIPAA, SOC2, ISO 27001 frameworks. • Strong skills in scripting/automation (Python, Bash, Go). • Proven track record managing SIEM and monitoring platforms in regulated environments. • In-depth knowledge of incident response and reliability engineering in healthcare/medical device settings. • Certifications in cloud security, DevOps, and/or healthcare compliance (e.g., HITRUST, AWS Security, etc.) strongly preferred. Preferred Skills: • Experience deploying and supporting medical device software under FDA regulations. • Familiarity with quality management systems, validation procedures, and documentation for regulatory audits and FDA submissions. • Strong communication and leadership skills for cross-functional collaboration in a regulated setting. • Ability to innovate while maintaining strict compliance constraints. ________________________________________ Featured benefits Medical insurance, Vision insurance, Dental insurance Powered by JazzHR
-
Senior SRE Engineer
4 days ago
Sunnyvale, United States Orevan Full timeRole: Senior SRE Engineer Location: Sunnyvale, CAInterview process: 2 video interviewsMust have GXP & Pharma clients ExperienceQualifications: Bachelor’s or Master’s degree in Computer Science, Engineering, or related field. 9+ years in Production Engineering, DevOps, or SRE roles within healthcare, medical device, or life sciences industries. Expertise...
-
Cloud Platform SRE
3 weeks ago
Sunnyvale, United States Alibaba Cloud Full timeJoin to apply for the Cloud Platform SRE role at Alibaba Cloud. Base Pay Range $104,400.00/yr - $171,000.00/yr Mission The mission of the Cloud Intelligence Group SRE (Site Reliability Engineering) Team is to ensure the stability of production environments, enterprise-grade cloud data reliability, and service continuity for the Cloud Intelligence Group. Our...
-
Sunnyvale, United States Google Full timeSenior Software Engineer, Site Reliability Engineering Join to apply for the Senior Software Engineer, Site Reliability Engineering role at Google 1 week ago Be among the first 25 applicants Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Sunnyvale, CA, USA; Mountain View, CA, USA....
-
Sunnyvale, United States Google Full timeSenior Systems Engineer, Site Reliability Engineering, Google Cloud Join to apply for the Senior Systems Engineer, Site Reliability Engineering, Google Cloud role at Google. About the job Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault‑tolerant systems. SRE ensures that...
-
Lead Senior DevOps Engineer
2 weeks ago
Sunnyvale, CA, United States Premier Group Inc Full timeSenior DevOps Engineer $175,000 - $210,000 Hybrid Join us as a Senior DevOps Engineer and play a pivotal role in driving infrastructure initiatives, streamlining CI/CD systems, and mentoring a team of talented engineers. This is a unique opportunity to contribute to the scaling of our client's platform. Role Overview As a Senior DevOps Engineer, you’ll own...
-
Lead Senior DevOps Engineer
2 weeks ago
Sunnyvale, CA, United States Premier Group Inc Full timeSenior DevOps Engineer $175,000 - $210,000 Hybrid Join us as a Senior DevOps Engineer and play a pivotal role in driving infrastructure initiatives, streamlining CI/CD systems, and mentoring a team of talented engineers. This is a unique opportunity to contribute to the scaling of our client's platform. Role Overview As a Senior DevOps Engineer, you’ll own...
-
Sunnyvale, CA, United States Donato Technologies Inc Full timeGreetings from Donato Technologies Inc. We have an immediate opening with my client. If you are looking for a new project, please send me a copy of your updated resumes Title: Sr. SRE / DevOps Engineer Location: Sunnyvale, CA (Only Local candidate) Client Interview – In-Person Job Summary – For this role, we are looking for a Sr. SRE / DevOps Engineer at...
-
Sunnyvale, United States Google Inc. Full timeSenior Software Engineer, Site Reliability Engineering corporate_fare Google place Sunnyvale, CA, USA Apply Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 5 years of experience with software development in one or more programming languages. 3 years of experience in designing, analyzing, and troubleshooting...
-
Senior SRE
3 weeks ago
Sunnyvale, United States Blockchain Technologies. LLC Full timeA technology company in Sunnyvale, CA, seeks an experienced SRE DevOps professional with advanced skills in AWS and programming. Candidates should have 8+ years in a related role and a strong passion for building reliable systems. This hybrid position requires significant experience with CI/CD, Linux, and monitoring tools like CloudWatch and Grafana,...
-
Sunnyvale, United States Google Inc. Full timeSenior Staff Software Engineer, Site Reliability Engineering Apply X Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Sunnyvale, CA, USA; New York, NY, USA. Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 8 years of experience with...