Site Reliability Engineer
2 days ago
This team will focus on product automation of Infrastructure, sustainability, and troubleshooting for Oracle Health.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering.
You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
Responsibilities- Take ownership of the architecture, analysis, design, implementation, and production operations of a wide array of Core System Framework solutions.
- React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems.
- Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support.
- Partner with the distributed team in prototyping new platform services.
- Stay informed of new technologies.
- Innovate.
- Solve complex problems related to infrastructure services and build automation to prevent problem recurrence.
- Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
- Develop designs, architectures, standards, and methods for large-scale distributed systems.
- Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
- The ability to acquire federal security clearance vital for this role, which requires you to be a US citizen.
- Developing/operating large-scale distributed services/applications.
- Infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer, or similar.
- Experience with Cloud Orchestration frameworks, development, and SRE support of these systems.
- Working with or supporting production, test, and development environments for medium to large user environments.
- Experience in developing scripts to automate software deployments and installations using PowerShell.
- Knowledge of cloud compute technologies, infrastructure monitoring, data processing, and analytics.
- Experience with a modern programming language such as Python.
- Experience working with fault-tolerant, highly available, high-throughput, distributed, scalable systems.
- Experience operating services in one of the major Clouds such as AWS, OCI, Azure, etc.
-
Site Reliability Engineer
2 weeks ago
Boston, Massachusetts, United States Insight Global Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our software systems.Key ResponsibilitiesDesign and implement scalable and reliable software systemsCollaborate with cross-functional teams to...
-
Site Reliability Engineering Manager
10 hours ago
Boston, Massachusetts, United States Klaviyo Full time{"title": "Site Reliability Engineering Manager", "description": "Job SummaryKlaviyo is seeking a Site Reliability Engineering Manager to lead our SRE Security team in Boston and remotely. As a key member of our engineering organization, you will be responsible for managing a team of 4-6 Site Reliability Engineers and working closely with product engineers...
-
Site Reliability Engineering Manager
5 days ago
Boston, Massachusetts, United States Insight Global Full timeAbout the RoleWe are seeking a highly motivated Site Reliability Engineer Manager to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will be responsible for providing tooling and guidance to our customer's product engineers to ensure productivity and success.Key ResponsibilitiesLead and deliver security...
-
Site Reliability Engineering Manager
2 days ago
Boston, Massachusetts, United States Insight Global Full timeAbout the RoleWe are seeking a highly motivated Site Reliability Engineer Manager to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will be responsible for providing tooling and guidance to our customer's product engineers to ensure productivity and success.Key ResponsibilitiesLead and deliver security...
-
Site Reliability Engineering Lead
2 days ago
Boston, Massachusetts, United States Klaviyo Full timeAbout KlaviyoKlaviyo is a leading provider of email marketing and customer data platforms. We empower creators to own their destiny by making first-party data accessible and actionable like never before.Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for...
-
Senior Site Reliability Engineer
2 weeks ago
Boston, Massachusetts, United States Insight Global Full timeAbout the RoleWe are seeking a highly motivated Senior Site Reliability Engineer to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will play a critical role in ensuring the reliability and scalability of our complex distributed systems.Key ResponsibilitiesDesign, develop, and deploy scalable and reliable...
-
Senior Site Reliability Engineer
3 days ago
Boston, Massachusetts, United States Insight Global Full timeJob DescriptionWe are seeking a highly motivated Senior Site Reliability Engineer to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will play a critical role in ensuring the reliability and scalability of our complex distributed systems.Key ResponsibilitiesDesign, develop, and deploy scalable and reliable...
-
Site Reliability Engineering Manager
6 days ago
Boston, Massachusetts, United States Insight Global Full timeAbout the RoleWe are seeking a highly motivated Site Reliability Engineer Manager to join our rapidly growing team. As a senior software engineer, you will be responsible for providing tooling and guidance to our customer's product engineers to ensure productivity and success.Key ResponsibilitiesBuilding backend services to enhance the product teams' overall...
-
Senior Site Reliability Engineer
2 days ago
Boston, Massachusetts, United States Klaviyo Full timeAbout the RoleWe're seeking a highly skilled Senior Site Reliability Engineer to join our team at Klaviyo. As a key member of our Site Reliability Engineering team, you will play a critical role in ensuring the reliability, scalability, and security of our services.Key ResponsibilitiesDesign and develop systems and processes to enable highly available and...
-
Site Reliability Engineer
7 days ago
Boston, Massachusetts, United States Datadog Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Datadog. As a key member of our operations team, you will be responsible for ensuring the reliability, availability, and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign, build, and maintain scalable and efficient infrastructure to support our...
-
Boston, Massachusetts, United States Business Value Intelligence Services Full timeJob Title: Site Reliability Engineer with Kubernetes Expertise Job Description: We are seeking a highly skilled Site Reliability Engineer with expertise in Kubernetes to join our team at Business Value Intelligence Services. Key Responsibilities: * Strong hands-on experience with Kubernetes, including deployment, scaling, and management * Proficient in...
-
Site Reliability Engineer
2 days ago
Boston, Massachusetts, United States StartUs GmbH Full timeJob DescriptionRole OverviewWe are seeking a highly skilled Site Reliability Engineer to join our team at Spotify. As a member of our small cross-functional squad, you will own a particular infrastructure challenge and be responsible for designing and documenting systems to automate away problems within your domain.Key ResponsibilitiesDesign and document...
-
Senior Site Reliability Engineer
2 weeks ago
Boston, Massachusetts, United States Klaviyo Full timeAbout the RoleWe're seeking a highly skilled Senior Site Reliability Engineer to join our team at Klaviyo. As a key member of our Site Reliability Engineering team, you will be responsible for designing, building, and delivering software to improve the availability, scalability, and efficiency of our services.Key ResponsibilitiesShip foundational services to...
-
Director of Site Reliability Engineering
6 days ago
Boston, Massachusetts, United States Red Hat Full timeAbout the JobThe Red Hat Site Reliability Engineering (SRE) team is seeking a Director, Site Reliability Engineering to lead our managed OpenShift cloud service offerings. As a Director of SRE, you'll oversee a region of SRE teams in the development and operations of our managed OpenShift services.Key ResponsibilitiesHire, develop, and retain SRE Managers...
-
Staff Site Reliability Engineer
5 days ago
Boston, Massachusetts, United States Zscaler Full timeAbout ZscalerZscaler is a leading cloud security company that provides a comprehensive security platform to protect enterprises from cyber threats. With a mission to make the cloud a safe place to do business, Zscaler has built a reputation for delivering innovative security solutions that enable organizations to accelerate their digital transformation.Job...
-
Principal Site Reliability Engineer
5 days ago
Boston, Massachusetts, United States Global InfoTek Full timeJob SummaryWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Global InfoTek, Inc. As a key member of our engineering team, you will be responsible for designing, building, and maintaining large-scale, multi-site infrastructure as code.Key ResponsibilitiesEvaluate and assess new ways to scale platform capabilitiesAutomate...
-
Site Reliability Engineer Lead
5 days ago
Boston, Massachusetts, United States Insight Global Full timeAbout the RoleWe are seeking a highly motivated Site Reliability Engineer to join our team in Downtown Boston. As a Lead SRE, you will be responsible for providing tooling and guidance to our product engineers to ensure productivity and success.Key ResponsibilitiesEmbed yourself within product teams to advance the architecture and performance of software...
-
Site Reliability Engineer Lead
1 week ago
Boston, Massachusetts, United States Insight Global Full timeAbout the RoleWe are seeking a highly motivated Site Reliability Engineer to join our team in Downtown Boston. As a Lead SRE, you will be responsible for providing tooling and guidance to our product engineers to ensure productivity and success.Key ResponsibilitiesEmbed yourself within product teams to advance the architecture and performance of software...
-
Staff Site Reliability Engineer
3 days ago
Boston, Massachusetts, United States Zscaler Full timeAbout ZscalerZscaler is a leading cloud security company that provides a secure platform for enterprises to connect users, devices, and applications in any location. With a mission to make the cloud a safe place to do business, Zscaler accelerates digital transformation and enables enterprises to be more agile, efficient, resilient, and secure.Our...
-
Staff Site Reliability Engineer
6 days ago
Boston, Massachusetts, United States Zscaler Full timeAbout ZscalerZscaler is a leading cloud security company that provides a secure platform for enterprises to connect users, devices, and applications in any location. With a mission to make the cloud a safe place to do business, Zscaler has built a reputation for delivering innovative security solutions that accelerate digital transformation.Job SummaryWe are...