Site Reliability Engineer

2 days ago


Boston, Massachusetts, United States Oracle Full time
Job Description

This team will focus on product automation of Infrastructure, sustainability, and troubleshooting for Oracle Health.

As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering.

You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.

Responsibilities
  • Take ownership of the architecture, analysis, design, implementation, and production operations of a wide array of Core System Framework solutions.
  • React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems.
  • Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support.
  • Partner with the distributed team in prototyping new platform services.
  • Stay informed of new technologies.
  • Innovate.
  • Solve complex problems related to infrastructure services and build automation to prevent problem recurrence.
  • Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
  • Develop designs, architectures, standards, and methods for large-scale distributed systems.
  • Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
Key Requirements/Experience
  • The ability to acquire federal security clearance vital for this role, which requires you to be a US citizen.
  • Developing/operating large-scale distributed services/applications.
  • Infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer, or similar.
  • Experience with Cloud Orchestration frameworks, development, and SRE support of these systems.
  • Working with or supporting production, test, and development environments for medium to large user environments.
  • Experience in developing scripts to automate software deployments and installations using PowerShell.
  • Knowledge of cloud compute technologies, infrastructure monitoring, data processing, and analytics.
  • Experience with a modern programming language such as Python.
  • Experience working with fault-tolerant, highly available, high-throughput, distributed, scalable systems.
  • Experience operating services in one of the major Clouds such as AWS, OCI, Azure, etc.


  • Boston, Massachusetts, United States Insight Global Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our software systems.Key ResponsibilitiesDesign and implement scalable and reliable software systemsCollaborate with cross-functional teams to...


  • Boston, Massachusetts, United States Klaviyo Full time

    {"title": "Site Reliability Engineering Manager", "description": "Job SummaryKlaviyo is seeking a Site Reliability Engineering Manager to lead our SRE Security team in Boston and remotely. As a key member of our engineering organization, you will be responsible for managing a team of 4-6 Site Reliability Engineers and working closely with product engineers...


  • Boston, Massachusetts, United States Insight Global Full time

    About the RoleWe are seeking a highly motivated Site Reliability Engineer Manager to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will be responsible for providing tooling and guidance to our customer's product engineers to ensure productivity and success.Key ResponsibilitiesLead and deliver security...


  • Boston, Massachusetts, United States Insight Global Full time

    About the RoleWe are seeking a highly motivated Site Reliability Engineer Manager to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will be responsible for providing tooling and guidance to our customer's product engineers to ensure productivity and success.Key ResponsibilitiesLead and deliver security...


  • Boston, Massachusetts, United States Klaviyo Full time

    About KlaviyoKlaviyo is a leading provider of email marketing and customer data platforms. We empower creators to own their destiny by making first-party data accessible and actionable like never before.Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for...


  • Boston, Massachusetts, United States Insight Global Full time

    About the RoleWe are seeking a highly motivated Senior Site Reliability Engineer to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will play a critical role in ensuring the reliability and scalability of our complex distributed systems.Key ResponsibilitiesDesign, develop, and deploy scalable and reliable...


  • Boston, Massachusetts, United States Insight Global Full time

    Job DescriptionWe are seeking a highly motivated Senior Site Reliability Engineer to join our team at Insight Global. As a key member of our Site Reliability Engineering team, you will play a critical role in ensuring the reliability and scalability of our complex distributed systems.Key ResponsibilitiesDesign, develop, and deploy scalable and reliable...


  • Boston, Massachusetts, United States Insight Global Full time

    About the RoleWe are seeking a highly motivated Site Reliability Engineer Manager to join our rapidly growing team. As a senior software engineer, you will be responsible for providing tooling and guidance to our customer's product engineers to ensure productivity and success.Key ResponsibilitiesBuilding backend services to enhance the product teams' overall...


  • Boston, Massachusetts, United States Klaviyo Full time

    About the RoleWe're seeking a highly skilled Senior Site Reliability Engineer to join our team at Klaviyo. As a key member of our Site Reliability Engineering team, you will play a critical role in ensuring the reliability, scalability, and security of our services.Key ResponsibilitiesDesign and develop systems and processes to enable highly available and...


  • Boston, Massachusetts, United States Datadog Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Datadog. As a key member of our operations team, you will be responsible for ensuring the reliability, availability, and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign, build, and maintain scalable and efficient infrastructure to support our...


  • Boston, Massachusetts, United States Business Value Intelligence Services Full time

    Job Title: Site Reliability Engineer with Kubernetes Expertise Job Description: We are seeking a highly skilled Site Reliability Engineer with expertise in Kubernetes to join our team at Business Value Intelligence Services. Key Responsibilities: * Strong hands-on experience with Kubernetes, including deployment, scaling, and management * Proficient in...


  • Boston, Massachusetts, United States StartUs GmbH Full time

    Job DescriptionRole OverviewWe are seeking a highly skilled Site Reliability Engineer to join our team at Spotify. As a member of our small cross-functional squad, you will own a particular infrastructure challenge and be responsible for designing and documenting systems to automate away problems within your domain.Key ResponsibilitiesDesign and document...


  • Boston, Massachusetts, United States Klaviyo Full time

    About the RoleWe're seeking a highly skilled Senior Site Reliability Engineer to join our team at Klaviyo. As a key member of our Site Reliability Engineering team, you will be responsible for designing, building, and delivering software to improve the availability, scalability, and efficiency of our services.Key ResponsibilitiesShip foundational services to...


  • Boston, Massachusetts, United States Red Hat Full time

    About the JobThe Red Hat Site Reliability Engineering (SRE) team is seeking a Director, Site Reliability Engineering to lead our managed OpenShift cloud service offerings. As a Director of SRE, you'll oversee a region of SRE teams in the development and operations of our managed OpenShift services.Key ResponsibilitiesHire, develop, and retain SRE Managers...


  • Boston, Massachusetts, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security company that provides a comprehensive security platform to protect enterprises from cyber threats. With a mission to make the cloud a safe place to do business, Zscaler has built a reputation for delivering innovative security solutions that enable organizations to accelerate their digital transformation.Job...


  • Boston, Massachusetts, United States Global InfoTek Full time

    Job SummaryWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Global InfoTek, Inc. As a key member of our engineering team, you will be responsible for designing, building, and maintaining large-scale, multi-site infrastructure as code.Key ResponsibilitiesEvaluate and assess new ways to scale platform capabilitiesAutomate...


  • Boston, Massachusetts, United States Insight Global Full time

    About the RoleWe are seeking a highly motivated Site Reliability Engineer to join our team in Downtown Boston. As a Lead SRE, you will be responsible for providing tooling and guidance to our product engineers to ensure productivity and success.Key ResponsibilitiesEmbed yourself within product teams to advance the architecture and performance of software...


  • Boston, Massachusetts, United States Insight Global Full time

    About the RoleWe are seeking a highly motivated Site Reliability Engineer to join our team in Downtown Boston. As a Lead SRE, you will be responsible for providing tooling and guidance to our product engineers to ensure productivity and success.Key ResponsibilitiesEmbed yourself within product teams to advance the architecture and performance of software...


  • Boston, Massachusetts, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security company that provides a secure platform for enterprises to connect users, devices, and applications in any location. With a mission to make the cloud a safe place to do business, Zscaler accelerates digital transformation and enables enterprises to be more agile, efficient, resilient, and secure.Our...


  • Boston, Massachusetts, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security company that provides a secure platform for enterprises to connect users, devices, and applications in any location. With a mission to make the cloud a safe place to do business, Zscaler has built a reputation for delivering innovative security solutions that accelerate digital transformation.Job SummaryWe are...