Engineer I, Site Reliability

3 weeks ago


Chicago, United States Oak Street Health Full time

Role Description

As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our stellar software engineering teams in a fast-paced, agile environment to transform ideas into a reality. Utilizing modern methodologies and open source tools, you will be empowered to set the engineering excellence standards as we seek to deliver applications that will directly and immediately impact the experience of our teams and our patients. 

Core Responsibilities

Review systems to identify and implement the necessary telemetry, monitoring and alerting for proactive and reactive management. 

Partner with Product and AD to define/review Service Level Objectives and Service Level Agreements. 

Participate in design reviews to ensure solutions can meet SLO's / SLA's.

Design and automate performance and resiliency test cases in partnership with application development and infra teams.

Identify and eliminate manual repeatable tasks with automation or application enhancements partnering with development. 

Other duties, as assigned.

What are we looking for?

Bachelors or Relevant industry experience

Minimum of 3 years of development experience in consumer facing products leveraging cloud native technologies 

Experience automating pipelines using continuous delivery tools.

Experience with system monitoring, alerting and observability platform tools and best practices. 

Experience with capacity planning and management.

Experience with resilient systems, resiliency testing and design best practices.

Experience with nonfunctional requirements along with SLO's/ SLA's.

Preferred: Our Tech Stack â Istio, Grafana Labs,.NET Core, Confluent Kafka, Mongo, gRPC, AKS, Docker, Azure

Preferred: Experience managing Kubernetes clusters in a production environment

Preferred: Experience monitoring applications at scale using Microservices

US Work Authorization

Someone who embodies being 'Oaky'



  • Chicago, United States Allied Reliability Full time

    Overview: The Maintenance Reliability Engineer is responsible for implementing machinery and process improvements using management of change best practices while promoting values of a safe, environmentally compliant workplace, and philosophy of continuous improvement with the workforce. Responsibilities: Process Improvements and Operational Upgrading Works...


  • Chicago, United States Allied Reliability Full time

    Overview The Maintenance Reliability Engineer is responsible for implementing machinery and process improvements using management of change best practices while promoting values of a safe, environmentally compliant workplace, and philosophy of continuous improvement with the workforce. Responsibilities Process Improvements and Operational Upgrading Works...


  • Chicago, Illinois, United States Motion Recruitment Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, United States Rackera Inc Full time

    Find the below role Role : Site reliability engineerLocation :Chicago, IllinoisLong term project Job Description:6+ plus years of application development experience using modern technologies and architecture, including experience collaborating with technology teams.2 plus years of Site Reliability Engineering experience.Good Understanding of at least one...


  • Chicago, United States JobRialto Full time

    Top 3 requirements: Ecommerce experience (think Nordstrom, Target, where you purchase a product) Java Spring boot Kubernetes Plusses: Azure Kubernetes preferred Description: Client is looking for a forward-thinking, energetic Site Reliability Engineering Manager to join our team. Client serves the ecommerce needs of leading and growing grocery retailers...


  • Chicago, Illinois, United States Balyasny Asset Management L. P Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up.As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure.Develop and promote our SRE philosophy, establishing best practices and processes that will be...


  • Chicago, United States McDonald's Corporation Full time

    Job Description This opportunity is part of the DevOps COE in CPP Delivery office, where our mission is to help our product engineering teams deliver faster with improved quality and reliability. We work multi-functional with our global product teams and market teams in defining and executing on our automation test strategy, improving our build and deploy...


  • Chicago, United States Cleo Full time

    Site Reliability Engineer At Cleo, we make doing business easy! Cleo is an established software company with a start-up feel. We have awesome products, which go hand in hand with our awesome culture! We are devoted to our people and pride ourselves on creating a fun, laid-back, but fast-paced work environment. Not only do we work hard, we play hard. We have...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond. This role will work 3x's a week in the Downtown Chicago area onsite. Key Responsibilities: Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States Info Way Solutions Full time

    Site Reliability Engineer in Wealth Management Chicago (IL) / Tempe (AZ) Onsite Job ROLE: This role will be Responsible for application observability, maintenance, and support, identifying and implementing preventive measures proactively, evaluates and makes recommendation on techniques, practices, or technologies that would enhance business needs. As a SRE...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond.This role will work 3x's a week in the Downtown Chicago area onsite.Key Responsibilities:Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond.This role will work 3x's a week in the Downtown Chicago area onsite.Key Responsibilities:Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond.This role will work 3x's a week in the Downtown Chicago area onsite.Key Responsibilities:Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States AmericanEagle.com Full time

    Americaneagle.com is a family-owned web design, development, and digital marketing agency with a passionate belief in the power of technology to positively transform business practices. Our focus is on helping customers grow and achieve success in the digital space. We cover a variety of different industries, including eCommerce, associations & nonprofits,...


  • Chicago, United States Motion Recruitment Partners, LLC Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, United States Saxon Global Full time

    Site Reliability Engineer (SRE) - (Azure, Systems background) Client: Lexis Nexis Location: REMOTE Rate: $62 C2C Duration: 1 Year Notes: Azure, Systems background experience •BSc Engineering/Computer Science or relevant experience. •Proven background working in a technical, IT related position. •Desirable -Azure Certifications ...


  • Chicago, United States Motion Recruitment Partners LLC Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, United States Motion Recruitment Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, United States Balyasny Asset Management Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up. As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...


  • Chicago, United States Oak Street Health Full time

    Description Company: Oak Street Health Title: Engineer II, Site Reliability Engineer Location: Chicago Role Description: As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our...