Engineer I, Site Reliability

4 weeks ago


Chicago, United States Oak Street Health Full time

Role Description

As an Engineer I - Site Reliability Engineer (SRE), you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications. You will work closely with cross-functional teams to implement automation, optimize processes, and enhance observability to maintain high availability and performance of our infrastructure.Key Responsibilities:Collaborate with development, operations, and other cross-functional teams to design, implement, and maintain scalable and reliable systems.Utilize Grafana and other observability tools to monitor system performance, troubleshoot issues, and implement proactive measures to ensure optimal performance.Manage and administer Azure infrastructure, including resource provisioning, configuration, and optimization.Develop and execute performance and load testing strategies to identify and address bottlenecks and optimize system performance.Create and maintain automation scripts using PowerShell, JavaScript, and other scripting languages to streamline operational tasks and improve efficiency.Implement systems integration solutions to ensure seamless communication and interoperability between different systems and services.Participate in on-call rotations and respond to incidents in a timely manner to minimize downtime and maintain service availability.Document processes, procedures, and configurations to ensure knowledge sharing and maintain system reliability.Qualifications:Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience2+ years of experience working with Grafana, Azure administration, or similar observability tools and cloud platforms.Experience with performance and load testing methodologies and tools.Proficiency in scripting languages such as PowerShell and JavaScript.Strong automation skills and experience with configuration management tools.Excellent communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.Proven track record of troubleshooting complex issues and implementing effective solutions.Ability to adapt to a fast-paced environment and prioritize multiple tasks effectively.Preferred Qualifications:Certification in Azure administration or related fields.Experience with containerization technologies such as Docker and Kubernetes.Familiarity with CI/CD pipelines and DevOps principles.Knowledge of networking concepts and protocols.Previous experience in a Site Reliability Engineering or similar role.

Why Oak Street Health?

Oak Street Health is on a mission to 'Rebuild healthcare as it should be'', providing personalized primary care for older adults on Medicare, with the goal of keeping patients healthy and living life to the fullest. Our innovative care model is centered right in our patient's communities, and focused on the quality of care over volume of services. We're an organization on the move With over 150 locations and an ambitious growth trajectory, Oak Street Health is attracting and cultivating team members who embody 'Oaky' values and passion for our mission.

Oak Street Health Benefits: 

Mission-focused career impacting change and measurably improving health outcomes for medicare patients

Paid vacation, sick time, and investment/retirement 401K match options

Health insurance, vision, and dental benefits

Opportunities for leadership development and continuing education stipends

New centers and flexible work environments

Opportunities for high levels of responsibility and rapid advancement



  • Chicago, United States Allied Reliability Full time

    Overview: The Maintenance Reliability Engineer is responsible for implementing machinery and process improvements using management of change best practices while promoting values of a safe, environmentally compliant workplace, and philosophy of continuous improvement with the workforce. Responsibilities: Process Improvements and Operational Upgrading Works...


  • Chicago, United States Allied Reliability Full time

    Overview The Maintenance Reliability Engineer is responsible for implementing machinery and process improvements using management of change best practices while promoting values of a safe, environmentally compliant workplace, and philosophy of continuous improvement with the workforce. Responsibilities Process Improvements and Operational Upgrading Works...


  • Chicago, United States OpenGov Full time

    OpenGov is home to an exceptional team - passionate about our mission to power more effective and accountable government. By bringing the OpenGov Cloud to our nation's state and local government, we’re transforming communities so they can thrive!  Imagine yourself being able to help small business owners open their doors faster, ensuring our tax...


  • Chicago, United States Rackera Inc Full time

    Find the below role Role : Site reliability engineerLocation :Chicago, IllinoisLong term project Job Description:6+ plus years of application development experience using modern technologies and architecture, including experience collaborating with technology teams.2 plus years of Site Reliability Engineering experience.Good Understanding of at least one...


  • Chicago, Illinois, United States Motion Recruitment Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, Illinois, United States Motion Recruitment Full time

    A financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...


  • Chicago, United States JobRialto Full time

    Top 3 requirements: Ecommerce experience (think Nordstrom, Target, where you purchase a product) Java Spring boot Kubernetes Plusses: Azure Kubernetes preferred Description: Client is looking for a forward-thinking, energetic Site Reliability Engineering Manager to join our team. Client serves the ecommerce needs of leading and growing grocery retailers...


  • Chicago, United States JobRialto Full time

    Top 3 requirements: Ecommerce experience (think Nordstrom, Target, where you purchase a product) Java Spring boot Kubernetes Plusses: Azure Kubernetes preferred Description: Client is looking for a forward-thinking, energetic Site Reliability Engineering Manager to join our team. Client serves the ecommerce needs of leading and growing grocery retailers with...


  • Chicago, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Chicago IL (Remote) Employment: Contract Job Summary: Experienced Senior Cloud Engineer with 6-10 years of experience in AWS and AWS Cloud Formation. Must have domain skill experience in Payer. Required Skills Technical Skills: Dynatrace, Python, AWS, AWS Cloud Watch, Glue and Lambda Roles &...


  • Chicago, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Chicago - IL (Remote) Employment: Contract JobSummary: Experienced Senior Cloud Engineer with 6-10 years of experience in AWS and AWS Cloud Formation. Must have domain skill experience in Payer. Required Skills Technical Skills: Dynatrace, Python, AWS, AWS Cloud Watch, Glue and Lambda Roles & Responsibilities...


  • Chicago, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Location: Chicago - IL (Remote) Employment: Contract JobSummary: Experienced Senior Cloud Engineer with 6-10 years of experience in AWS and AWS Cloud Formation. Must have domain skill experience in Payer. Required Skills Technical Skills: Dynatrace, Python, AWS, AWS Cloud Watch, Glue and Lambda Roles & Responsibilities...


  • Chicago, United States Fetch Full time

    What We're Building And Why We're Building It. There's a reason Fetch is ranked top 10 in Shopping in the App Store. Every day, millions of people earn Fetch Points buying brands they love. From the grocery aisle to the drive-through, Fetch makes saving money fun. We're more than just a build-first tech unicorn. We're a revolutionary shopping platform where...


  • Chicago, United States Fetch Full time

    What We're Building And Why We're Building It. There's a reason Fetch is ranked top 10 in Shopping in the App Store. Every day, millions of people earn Fetch Points buying brands they love. From the grocery aisle to the drive-through, Fetch makes saving money fun. We're more than just a build-first tech unicorn. We're a revolutionary shopping platform where...


  • Chicago, United States Oak Street Health Full time

    Description Company: Oak Street Health Title: Lead Engineer, Site Reliability Location: Chicago or Remote Role Description: As a Lead Engineer - Site Reliability Engineer (SRE), you will play a critical role in leading the design, implementation, and maintenance of highly available and scalable systems. You will leverage your extensive experience to drive...


  • Chicago, United States Adyen Full time

    This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...


  • Chicago, United States Cleo Full time

    Site Reliability Engineer At Cleo, we make doing business easy! Cleo is an established software company with a start-up feel. We have awesome products, which go hand in hand with our awesome culture! We are devoted to our people and pride ourselves on creating a fun, laid-back, but fast-paced work environment. Not only do we work hard, we play hard. We have...


  • Chicago, United States Oak Street Health Full time

    Description Company: Oak Street Health Title: Lead Engineer, Site Reliability Location: Chicago or Remote Role Description: As a Lead Engineer - Site Reliability Engineer (SRE), you will play a critical role in leading the design, implementation, and maintenance of highly available and scalable systems. You will leverage your extensive experience to drive...


  • Chicago, United States McDonald's Corporation Full time

    Job Description This opportunity is part of the DevOps COE in CPP Delivery office, where our mission is to help our product engineering teams deliver faster with improved quality and reliability. We work multi-functional with our global product teams and market teams in defining and executing on our automation test strategy, improving our build and deploy...


  • Chicago, United States Adyen Full time

    This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.  For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...


  • Chicago, United States Adyen Full time

    This is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.  For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...