Engineer II, Site Reliability

3 weeks ago


Chicago, United States Oak Street Health Full time

Company: Oak Street Health
Title: Engineer II, Site Reliability Engineer
Location: Chicago

Role Description:
As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our stellar software engineering teams in a fast-paced, agile environment to transform ideas into reality. Using modern methodologies and open source tools, you will be empowered to set the engineering excellence standards as we seek to deliver applications that will directly and immediately impact the experience of our teams and our patients.

Core Responsibilities:
• Review systems to identify and implement the necessary telemetry, monitoring, and alerting for proactive and reactive management.
• Partner with Product and AD to define and review Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
• Participate in design reviews to ensure solutions can meet SLOs and SLAs.
• Design and automate performance and resiliency test cases in partnership with application development and infra teams.
• Identify and eliminate manual, repeatable tasks through automation or application enhancements, partnering with development.
• Other duties, as assigned.

What are we looking for?
• Bachelor's degree or relevant industry experience
• Minimum of 3 years of development experience in consumer-facing products leveraging cloud-native technologies
• Experience automating pipelines using continuous delivery tools
• Experience with system monitoring, alerting, and observability platform tools and best practices
• Experience with capacity planning and management
• Experience with resilient systems, resiliency testing, and design best practices
• Experience with nonfunctional requirements along with SLOs and SLAs
• Preferred: Experience with our tech stack (Istio, Grafana Labs, .NET Core, Confluent Kafka, Mongo, gRPC, AKS, Docker, Azure)
• Preferred: Experience managing Kubernetes clusters in a production environment
• Preferred: Experience monitoring applications at scale using microservices
• US work authorization
• Someone who embodies being 'Oaky'

What does being 'Oaky' look like?
• Radiating positive energy
• Assuming good intentions
• Creating an unmatched patient experience
• Driving clinical excellence
• Taking ownership and delivering results
• Relentlessly determined

Why Oak Street Health?
Oak Street Health is on a mission to 'Rebuild healthcare as it should be', providing personalized primary care for older adults on Medicare, with the goal of keeping patients healthy and living life to the fullest. Our innovative care model is centered right in our patients' communities and focused on the quality of care over volume of services.

We're an organization on the move. With over 150 locations and an ambitious growth trajectory, Oak Street Health is attracting and cultivating team members who embody 'Oaky' values and passion for our mission.

Oak Street Health Benefits:
• Mission-focused career impacting change and measurably improving health outcomes for Medicare patients
• Paid vacation, sick time, and investment/retirement 401(k) match options
• Health insurance, vision, and dental benefits
• Opportunities for leadership development and continuing education stipends
• New centers and flexible work environments
• Opportunities for high levels of responsibility and rapid advancement

Oak Street Health is an equal opportunity employer. We embrace diversity and encourage all interested readers to apply.

Learn more at www.oakstreethealth.com/diversity-equity-and-inclusion-at-oak-street-health



  • Chicago, United States Resource Logistics Full time

    Role: Site Reliability Engineer Location: Chicago, IL Hire Type: Full-time Responsibilities: Expertise with Monitoring, Alerting, Reliability Engineering & Observability Experience with Splunk, SignalFx or similar Tools Ability to create Log ingestions, Identify Metrics and KPIs App, Platform, Infra Logging & Alerting Best practices Creating Dashboards,...


  • Chicago, Illinois, United States Calabitek Full time

    Job Description: Position: Site Reliability Engineer. Location: Remote. Experience: 10+ years. This position is responsible for ensuring application observability, maintenance, and support. The role involves identifying and implementing proactive preventive measures, evaluating, and recommending techniques, practices, or technologies that align with business...


  • Chicago, United States Definity First Full time

    We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE at Definity First, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems. You will collaborate with cross-functional teams to design, build, and maintain our infrastructure, and you'll have the opportunity...


  • Chicago, United States Stardom Employment Consultants Full time

    Job Description: As a Site Reliability Engineer, you will be responsible for maintaining and improving the reliability, availability, and performance of our systems. You will collaborate closely with development, operations, and security teams to build and automate scalable infrastructure, monitor system health, and address issues before they impact users. The...


  • Chicago, Illinois, United States Calabitek Full time

    Job Overview: Position: Site Reliability Engineer. Location: Chicago, IL (Local Candidates Preferred). Experience: 10+ years. This position is crucial for ensuring application observability, ongoing maintenance, and robust support. The role involves identifying and implementing proactive preventive measures, as well as evaluating and recommending techniques,...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the Role: This is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services. Key Responsibilities: Design and drive monitoring, alerting, and ticket...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the Role: This is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services. Key Responsibilities: Design and drive monitoring, alerting, and...


  • Chicago, United States Oneview Healthcare Full time

    Job Description: Position Overview: Site Reliability Engineers support the smooth functioning of the Oneview system for our hospital customers, using their advanced technical and coding skills. People in this role will be former systems administrators or operations engineers with strong coding skills. Career development in this role...


  • Chicago, Illinois, United States Oak Street Health Full time

    Transformative Role at Oak Street Health: We are seeking a skilled Site Reliability Engineer to collaborate with our software engineering teams in implementing monitoring and alerting solutions, designing performance tests, and automating tasks to enhance efficiency. Key Responsibilities: Design and implement telemetry, monitoring, and alerting systems to ensure...


  • Chicago, Illinois, United States Circle Full time

    About Circle: Circle is a pioneering financial technology company at the forefront of the emerging internet of money, where value can flow freely, globally, and instantly, revolutionizing the way we think about payments, commerce, and markets. Our cutting-edge infrastructure, including the blockchain-based USDC, empowers businesses, institutions, and...


  • Chicago, United States Cleo Full time

    Site Reliability Engineer At Cleo, we make doing business easy! Cleo is an established software company with a start-up feel. We have awesome products, which go hand in hand with our awesome culture! We are devoted to our people and pride ourselves on creating a fun, laid-back, but fast-paced work environment. Not only do we work hard, we play hard. We have...


  • Chicago, United States Saxon Global Full time

    Northern Trust Site Reliability Engineer (Azure). Location: Downtown Chicago, onsite 2 days/week, 181 W Madison St. Duration: 12+ month contract w/extension/conversion. Overview: The Goals Driven Wealth Management platform is a showcase product for Northern Trust's Wealth Management business and we must demonstrate our ability to deliver and...


  • Chicago, United States AmericanEagle.com Full time

    Americaneagle.com is a family-owned web design, development, and digital marketing agency with a passionate belief in the power of technology to positively transform business practices. Our focus is on helping customers grow and achieve success in the digital space. We cover a variety of different industries, including eCommerce, associations & nonprofits,...


  • Chicago, United States PDSSOFT Full time

    8 Months Contract Only Locals within an hour's drive distance Chicago, IL, US, 60602 Must have 10+ yrs of IT experience Work Model: Hybrid Anchor Days: Monday, Wednesday, Friday Hours: 8:30am - 5pm CST Job Post Title Site Reliability/DevOps Engineer Job Post Summary Seeking a Site Reliability/DevOps Engineer to gather and analyze metrics to assist in...


  • Chicago, United States PDSSOFT Full time

    8 Months Contract Only Locals within an hour's drive distance Chicago, IL, US, 60602 Must have 10+ yrs of IT experience Work Model: Hybrid Anchor Days: Monday, Wednesday, Friday Hours: 8:30am - 5pm CST Job Post Title Site Reliability/DevOps Engineer Job Post Summary Seeking a Site Reliability/DevOps Engineer to gather and analyze metrics to assist...


  • Chicago, Illinois, United States The Hartford Full time

    Senior Site Reliability Engineer: At The Hartford, we are committed to making a significant impact as an insurance provider that transcends traditional coverages and policies. Being part of our team means you have the opportunity to achieve your professional aspirations while assisting others in reaching theirs. Join us as we work towards shaping the...


  • Chicago, United States Outdefine Full time

    Site Reliability Engineer Uber Freight Software 500+ Employees Location: Chicago, Illinois, USA About the Job Overview: Outdefine is a web3 talent community that connects top talent with leading-edge companies and enterprises globally. Companies choose to hire Outdefine Trusted Members because their skills and readiness have been proven. When you accept a...


  • Chicago, United States Cboe Full time

    Job Description Building trusted markets — powered by our people. At Cboe Global Markets, we inspire our people to solve complex challenges together because what we do matters. We provide the financial infrastructure that powers the global economy. As a leading provider of market infrastructure and tradable products, Cboe delivers cutting-edge trading,...


  • Chicago, Illinois, United States McDonald's Corporation Full time

    Job Summary: This opportunity is part of the DevOps Center of Excellence in the Corporate Productivity Delivery office, where our mission is to help our product engineering teams deliver faster with improved quality and reliability. We work multi-functionally with our global product teams and market teams in defining and executing on our automation test...