Current jobs related to Site Reliability Engineer - Chicago - Brain Bolt Consulting


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our cloud-based applications. Key Responsibilities:Design and implement monitoring, metrics, and logging systems to ensure application...


  • Chicago, Illinois, United States Oak Street Health Full time

    Role OverviewWe are seeking a skilled Site Reliability Engineer to join our team at Oak Street Health. As a Site Reliability Engineer, you will play a critical role in ensuring the stability and performance of our platform, which is built specifically for the clinical team. You will partner with our software engineering teams to transform ideas into reality,...


  • chicago, United States Matlen Silver Full time

    Compensation: $70 - $75/HourHybrid: 2 Days Onsite Chicago IllinoisDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • chicago, United States Matlen Silver Full time

    Compensation: $70 - $75/HourHybrid: 2 Days Onsite Chicago IllinoisDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • Chicago, United States Matlen Silver Full time

    Compensation: $70 - $75/HourHybrid: 2 Days Onsite Chicago IllinoisDomain: Retail/Supply ChainJob Title: Site Reliability EngineerPosition SummaryAs a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions for our client to support omni-channel...


  • Chicago, United States Algo Capital Group Full time

    Linux Site Reliability Engineer – Linux Systems Engineering TeamOur client, an industry leading proprietary trading firm and liquidity provider, is looking for a Linux Site Reliability Engineer to join their expanding Linux Systems Engineering Team in Chicago. The firm prides itself on its collaborative environment and usage of mostly in-home tools and...


  • chicago, United States Algo Capital Group Full time

    Linux Site Reliability Engineer – Linux Systems Engineering TeamOur client, an industry leading proprietary trading firm and liquidity provider, is looking for a Linux Site Reliability Engineer to join their expanding Linux Systems Engineering Team in Chicago. The firm prides itself on its collaborative environment and usage of mostly in-home tools and...


  • Chicago, Illinois, United States Enova Full time

    About the Role: As a Site Reliability Engineer at Enova, you will play a crucial part in maintaining the reliability of our consumer business from a technology and operational standpoint. You will drive the rapid improvement and efficiency of our platform by implementing automated tools, evaluating processes, troubleshooting, and resolving complex problems....


  • Chicago, Illinois, United States Matlen Silver Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at Matlen Silver. As a key member of our infrastructure and operations team, you will be responsible for ensuring the availability, performance, and reliability of our Fulfillment Technology solutions.Key Responsibilities:Partner with application engineering, observability,...


  • Chicago, Illinois, United States CloudBC Labs Full time

    Job Title: Site Reliability EngineerJob Summary:CloudBC Labs is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. You will work closely with our development team to identify and resolve issues, and...


  • Chicago, United States Selby Jennings Full time

    A leading Proprietary Trading firm is seeking a Site Reliability Engineer to join their team. You'll design and support the systems used by electronic trading desks leveraging tools like Linux, Kubernetes, and Python. What you'll do: Support software development teams to implement different parts of the application life cycle, i.e. application deployment,...


  • Chicago, United States Selby Jennings Full time

    A leading Proprietary Trading firm is seeking a Site Reliability Engineer to join their team. You'll design and support the systems used by electronic trading desks leveraging tools like Linux, Kubernetes, and Python. What you'll do: Support software development teams to implement different parts of the application life cycle, i.e. application deployment,...


  • Chicago, United States Selby Jennings Full time

    A leading Proprietary Trading firm is seeking a Site Reliability Engineer to join their team.You'll design and support the systems used by electronic trading desks leveraging tools like Linux, Kubernetes, and Python.What you'll do:Support software development teams to implement different parts of the application life cycle, i.e. application deployment,...


  • Chicago, Illinois, United States Enova Full time

    About the Role:As a Site Reliability Engineer at Enova International, you will play a critical role in maintaining the reliability of our consumer business from a technology and operational standpoint. Your expertise will drive the rapid improvement and efficiency of our platform by implementing automated tools, evaluating processes, and troubleshooting...


  • Chicago, Illinois, United States Bank of America Full time

    Job Description:At Bank of America, we are committed to delivering exceptional customer experiences through the power of every connection. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and observability of our services.Key responsibilities include:Partnering with engineering and technology teams to improve...


  • Chicago, Illinois, United States Northern Trust Full time

    About Northern Trust:Northern Trust is a globally recognized financial institution with a rich history dating back to 1889. We provide innovative financial services and guidance to the world's most successful individuals, families, and institutions.We are committed to delivering exceptional service, expertise, and integrity in all our endeavors. Our team of...


  • Chicago, Illinois, United States iManage Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at iManage. As a key member of our global SRE team, you will contribute to the development and maintenance of our cloud-based platform. Your expertise in cloud infrastructure, Kubernetes, and containerization will be instrumental in ensuring the scalability,...


  • Chicago, Illinois, United States TalTeam Full time

    Job Summary TalTeam is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will work closely with technology support teams and application teams to build monitoring and automation solutions to improve application and infrastructure availability.Key Responsibilities Represent the Enterprise Monitoring team...


  • Chicago, United States Oak Street Health Full time

    Role DescriptionAs a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our stellar software engineering teams in a fast-paced, agile environment to transform ideas into a reality....


  • Chicago, Illinois, United States Adyen Full time

    We are looking for a highly technical Senior Site Reliability Engineer to join our Internal Services team at Adyen. As a Site Reliability Engineer, you will be responsible for the stability and reliability of our internal services.The ideal candidate will have 7+ years of relevant work experience and a solid understanding of the Linux operating system and...

Site Reliability Engineer

1 month ago


Chicago, United States Brain Bolt Consulting Full time

Responsibilities:

  • Analyse, design, program, test, and deploy new user stories and features with high quality (security, reliability, operations) to production
  • Achieves team commitments (and influence others to do the same) by using informal leadership & highly developed communication skills
  • Has an oversight on design decisions and guides team to achieve key results for products assigned to them
  • Remediates issues using engineering principles and creates proactive design solutions for potential failures
  • Work with a team of site reliability engineers that is responsible for building the continuous reliability mindset, shepherding problem management, and driving key site reliability engineering practices into the organization.
  • Design and drive monitoring, alerting, ticket reporting strategies to measure SLA, SLO, MTTI, MTTR. Etc. and align with management expectations to reduce/minimize prod downtime.
  • Guide site reliability automation to help eliminate manual toil and create a self-healing capability
  • Participate in selection of appropriate automation tools, defining technology, quality, experience and implementation standards and practices within own technical domain.
  • Fosters a culture of excellence and continuous learning within the chapter. Establishes and tracks to appropriate OKRs to ensure outcomes are met.
  • Creates solutions addressing high impact technology and business priorities
  • Competent in multiple contexts, such as programming languages, security, automation, testing, infrastructure, and performance and is the go-to person for many people (inside and outside of their team)
  • Proactively identifies and mitigates issues based on intuition and experience in multiple domains

Must Have Skills:

  • Experienced with AWS Cloud
  • Experienced in building and managing OCP clusters, deploy applications into OCP
  • Experience with SRE design to address reliability and resiliency with availability of 5-9s
  • Experience in managing caching solutions like Hazelcast, GemFire or Terracota
  • Experience in setting up and managing Kafka
  • High level of familiarity with the Linux command line and scripting
  • Extremely comfortable with production environments, firewalls, and networking
  • Strong experience in deploying, observing, altering, logging, and monitoring systems (Splunk, Datadog, AppDynamics, Instana) with a mindset towards predictive analysis.
  • Working knowledge of the automation tools such as Ansible, Terraform, or Chef
  • Experience in performing RCA, Disaster Recovery activities, Chaos Engineering

Good to have Skills:

  • Highly preferred experience working in the payments industry
  • Deep knowledge and understanding of emerging trends in the SRE field.
  • Experience developing in Java (or other similar languages)
  • Studied architectural patterns at scale, including thoughtfully designed APIs, repeatable delivery pipelines, and efficient computer engineering principles.
  • Working knowledge of messaging services like RabbitMQ, SQS, Kafka
  • Strong Experience with Continuous Integration and Continuous Delivery models including Blue/Green and/or Canary release models

Tools & Technologies:

  • Open-shift Container Platform
  • (Splunk, Datadog, AppDynamics, Instana)
  • HazelCast.
  • Ansible, Terraform, or Chef
  • RabbitMQ, SQS, Kafka
  • Linux VMs , Shell Scripting
  • AWS CLoud
  • Postgress Database