Senior Principal Site Reliability Engineer

5 days ago


Chicago, Illinois, United States Northern Trust Full time
About Northern Trust

Northern Trust is a globally recognized financial institution with a rich history dating back to 1889. As a Fortune 500 company, we pride ourselves on providing innovative financial services and guidance to the world's most successful individuals, families, and institutions.

Job Summary

We are seeking an experienced Senior Principal Site Reliability Engineer to join our team. This role will play a pivotal part in ensuring the reliability and performance of our systems and services. As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key observability services with a deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering.

Key Responsibilities
  • Lead the design and architecture of providing reliability, scalability, and performance of critical complex systems.
  • Develop and maintain automation scripts and tools to streamline operations and reduce manual tasks.
  • Collaborate with root cause analysis and implement measures to prevent recurrence of issues.
  • Design and implement comprehensive monitoring and observability solutions to proactively detect and address issues prior to them impacting our business.
  • Identify opportunities for improving system reliability through process enhancements and technical solutions.
  • Create and maintain detailed documentation of systems, processes, and procedures.
  • Communicate effectively with stakeholders across different teams and levels within the organization.
  • Manage and prioritize multiple projects and initiatives related to reliability and performance improvements.
Requirements
  • Bachelor's degree or equivalent experience.
  • 10+ years in systems engineering with a focus on reliability, systems operations, and software engineering.
  • 5+ years as a Team lead or a hands-on Technical Manager role that can engage and deliver projects to completion.
  • Strong proficiency in programming languages such as Python, Go, Ruby, Java, etc.
  • Experience with both on-prem and cloud solutions.
  • Experience with containerization.
  • Demonstrated ability to design and implement systems that ensure observability with associated dashboards.
  • Deep understanding of distributed systems, networking, and modern software architectures.
  • Excellent problem-solving skills and ability to handle complex technical challenges.
  • Strong dedication to customer needs, with excellent communication and the ability to build lasting relationships, alongside the capability to articulate complex reliability strategies in a clear and impactful manner.
  • Prior experience delivering Infrastructure as Code via a CI/CD pipeline.
  • Proven experience in leading a mentoring technical teams.
  • Skilled in implementing automation for corrective action based on deployed observability solutions.
  • Practical experience operating in an Agile development environment.
What We Offer

As a Northern Trust partner, you will be part of a flexible and collaborative work culture in an organization where financial strength and stability is an asset that emboldens us to explore new ideas. We offer a range of benefits, including a competitive salary, comprehensive health insurance, and a generous retirement plan. We also provide opportunities for professional growth and development, as well as a commitment to diversity and inclusion.

We are an equal opportunities employer and welcome applications from all qualified candidates. If you need a reasonable accommodation for any part of the employment process, please email our HR Service Center at hrservicecenter@northerntrust.com.



  • Chicago, Illinois, United States Northern Trust Full time

    About Northern TrustNorthern Trust is a leading global financial institution with a rich history dating back to 1889. As a Fortune 500 company, we have established ourselves as a trusted partner for individuals, families, and institutions seeking innovative financial solutions.Our commitment to service, expertise, and integrity has enabled us to build...


  • Chicago, Illinois, United States TalTeam Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at TalTeam Inc. in O'Fallon, MO. As a key member of our technology team, you will be responsible for building and maintaining monitoring and automation solutions to improve application and infrastructure availability.Key...


  • Chicago, Illinois, United States TalTeam Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at TalTeam Inc. in O'Fallon, MO. As a key member of our technology team, you will be responsible for building and maintaining monitoring and automation solutions to improve application and infrastructure availability.Key...


  • Chicago, Illinois, United States Saxon Global Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Saxon Global. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.About the RoleThis is a remote opportunity that requires a strong background in Azure and systems engineering. You...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerJob Summary:Diverse Lynx LLC is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based applications. You will work closely with our development and operations teams to identify...


  • Chicago, Illinois, United States Oak Street Health Full time

    About Oak Street HealthOak Street Health is a leading healthcare technology company that is transforming the way healthcare is delivered to seniors. Our mission is to inspire and empower healthcare providers to deliver high-quality, patient-centered care.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerAt Diverse Lynx LLC, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our applications and infrastructure.Key Responsibilities:Lead production stability efforts by preventing...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job DescriptionJob Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the stability and reliability of our cloud-based infrastructure. You will work closely with our development team to identify and...


  • Chicago, Illinois, United States CIRCLE Full time

    About CircleCircle is a financial technology company at the forefront of the emerging internet of money, where value can flow freely and securely across borders. Our mission is to create an inclusive financial future, with transparency at our core.Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of...


  • Chicago, Illinois, United States Circle Full time

    About CircleCircle is a financial technology company at the forefront of the emerging internet of money, where value can flow freely and securely across borders.Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team and help us build and maintain our cloud infrastructure. As a key member of our engineering team, you will...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerAt Diverse Lynx LLC, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications.Key Responsibilities:Lead production stability efforts by preventing...


  • Chicago, Illinois, United States Info Way Solutions Full time

    Job Title: Site Reliability EngineerInfo Way Solutions is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based applications.Key Responsibilities:Lead production stability efforts by preventing production issues...


  • Chicago, Illinois, United States Cleo Full time

    About CleoCleo is a software company that makes doing business easy. We pride ourselves on creating a fun, laid-back, but fast-paced work environment.Our CultureWe have a passion for all things nerdy and hire like-minded people who aren't afraid to think outside of the box. Our team is devoted to our people and values collaboration, innovation, and...


  • Chicago, Illinois, United States Northern Trust Full time

    About Northern Trust:Northern Trust is a globally recognized, award-winning financial institution with a rich history dating back to 1889. As a Fortune 500 company, we provide innovative financial services and guidance to the world's most successful individuals, families, and institutions.Job Summary:We are seeking an experienced Site Reliability Engineer to...


  • Chicago, Illinois, United States LightEdge Solutions Full time

    Job Title: Site Reliability EngineerLightEdge Solutions is a leading provider of IT solutions, driving business growth and innovation through cutting-edge technology. We are seeking a highly skilled Site Reliability Engineer to join our team.Job SummaryThe Site Reliability Engineer will be responsible for ensuring the reliable operation of our systems and...


  • Chicago, Illinois, United States Cleo Full time

    About CleoCleo is a software company that makes doing business easy. We pride ourselves on creating a fun, laid-back, but fast-paced work environment.Our CultureWe're a team of like-minded individuals who aren't afraid to think outside the box. We have a passion for all things nerdy and value creativity, innovation, and collaboration.The RoleWe're looking...


  • Chicago, Illinois, United States Cleo Full time

    About CleoCleo is a software company that makes doing business easy. We pride ourselves on creating a fun, laid-back, but fast-paced work environment.Our CultureWe have a passion for all things nerdy and hire like-minded people who aren't afraid to think outside of the box. Our team is devoted to our people and values collaboration, innovation, and...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job DescriptionThis role will be responsible for ensuring the reliability and performance of our cloud-based applications. The ideal candidate will have a strong understanding of modern cloud technologies and experience collaborating with technology teams.As a Site Reliability Engineer, you will work closely with our Application Support and Development teams...


  • Chicago, Illinois, United States Oak Street Health Full time

    Role OverviewWe are seeking a highly skilled Site Reliability Engineer to join our team at Oak Street Health. As a Site Reliability Engineer, you will play a critical role in ensuring the stability and performance of our platform, which is built specifically for the clinical team in the healthcare industry.Key ResponsibilitiesCollaborate with our software...


  • Chicago, Illinois, United States Enova Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Enova International. As a Site Reliability Engineer, you will play a critical role in maintaining the reliability of our consumer business from a technology and operational standpoint.You will collaborate with IT, Software Engineering, and product teams to resolve...