Senior Site Reliability Engineer

2 days ago


Chicago, Illinois, United States TalTeam Full time
Job Title: Senior Site Reliability Engineer

We are seeking a highly skilled Senior Site Reliability Engineer to join our team at TalTeam Inc. in O'Fallon, MO. As a key member of our technology team, you will be responsible for building and maintaining monitoring and automation solutions to improve application and infrastructure availability.

Key Responsibilities:
  • Represent the Enterprise Monitoring team in project meetings and provide advice, status, training, and technical support.
  • Work with customers to understand monitoring and automation requirements and implement solutions using available toolsets and scripting languages.
  • Administer, support, and maintain enterprise Monitoring tools in a multi-tier environment.
  • Build automation solutions using available toolsets and scripting languages.
  • Maintain design and support documents for all built solutions and processes.
  • Troubleshoot networking, Unix/Linux systems, and applications to identify and correct malfunctions and other operational problems using associated Linux and UNIX command line and management tools.
  • On-call administration and tool support.
  • Learn new technologies quickly and resolve any problems involved in integrating new technologies.
  • Maintain a broad knowledge of state-of-the-art technology, equipment, and/or systems.
  • Self-driven and flexible, willing to learn in adjacent areas with the initiative to learn more.
  • Thorough, adhering to critical processes even under stress.
  • Support business disaster recovery procedures for assigned areas of responsibility.
  • Accurately document duties and procedures to aid the department in cross-training and absentee coverage.
  • Work with technical engineering teams to manage and improve processes.
  • Ability to solve problems quickly and completely.
  • Ability to identify tasks that should be automated and then develop and implement automation.
Requirements:
  • Advanced user-level expertise in UNIX and/or Red Hat Linux.
  • Networking experience from basic to advanced, along with security knowledge.
  • Proficient with scripting or programming languages such as SQL, Perl, and shell scripting.
  • Experience with enterprise systems management/monitoring tools such as IBM Tivoli products, Microsoft System Center Operations Manager, Zabbix, Nagios, etc. is highly desirable.
  • Experienced in developing web applications on a Linux/Apache/MySQL/PHP stack is a strong plus.
  • Strong analytical, troubleshooting, and problem-solving skills.
  • Ability to manage multiple projects simultaneously under pressure without direct supervision.
  • Ability to manage multiple activities and work with a strong sense of urgency.
  • Ability and motivation to learn new technologies quickly and with minimal support and guidance.
  • Evening, weekend, and shift on-call required to meet deadlines and correct system failures or for patch upgrades.
  • Strong people skills and the ability to understand business needs and translate them into technical solutions.
  • Excellent verbal and written skills, organization, project prioritization, and time management skills.
Required Technical Skills:
  • Knowledge of any 2 Monitoring tools like Splunk, Dynatrace.
  • Experience with a variety of enterprise storage and networking systems, and especially with Cisco equipment.
  • Familiarity with IBM Tivoli Network Manager, including configuration of device discovery, SNMP traps, and probes.
Desired/Nice to Have Technical Skills:
  • Ability to drive from customer monitoring requirement to a technical solution with minimal supervision.
  • Experienced in NcKL (Netcool Knowledge Library).
  • Experience writing probe rules for IBM Netcool Omnibus.

TalTeam Inc. is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against on the basis of disability.



  • Chicago, Illinois, United States Northern Trust Full time

    About Northern TrustNorthern Trust is a globally recognized financial institution with a rich history dating back to 1889. As a Fortune 500 company, we pride ourselves on providing innovative financial services and guidance to the world's most successful individuals, families, and institutions.Job SummaryWe are seeking an experienced Senior Principal Site...


  • Chicago, Illinois, United States TalTeam Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at TalTeam Inc. in O'Fallon, MO. As a key member of our technology team, you will be responsible for building and maintaining monitoring and automation solutions to improve application and infrastructure availability.Key...


  • Chicago, Illinois, United States Northern Trust Full time

    About Northern TrustNorthern Trust is a leading global financial institution with a rich history dating back to 1889. As a Fortune 500 company, we have established ourselves as a trusted partner for individuals, families, and institutions seeking innovative financial solutions.Our commitment to service, expertise, and integrity has enabled us to build...


  • Chicago, Illinois, United States Saxon Global Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Saxon Global. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.About the RoleThis is a remote opportunity that requires a strong background in Azure and systems engineering. You...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerJob Summary:Diverse Lynx LLC is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based applications. You will work closely with our development and operations teams to identify...


  • Chicago, Illinois, United States Oak Street Health Full time

    About Oak Street HealthOak Street Health is a leading healthcare technology company that is transforming the way healthcare is delivered to seniors. Our mission is to inspire and empower healthcare providers to deliver high-quality, patient-centered care.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerAt Diverse Lynx LLC, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our applications and infrastructure.Key Responsibilities:Lead production stability efforts by preventing...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job DescriptionJob Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the stability and reliability of our cloud-based infrastructure. You will work closely with our development team to identify and...


  • Chicago, Illinois, United States CIRCLE Full time

    About CircleCircle is a financial technology company at the forefront of the emerging internet of money, where value can flow freely and securely across borders. Our mission is to create an inclusive financial future, with transparency at our core.Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of...


  • Chicago, Illinois, United States Circle Full time

    About CircleCircle is a financial technology company at the forefront of the emerging internet of money, where value can flow freely and securely across borders.Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team and help us build and maintain our cloud infrastructure. As a key member of our engineering team, you will...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerAt Diverse Lynx LLC, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications.Key Responsibilities:Lead production stability efforts by preventing...


  • Chicago, Illinois, United States Info Way Solutions Full time

    Job Title: Site Reliability EngineerInfo Way Solutions is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based applications.Key Responsibilities:Lead production stability efforts by preventing production issues...


  • Chicago, Illinois, United States Cleo Full time

    About CleoCleo is a software company that makes doing business easy. We pride ourselves on creating a fun, laid-back, but fast-paced work environment.Our CultureWe have a passion for all things nerdy and hire like-minded people who aren't afraid to think outside of the box. Our team is devoted to our people and values collaboration, innovation, and...


  • Chicago, Illinois, United States Northern Trust Full time

    About Northern Trust:Northern Trust is a globally recognized, award-winning financial institution with a rich history dating back to 1889. As a Fortune 500 company, we provide innovative financial services and guidance to the world's most successful individuals, families, and institutions.Job Summary:We are seeking an experienced Site Reliability Engineer to...


  • Chicago, Illinois, United States LightEdge Solutions Full time

    Job Title: Site Reliability EngineerLightEdge Solutions is a leading provider of IT solutions, driving business growth and innovation through cutting-edge technology. We are seeking a highly skilled Site Reliability Engineer to join our team.Job SummaryThe Site Reliability Engineer will be responsible for ensuring the reliable operation of our systems and...


  • Chicago, Illinois, United States Cleo Full time

    About CleoCleo is a software company that makes doing business easy. We pride ourselves on creating a fun, laid-back, but fast-paced work environment.Our CultureWe're a team of like-minded individuals who aren't afraid to think outside the box. We have a passion for all things nerdy and value creativity, innovation, and collaboration.The RoleWe're looking...


  • Chicago, Illinois, United States Cleo Full time

    About CleoCleo is a software company that makes doing business easy. We pride ourselves on creating a fun, laid-back, but fast-paced work environment.Our CultureWe have a passion for all things nerdy and hire like-minded people who aren't afraid to think outside of the box. Our team is devoted to our people and values collaboration, innovation, and...


  • Chicago, Illinois, United States Diverse Lynx Full time

    Job DescriptionThis role will be responsible for ensuring the reliability and performance of our cloud-based applications. The ideal candidate will have a strong understanding of modern cloud technologies and experience collaborating with technology teams.As a Site Reliability Engineer, you will work closely with our Application Support and Development teams...


  • Chicago, Illinois, United States Oak Street Health Full time

    Role OverviewWe are seeking a highly skilled Site Reliability Engineer to join our team at Oak Street Health. As a Site Reliability Engineer, you will play a critical role in ensuring the stability and performance of our platform, which is built specifically for the clinical team in the healthcare industry.Key ResponsibilitiesCollaborate with our software...


  • Chicago, Illinois, United States Enova Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Enova International. As a Site Reliability Engineer, you will play a critical role in maintaining the reliability of our consumer business from a technology and operational standpoint.You will collaborate with IT, Software Engineering, and product teams to resolve...