Site Reliability Engineer

2 weeks ago


Chicago, Illinois, United States Calabitek Full time
Job Overview

Position: Site Reliability Engineer

Location: Chicago, IL (Local Candidates Preferred)

Experience: 10+ Years

This position is crucial for ensuring application observability, ongoing maintenance, and robust support. The role involves identifying and implementing preventive measures, as well as evaluating and recommending techniques, practices, and technologies that align with business objectives.

As a Site Reliability Engineer, you will work closely with Application Support and Development teams to deliver business solutions using agile methodologies while managing production-related challenges.

The ideal candidate will demonstrate exceptional leadership and communication abilities, along with a comprehensive understanding of contemporary cloud technologies, particularly within the financial sector.


KEY RESPONSIBILITIES:
  • Lead initiatives aimed at enhancing production stability by preventing issues and improving overall system reliability.
  • Define and enforce Service Level Objectives (SLOs) and Service Level Agreements (SLAs), and manage error budgets to ensure system availability.
  • Monitor key performance indicators, including response times, error rates, and uptime, to align operational performance with strategic business goals.
  • Identify opportunities for continuous improvement, such as automating tasks and reducing manual effort, to decrease production incidents.
  • Participate in the design and deployment of monitoring, metrics, and logging systems, and develop application dashboards.
  • Ensure minimal downtime through effective monitoring, alerting, self-healing automation, and ongoing enhancements.
  • Provide reactive support and communicate issue resolution status to project teams and management.
  • Develop expertise in assigned application domains and interpret alerts from tools like SiteScope, Dynatrace, and ELK for root cause analysis.
  • Be adaptable to learning new technologies and willing to engage in hands-on development.
  • Deliver regular, high-quality updates to stakeholders regarding progress on user stories and IT service management issues.
  • Attend regular meetings with Project and Development teams to prioritize and address production issues.

REQUIRED SKILLS AND EXPERIENCE:
  • 5-6 years of application development experience using modern technologies and architectures, with a background of collaborating with technology teams.
  • 2+ years of experience in Site Reliability Engineering.
  • Strong understanding of at least one public cloud platform, preferably Microsoft Azure or Pivotal Cloud Foundry.
  • Proficient in REST APIs and their practical applications.
  • Experience with continuous integration and collaboration tools such as Azure DevOps, JIRA, Bitbucket, GitHub, and Confluence.
  • Solid knowledge of and hands-on experience with Bash, the Linux command line, and Azure CLI.

Familiarity with technologies such as Java, J2EE, Pivotal Cloud Foundry, cloud computing (IaaS, PaaS, and SaaS), RESTful interfaces, Git, Gradle, Maven, npm, Spring (Spring Batch and Spring Boot), CSS3, and HTML4 is advantageous.



  • Chicago, Illinois, United States Calabitek Full time

    Job Description. Position: Site Reliability Engineer. Location: Remote. Experience: 10+ years. This position is responsible for ensuring application observability, maintenance, and support. The role involves identifying and implementing proactive preventive measures, evaluating, and recommending techniques, practices, or technologies that align with business...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the Role: This is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services. Key Responsibilities: Design and drive monitoring, alerting, and ticket...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the Role: This is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services. Key Responsibilities: Design and drive monitoring, alerting, and...


  • Chicago, Illinois, United States Oak Street Health Full time

    Transformative Role at Oak Street Health: We are seeking a skilled Site Reliability Engineer to collaborate with our software engineering teams in implementing monitoring and alerting solutions, designing performance tests, and automating tasks to enhance efficiency. Key Responsibilities: Design and implement telemetry, monitoring, and alerting systems to ensure...


  • Chicago, Illinois, United States Circle Full time

    About Circle: Circle is a pioneering financial technology company at the forefront of the emerging internet of money, where value can flow freely, globally, and instantly, revolutionizing the way we think about payments, commerce, and markets. Our cutting-edge infrastructure, including the blockchain-based USDC, empowers businesses, institutions, and...


  • Chicago, Illinois, United States The Hartford Full time

    Senior Site Reliability Engineer: At The Hartford, we are committed to making a significant impact as an insurance provider that transcends traditional coverages and policies. Being part of our team means you have the opportunity to achieve your professional aspirations while assisting others in reaching theirs. Join us as we work towards shaping the...


  • Chicago, Illinois, United States Gusto Full time

    About Gusto: Gusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • Chicago, Illinois, United States Donato Technologies, Inc Full time

    Job Overview. Position Title: DevOps Engineer. Company: Donato Technologies, Inc. Work Model: Hybrid. Onsite Days: Tuesday - Thursday. Contract Duration: 6 Months. Position Summary: We are in search of a skilled DevOps Engineer to partner with our Application Development teams in delivering innovative business solutions through agile methodologies while effectively...


  • Chicago, Illinois, United States DASH2 Full time

    Overview: DASH2 is seeking experienced technical professionals who are eager to excel in delivering top-tier SaaS solutions. We provide a stimulating environment that encourages growth, adaptability, and the consistent application of your skills. Our clients depend on us during critical moments, and our engineering team is committed to fulfilling that...


  • Chicago, Illinois, United States Stardom Employment Consultants Full time

    Job Description: As a Site Reliability Engineer at Stardom Employment Consultants, you will be responsible for maintaining and improving the reliability, availability, and performance of our systems. You will collaborate closely with development, operations, and security teams to build and automate scalable infrastructure, monitor system health, and address...


  • Chicago, Illinois, United States TEKsystems Full time

    Position Overview: This Site Reliability Engineering (SRE) team is responsible for facilitating in-depth advisory sessions, establishing SRE program leadership internally, and recruiting and nurturing talent for client projects. The Practice Architect will be strategic, generating innovative SRE methodologies in areas such as observability, production...


  • Chicago, Illinois, United States DASH2 Full time

    Overview: DASH2 is seeking skilled technical professionals at various levels who are eager to challenge themselves in delivering top-tier SaaS solutions. We provide a stimulating environment that encourages growth, adaptability, and the consistent application of your expertise. Our clients depend on us during critical moments, and our engineering team is...


  • Chicago, Illinois, United States Itron, Inc. Full time

    Itron is revolutionizing how utilities and cities manage energy and water. We are committed to creating a more sustainable, resourceful world. Join us. Job Family Summary: Plans, designs, develops and tests software systems or applications for software enhancements and new products including cloud-based or internet-related tools. Evaluates reliability of...


  • Chicago, Illinois, United States Jobot Full time

    Remote Azure Site Reliability Engineer Opportunity. This position is hosted by Jobot Consulting. About Us: We are a dynamic tech consulting firm seeking a Senior Cloud Site Reliability Engineer with a strong background in Azure Cloud. In this role, you will play a key part in implementing Site Reliability Engineering (SRE) practices across our enterprise...


  • Chicago, Illinois, United States Jobot Full time

    Remote Azure Site Reliability Engineer Opportunity with a Leading Tech Consulting Firm. About Us: We are a dynamic consulting organization seeking a seasoned Cloud Site Reliability Engineer with a strong foundation in Azure Cloud technologies. This fully remote position is pivotal in implementing Site Reliability Engineering (SRE) methodologies across our...

  • Reliability Engineer

    4 weeks ago


    Chicago, Illinois, United States GATX Full time

    Overview: Founded in 1898 and headquartered in Chicago, IL, GATX Corporation (NYSE: GATX) is an industry leader with 125+ years of success, success that is powered by our people. We are proud of our high-performance culture, hard-working and enthusiastic management team, and beautiful office space in the Willis Tower. At GATX, we hire the best and offer our...


  • Chicago, Illinois, United States The Hartford Full time

    About The Hartford: The Hartford is a leading insurance company that goes beyond traditional coverages and policies. We're committed to making a difference and proud to be an organization that values innovation and excellence. Job Summary: We're seeking a highly skilled Staff Reliability Engineer to join our Reliability Engineering Team. As a key member of our...


  • Chicago, Illinois, United States CCC Intelligent Solutions, Inc. Full time

    About the Role: Career Opportunities at CCC Intelligent Solutions, Inc. We are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at CCC Intelligent Solutions, Inc. As a key member of our engineering team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications and services...

  • Reliability Engineer

    4 hours ago


    Chicago, Illinois, United States Mondelez Global LLC Full time

    Job Summary: We are seeking a highly skilled Reliability Engineer to join our team at Mondelez Global LLC. As a key member of our manufacturing support team, you will be responsible for ensuring the reliability, availability, and performance of equipment and machinery within our facilities. Key Responsibilities: Equipment Maintenance and Reliability: Develop and...


  • Chicago, Illinois, United States GATX Full time

    Position Overview: GATX Corporation, a leader in the industry since 1898, is seeking a HYBRID Reliability Engineer to enhance our engineering team. With a commitment to excellence and a culture that fosters collaboration, we provide our employees with a dynamic environment to thrive. Our corporate office, located in a prestigious area, reflects our dedication...