Senior Site Reliability Engineer

2 weeks ago


Chicago, Illinois, United States The Hartford Full time
Senior Site Reliability Engineer
  • At The Hartford, we are committed to making a significant impact as an insurance provider that transcends traditional coverages and policies.
Being part of our team means you have the opportunity to achieve your professional aspirations while assisting others in reaching theirs. Join us as we work towards shaping the future.

The Group Benefits Technology division at The Hartford is in search of a dynamic Senior Site Reliability Engineer to become a vital member of our Reliability Engineering Team.

The ideal candidate will possess a robust background in Site Reliability Engineering (SRE) and IT operations, along with expertise in various programming languages.

This position necessitates a profound technical comprehension of intricate IT ecosystems, cloud computing, and emerging technologies.


Key Responsibilities:
  • Champion the implementation of top-tier software engineering standards and design methodologies for instrumenting code/application technology stacks to facilitate the generation of pertinent metrics regarding overall technology health, availability, performance, quality, and resilience.
  • Act as a principal liaison between architecture and software engineering teams to shape the technical strategy for the organization, considering its cross-functional impacts, integration across departments, and architectural rationalization.

  • Serve as the primary technical authority for the applications supported, necessitating extensive knowledge in technologies, applications, integrations, interfaces, and business domains.

Operational Responsibilities:

Ensure excellence in operations.

Independently lead the triaging and restoration of all high-impact incidents to minimize the mean time to service restoration and mitigate business impact.

Exhibit end-to-end ownership.

Collaborate with infrastructure teams to design and implement intelligent incident routing, enhanced monitoring/alerting capabilities, and automated service restoration processes.

Proactively take measures to avert high-impact incidents.

Maintain the continuity of The Hartford and third-party assets that support business functions.

Accountable for keeping IT application and infrastructure metadata repositories up to date.

Govern the overall Data & Analytics platform ecosystem with a focus on processes and solutions addressing data masking (PII management) and data lifecycle management needs.


Solutions Responsibilities:

Develop effective tools, alerts, and response mechanisms to identify and mitigate reliability risks, leveraging automation to support problem prevention, detection, mitigation, and resolution.

Enhance the delivery process by engineering appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability.

Progressively implement preventative controls and drive increased automation and self-healing capabilities.

Continuously improve cost efficiency baselines.

Promote and implement innovative solutions.


Data Engineering Responsibilities:

Lead the migration of applications to open-source platforms, PaaS, and utilize containers and other cloud technology standards for cloud enablement and platform agility.

Drive simplification across the technology stack, ensuring that all technical designs can be effectively operated without adding unnecessary complexity.

Facilitate inner- and open-sourcing practices to accelerate the development of self-service enterprise capabilities (platform, infrastructure, security, etc.).

Demonstrate strong experience in establishing scalable Software Development Life Cycle (SDLC) environments using commercial off-the-shelf (COTS), PaaS, and SaaS products catering to data pipeline requirements.

Possess the ability to build solutions that promote the migration of applications to open-source platforms, PaaS, and utilize containers and other cloud technology standards for cloud enablement and platform agility.


Qualifications:

5+ years of relevant technical experience.

Bachelor's degree or equivalent work experience in Computer Science, Information Technology Management, or a related field.

Ability to engage with diverse technical and non-technical groups within a matrix organization.

Solid understanding of AWS, DevSecOps practices, and SAFe Agile methodologies.

Familiarity with programming languages such as Python, Lambda, Go, Java, etc.


Familiarity with enterprise software solutions:

(MS Teams, ServiceNow, Rally, etc.)

Expertise with cloud platforms like AWS and microservices architecture.

Hands-on experience with observability tools such as Dynatrace, SPLUNK, CloudWatch, CloudTrail, etc.

Experience with continuous integration and DevOps methodologies, tools including GitHub, Jenkins, Nexus.

Exceptional communication skills (written, oral, presentation, and facilitation).

Understanding of robotics and artificial intelligence to enhance services.

Experience in strategy development to achieve business objectives.

Hands-on application development and production support experience is a plus.

Ability to develop, manage, and communicate frameworks (e.g., Cloud Security Alliance).

Solid understanding of technologies that support services offered for cloud applications.

Excellent analytical and problem-solving skills.

Must be authorized to work in the US without company sponsorship.

Compensation: The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency, and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition.

Equal Opportunity Employer: The Hartford is an equal opportunity employer and welcomes applicants from diverse backgrounds.



  • Chicago, Illinois, United States Circle Full time

    About CircleCircle is a pioneering financial technology company at the forefront of the emerging internet of money, where value can flow freely, globally, and instantly, revolutionizing the way we think about payments, commerce, and markets. Our cutting-edge infrastructure, including the blockchain-based USDC, empowers businesses, institutions, and...


  • Chicago, Illinois, United States Calabitek Full time

    Job DescriptionPosition: Site Reliability EngineerLocation: RemoteExperience: 10+ yearsThis position is responsible for ensuring application observability, maintenance, and support. The role involves identifying and implementing proactive preventive measures, evaluating, and recommending techniques, practices, or technologies that align with business...


  • Chicago, Illinois, United States Calabitek Full time

    Job OverviewPosition: Site Reliability EngineerLocation: Chicago, IL (Local Candidates Preferred)Experience: 10+ YearsThis position is crucial for ensuring application observability, ongoing maintenance, and robust support. The role involves identifying and implementing proactive preventive measures, as well as evaluating and recommending techniques,...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the RoleThis is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services.Key Responsibilities:Design and drive monitoring, alerting, and ticket...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the RoleThis is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services.**Key Responsibilities:**Design and drive monitoring, alerting, and...


  • Chicago, Illinois, United States Jobot Full time

    Remote Azure Site Reliability Engineer OpportunityThis position is hosted by Jobot Consulting.About Us:We are a dynamic tech consulting firm seeking a Senior Cloud Site Reliability Engineer with a strong background in Azure Cloud. In this role, you will play a key part in implementing Site Reliability Engineering (SRE) practices across our enterprise...


  • Chicago, Illinois, United States Oak Street Health Full time

    Transformative Role at Oak Street HealthWe are seeking a skilled Site Reliability Engineer to collaborate with our software engineering teams in implementing monitoring and alerting solutions, designing performance tests, and automating tasks to enhance efficiency.Key ResponsibilitiesDesign and implement telemetry, monitoring, and alerting systems to ensure...


  • Chicago, Illinois, United States TEKsystems Full time

    Position Overview:This Site Reliability Engineering (SRE) team is responsible for facilitating in-depth advisory sessions, establishing SRE program leadership internally, and recruiting and nurturing talent for client projects.The Practice Architect will be strategic, generating innovative SRE methodologies in areas such as observability, production...


  • Chicago, Illinois, United States CCC Intelligent Solutions, Inc. Full time

    About the RoleCareer Opportunities at CCC Intelligent Solutions, Inc.We are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at CCC Intelligent Solutions, Inc. As a key member of our engineering team, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based applications and services...


  • Chicago, Illinois, United States Gusto Full time

    About GustoGusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • Chicago, Illinois, United States Donato Technologies, Inc Full time

    Job OverviewPosition Title: DevOps EngineerCompany: Donato Technologies, IncWork Model: HybridOnsite Days: Tuesday - ThursdayContract Duration: 6 MonthsPosition SummaryWe are in search of a skilled DevOps Engineer to partner with our Application Development teams in delivering innovative business solutions through agile methodologies while effectively...


  • Chicago, Illinois, United States DASH2 Full time

    OverviewDASH2 is seeking experienced technical professionals who are eager to excel in delivering top-tier SaaS solutions. We provide a stimulating environment that encourages growth, adaptability, and the consistent application of your skills. Our clients depend on us during critical moments, and our engineering team is committed to fulfilling that...


  • Chicago, Illinois, United States Ardmore Roderick Full time

    Ardmore Roderick is in search of a Senior Site Engineering Supervisor to become a vital part of our team, reporting directly to the Project Resident Engineer. In this role, the Senior Site Engineering Supervisor will be tasked with overseeing the construction inspection and documentation for a significant transit station and substation construction...


  • Chicago, Illinois, United States Jobot Full time

    Remote Azure Site Reliability Engineer Opportunity with a Leading Tech Consulting FirmAbout Us:We are a dynamic consulting organization seeking a seasoned Cloud Site Reliability Engineer with a strong foundation in Azure Cloud technologies. This fully remote position is pivotal in implementing Site Reliability Engineering (SRE) methodologies across our...


  • Chicago, Illinois, United States Stardom Employment Consultants Full time

    Job Description:As a Site Reliability Engineer at Stardom Employment Consultants, you will be responsible for maintaining and improving the reliability, availability, and performance of our systems. You will collaborate closely with development, operations, and security teams to build and automate scalable infrastructure, monitor system health, and address...


  • Chicago, Illinois, United States GATX Full time

    Position OverviewGATX Corporation, a leader in the industry since 1898, is seeking a HYBRID Reliability Engineer to enhance our engineering team. With a commitment to excellence and a culture that fosters collaboration, we provide our employees with a dynamic environment to thrive. Our corporate office, located in a prestigious area, reflects our dedication...


  • Chicago, Illinois, United States DASH2 Full time

    About the RoleWe are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at DASH2. As a key member of our engineering team, you will play a critical role in ensuring the reliability and performance of our cloud-based SaaS products.Key ResponsibilitiesNetwork Infrastructure Management: Design and implement secure, redundant, and...


  • Chicago, Illinois, United States GATX Full time

    Position OverviewFounded in 1898, GATX Corporation is a prominent player in the industry, boasting over 125 years of success driven by our dedicated workforce. We take pride in our high-performance culture and our enthusiastic management team, fostering a collaborative environment that allows employees to make a significant impact from day one. Our...


  • Chicago, Illinois, United States DASH2 Full time

    OverviewDASH2 is seeking skilled technical professionals at various levels who are eager to challenge themselves in delivering top-tier SaaS solutions. We provide a stimulating environment that encourages growth, adaptability, and the consistent application of your expertise. Our clients depend on us during critical moments, and our engineering team is...


  • Chicago, Illinois, United States Itron, Inc. Full time

    Itron is revolutionizing how utilities and cities manage energy and water. We are committed to creating a more sustainable, resourceful world. Join us.Job Family SummaryPlans, designs, develops and tests software systems or applications for software enhancements and new products including cloud-based or internet-related tools. Evaluates reliability of...