Site Reliability Engineer

2 weeks ago


Chicago, IL, United States HCL Global Systems Full time

Edward Jones
Site Reliability Engineer
100% remote
Initial contract is 6 months, but this will be a multi-year engagement.

Position Overview:
As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and implementing postmortem processes to prevent future incidents. Your expertise in VMware, Linux, networking, Kubernetes, Java, IBM Mainframe/DB2, Oracle, MongoDB, messaging systems (Kafka), and SRE tools such as Dynatrace and Splunk will be instrumental in maintaining and optimizing our infrastructure.
Responsibilities:
• Lead incident management efforts, including rapid response, coordination, and communication during incidents.
• Conduct thorough root cause analysis (RCA) for incidents and drive the implementation of corrective actions.
• Develop and refine postmortem processes to ensure continuous learning and improvement from incidents.
• Design, implement, and maintain highly available and scalable systems using Kubernetes.
• Develop and optimize Java applications for performance and reliability.
• Utilize Dynatrace and Splunk for monitoring, alerting, and troubleshooting system issues.
• Collaborate with development teams to integrate reliability and observability best practices into the software development lifecycle.
• Automate repetitive tasks and processes to reduce toil and improve efficiency.
Qualifications:
• Bachelor's degree in Computer Science, Engineering, or a related field.
• 2-5 years of experience as an SRE working on on-premises systems such as VMware, Kubernetes, and database systems.
• Strong experience with incident management, root cause analysis, and postmortem processes.
• Expertise in Kubernetes, Java, Dynatrace, Grafana and Splunk.
• Proficiency in scripting languages (e.g., Python, Bash) and automation tools (e.g., Ansible, Terraform).
• Ability to work effectively in a fast-paced, dynamic environment.



  • Chicago, IL, United States Request Technology Full time

    Hybrid, 3 days onsite, 2 days remote. We are unable to sponsor as this is a permanent full-time role. A prestigious company is looking for a Site Reliability Engineer. This role is focused on observation, logging, and capacity planning. This engineer will need experience/exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible,...


  • Chicago, IL, United States CADDi Full time

    For security reasons, the candidate must be a US Citizen or a Permanent Resident (Green Card). Overview: As a Site Reliability Engineer at CADDi, you will build and secure infrastructure supporting our AI platform with special attention to safeguarding US customer data and supporting the Aerospace and Defense Industrial Base. You'll have strong ownership of...


  • Chicago, IL, United States Info Way Solutions Full time

    Site Reliability Engineer in Wealth Management. Chicago (IL) / Tempe (AZ). Onsite job. Role: This role will be responsible for application observability, maintenance, and support; proactively identifying and implementing preventive measures; and evaluating and recommending techniques, practices, or technologies that would better serve business needs. As an SRE...