Site Reliability Engineer
1 week ago
***Hybrid, 3 days onsite, 2 days remote******We are unable to sponsor as this is a permanent full-time role***A prestigious company is looking for a Site Reliability Engineer. This role is focused on observation, logging, and capacity planning. This engineer will need experience/exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible, Harness, and Kafka. Responsibilities:Collaborate with development, operations and infrastructure teams to ensure availability of services, and to work through implementation issuesDevelop automation for incident response and to prevent problem recurrenceCreate and enhance runbooks to respond to service outages or degradationsAssess the production readiness of servicesDefine and track operational metrics for production performance, reliability, scalability and availabilityArchitect, develop and maintain shared services and tools to improve reliability and reduce toil across the organizationQualifications:Bachelor’s or Master’s Degrees in Computer Science, Information Systems or other related field, or equivalent work experienceMinimum of 4+ years of experience in Site Reliability Engineering / DevOpsExperience with maintaining and troubleshooting large-scale distributed systemsExperience managing infrastructure in public cloud environments like AWS (preferred), Azure or GCPExperience with AIOps and predictive analysis for anomaly detection, forecasting system capacity using monitoring and alerting tools like Splunk, AppDynamics, Datadog, StackDriver, Sysdig, Prometheus or GrafanaProgramming/scripting experience in languages like Java, Bash, Python or GoExperience with distributed messaging systems like Kafka, RabbitMQ, or ActiveMQExperience with container orchestration systems like Kubernetes, Mesos, Docker Swarm or RancherExperience with using Continuous Integration and Continuous Delivery (CI/CD) tools like Jenkins, Travis, Harness, Appveyor, CodeBuild or CodePipelineFamiliarity with leveraging large language models (LLMs) to automate and optimize SRE workflows. This may include using AI-powered tools to perform tasks such as, writing scripts, summarizing incident reports, or even creating and maintaining AI workloads.
-
Site Reliability Engineer
1 week ago
Chicago, United States Qorali Full timeSite Reliability Engineer – Cloud & AutomationLocation: Chicago Visa Sponsorship: Not available A technology-driven organization is seeking an experienced Site Reliability Engineer to support and enhance the reliability of its next-generation platform. The role focuses on automation, cloud infrastructure, and system performance. Key Responsibilities:Ensure...
-
Site Reliability Engineer
3 weeks ago
Chicago, United States Genesis10 Full timeSite Reliability EngineerGenesis10 is currently seeking a Site Reliability Engineer with our client in the financial industry located in Chandler, AZ, Chicago, IL, Kennesaw, GA, and Richmond, VA. This is a 12+ month contract position. Responsibilities include reliability and support of Container Platform on-prem and external clouds (Azure/AWS/Google)....
-
Site Reliability Engineer
3 weeks ago
Chicago, United States Request Technology, LLC Full timeSite Reliability Engineer Hybrid (3 days onsite, 2 days remote) full‑time. No visa sponsorship. Base pay: $150,000 – $155,000 per year, subject to skills and experience. A prestigious company seeks a Site Reliability Engineer focused on observation, logging, and capacity planning. The role requires experience with Linux, Kubernetes/Docker, Terraform,...
-
Site Reliability Engineer
1 week ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
4 days ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
2 days ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
3 hours ago
Chicago, IL, United States HCL Global Systems Full timeEdward Jones Site Reliability Engineer 100% remote Initial contract is 6 months, but will be a multi year engagement. Position Overview: As a Senior Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems. You will be responsible for incident management, root cause analysis, and...
-
Site Reliability Engineer
5 days ago
Chicago, United States Request Technology, LLC Full time***Hybrid, 3 days onsite, 2 days remote******We are unable to sponsor as this is a permanent full-time role***A prestigious company is looking for a Site Reliability Engineer. This role is focused on observation, logging, and capacity planning. This engineer will need experience/exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible,...
-
Site Reliability Engineer
2 days ago
Chicago, IL, United States Request Technology Full time***Hybrid, 3 days onsite, 2 days remote******We are unable to sponsor as this is a permanent full-time role***A prestigious company is looking for a Site Reliability Engineer. This role is focused on observation, logging, and capacity planning. This engineer will need experience/exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible,...
-
Site Reliability Engineer
3 weeks ago
Chicago, United States Qorali Full timeThis range is provided by Qorali. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $140,000.00/yr - $150,000.00/yr Location: Chicago A technology-driven organization is seeking an experienced Site Reliability Engineer to support and enhance the reliability of its next-generation platform....