Sr. Site Reliability Engineer (SRE)

7 days ago


Chicago, United States Request Technology, LLC Full time

***We are unable to sponsor for this permanent full-time role***
***Position is bonus eligible***

Prestigious Financial Company is currently seeking a Sr. Site Reliability Engineer. The candidate will support the availability and performance of a next-generation platform and will enhance system reliability and developer productivity through automation. The candidate will also provide guidance to development and platform teams in the areas of cloud technologies, application profiling and monitoring, logging, and metrics collection and analysis.

Responsibilities:
  • Collaborate with development, operations and infrastructure teams to ensure availability of services and to work through implementation issues
  • Develop automation for incident response and to prevent problem recurrence
  • Create and enhance runbooks to respond to service outages or degradations
  • Assess the production readiness of services
  • Define and track operational metrics for production performance, reliability, scalability and availability
  • Architect, develop and maintain shared services and tools to improve reliability and reduce toil across the organization
  • Contribute to the team’s continuous improvement through research, retrospectives, discussion groups and code reviews
  • Influence timelines and expectations among the team
  • Share knowledge by guiding and mentoring junior members and preparing stories for the sprint backlog

Qualifications:
  • [Required] Experience with maintaining and troubleshooting large-scale distributed systems
  • [Required] Experience with Agile / Scrum methodology
  • [Required] Able to succeed in a fast-paced environment with frequent changes
  • [Required] Comfortable communicating with both technical and non-technical audiences
  • [Required] Strong documentation skills
  • [Required] Analytical problem-solving approach
  • [Required] Self-starter: takes the initiative to research, learn and deliver. Anticipates the play
  • [Required] Team player: humble, collaborative, and focused on making sure the entire team succeeds

Technical Skills:
  • [Required] Experience managing infrastructure in public cloud environments like AWS (preferred), Azure or Google Cloud Platform
  • [Required] Experience with AIOps and predictive analysis for anomaly detection and system capacity forecasting, using monitoring and alerting tools like Splunk, AppDynamics, Datadog, StackDriver, Sysdig, Prometheus or Grafana
  • [Required] Programming/scripting experience in languages like Java, Bash, Python or Go
  • [Required] Experience with distributed messaging systems like Kafka, RabbitMQ, or ActiveMQ
  • [Required] Experience with container orchestration systems like Kubernetes, Mesos, Docker Swarm or Rancher
  • [Required] Experience with Continuous Integration and Continuous Delivery (CI/CD) tools like Jenkins, Travis, Harness, Appveyor, CodeBuild or CodePipeline
  • [Required] Familiarity with leveraging large language models (LLMs) to automate and optimize SRE workflows. This may include using AI-powered tools to perform tasks such as writing scripts, summarizing incident reports, data analysis, or even creating and maintaining AI workloads
  • [Required] Basic exposure to Chaos Engineering tools like Gremlin, Chaos Monkey, Harness Chaos Engineering, or cloud-native fault injection services like AWS FIS

Education and/or Experience:
  • [Required] Bachelor’s or Master’s degree in Computer Science, Information Systems or other related field, or equivalent work experience
  • [Required] Minimum of 4 years of experience in Site Reliability Engineering / DevOps


