Sr. DevOps/Site Reliability Engineer

2 days ago


Chicago, United States Sputnik Solutions Inc Full time

Job Description

We are looking for a Senior Site Reliability Engineer (SRE) with deep experience in AWS infrastructure, automation, observability, and production support. As an SRE, you will ensure our cloud-native systems are resilient, scalable, and efficient, driving reliability through code, not just processes.

Requirements

Key Responsibilities:
  • Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS
  • Develop and improve CI/CD pipelines and Infrastructure as Code (IaC) using Terraform and Harness
  • Own and implement monitoring, alerting, logging, and distributed tracing with tools like Dynatrace/Datadog
  • Troubleshoot production incidents, conduct blameless postmortems, and improve incident response processes
  • Optimize systems for cost, performance, and reliability
  • Drive chaos engineering and resilience testing
  • Collaborate with development teams to embed SRE practices like SLAs, SLOs, and error budgets
  • Mentor junior SREs and promote DevOps/SRE culture across the organization

Basic Qualifications:
  • Strong experience in SRE, DevOps, or Cloud Engineering
  • Expertise in AWS core services (EC2, ECS/EKS, Lambda, S3, VPC, RDS, IAM, CloudFront, etc.)
  • Hands-on experience with Terraform, Ansible, or other IaC tools
  • Strong scripting/coding skills (Python, Go, Shell, etc.)
  • Experience with Kubernetes, containerization, and orchestration
  • Deep knowledge of Linux systems and networking

Preferred Qualifications:
  • Experience with Service Meshes (e.g., Istio, App Mesh)
  • Familiarity with AWS Well-Architected Framework
  • Experience building self-healing systems and automated remediation
  • Background in security, compliance, or multi-account/multi-region AWS architectures

Certifications (Optional/Preferred):
  • AWS Certified DevOps Engineer – Professional
  • AWS Certified Solutions Architect – Professional



  • Chicago, United States ExecutivePlacements.com Full time

    We are looking for a Senior Site Reliability Engineer (SRE) with deep experience in AWS infrastructure, automation, observability, and production support. As an SRE, you will ensure our cloud‑native systems are resilient, scalable, and efficient, driving reliability through code, not just processes. Requirements: Design, implement, and maintain scalable,...


  • Chicago, United States Request Technology, LLC Full time

    ***We are unable to sponsor for this permanent full-time role*** ***Position is bonus eligible*** Prestigious Financial Company is currently seeking a Sr. Site Reliability Engineer. Candidate will provide support for the availability and performance of next generation platform and will enhance system reliability and developer productivity through automation....


  • Chicago, IL, United States Request Technology, LLC Full time

    ***We are unable to sponsor for this permanent full-time role*** ***Position is bonus eligible*** Prestigious Financial Company is currently seeking a Sr. Site Reliability Engineer. Candidate will provide support for the availability and performance of next generation platform and will enhance system reliability and developer productivity through automation....


  • Chicago, United States Early Warning® Full time

    At Early Warning, we’ve powered and protected the U.S. financial system for over thirty years with cutting-edge solutions like Zelle, Paze℠, and so much more. As a trusted...


  • Chicago, United States Qorali Full time

    Site Reliability Engineer – Cloud & Automation. Location: Chicago. Visa Sponsorship: Not available. A technology-driven organization is seeking an experienced Site Reliability Engineer to support and enhance the reliability of its next-generation platform. The role focuses on automation, cloud infrastructure, and system performance. Key Responsibilities: Ensure...


  • Chicago, Illinois, United States Moonlite AI Full time

    Moonlite delivers high-performance AI infrastructure for organizations running intensive computational research, large-scale model training, and demanding data processing workloads. We provide infrastructure deployed in our facilities or co-located in yours, delivering flexible on-demand or reserved compute that feels like an extension of your existing data...


  • Chicago, United States Request Technology, LLC Full time

    Site Reliability Engineer Hybrid (3 days onsite, 2 days remote) full‑time. No visa sponsorship. Base pay: $150,000 – $155,000 per year, subject to skills and experience. A prestigious company seeks a Site Reliability Engineer focused on observation, logging, and capacity planning. The role requires experience with Linux, Kubernetes/Docker, Terraform,...