Site Reliability Engineering

1 week ago


Atlanta, United States STAFFWORXS Full time

Site Reliability Engineering (SRE) Architect Get AI-powered advice on this job and more exclusive features. This range is provided by STAFFWORXS. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $70.00/hr - $75.00/hr Direct message the job poster from STAFFWORXS Delivery Manager @ STAFFWORXS | US IT Recruitment Job Title: Site Reliability Engineering (SRE) Architect Location: Atlanta, Georgia Work Model: Hybrid (In-person presence required) Overview We are seeking a highly experienced Site Reliability Engineering (SRE) Architect to lead the strategic design, development, and maturity of our reliability engineering practices. This role goes beyond operational support, focusing on defining the architectural blueprint, standards, and frameworks that guide development and SRE operations teams in building resilient, scalable, and high-performing systems. The SRE Architect will influence technology decisions, enhance system observability, and foster a culture of reliability across the organization. Key Responsibilities Reliability Strategy & Architecture Architect scalable, highly available, secure, and cost-effective solutions on AWS. Define and promote SRE standards, best practices, and architectural blueprints across engineering teams. Evaluate and enhance current observability systems, identifying gaps and driving next-level maturity to improve system insights. Lead the definition and implementation of SLIs, SLOs, and error budgets for critical services. Design solutions to eliminate operational toil through automation and improved system architecture. Assess existing SRE tools, CI/CD pipelines, IaC modules, and automated remediation frameworks, proposing improvements. Evaluate and recommend new tools, technologies, and practices to strengthen reliability, productivity, and operational excellence. Technical Leadership & Consultation Serve as a senior advisor on reliability, scalability, and performance across development and platform teams. Offer architectural guidance for new services to ensure reliability principles are integrated from the start. Mentor SREs and engineers, promoting strong engineering discipline and adherence to SRE principles. Lead architecture reviews and production readiness assessments for critical systems. Resilience Engineering Lead blameless postmortems for major incidents and drive systemic architectural improvements. Advocate and architect resilience patterns including circuit breakers, rate limiting, graceful degradation, and chaos engineering. Required Qualifications Proven experience in architectural roles focused on reliability, scalability, and performance. Deep hands-on expertise with SRE principles (SLIs/SLOs, error budgets, automation, incident management). Strong AWS experience across infrastructure, networking, and security. Expertise with containerization and orchestration (Kubernetes, Docker, serverless). Experience building observability solutions (Dynatrace, Prometheus, Grafana, ELK/EFK, Jaeger, OpenTelemetry). Strong programming/scripting abilities (Python, Go, Bash). Excellent analytical and strategic problem-solving skills. Strong communication, collaboration, and leadership abilities. Preferred Qualifications Experience implementing and maturing chaos engineering practices and platforms. Seniority level Mid-Senior level Employment type Contract Job function Other Referrals increase your chances of interviewing at STAFFWORXS by 2x #J-18808-Ljbffr



  • Atlanta, United States Origami Risk Full time

    Join to apply for the Site Reliability Engineer role at Origami Risk1 day ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Origami RiskThe Site Reliability Engineer is a key force behind improving Origami’s time to resolution and advancing overall site reliability and scalability. This person participates in...


  • Atlanta, United States Tata Consultancy Services Full time

    Site Reliability Engineer (SRE) - Full Time Location: Atlanta Metropolitan Area Salary Range: $100,000 - $125,000 per year Job Description We are seeking an experienced Site Reliability Engineer to build and support a reliable application suite, implement Service‑Reliability Engineering practices, and ensure the availability, reliability, and performance...


  • Atlanta, United States McKesson’s Corporate Full time

    Site Reliability Engineer page is loaded## Site Reliability Engineerremote type: Hybridtime type: Full timeposted on: Posted 4 Days Agojob requisition id: JR0140698McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more...

  • Site Reliability Engineer

    31 minutes ago


    Atlanta, United States AutoRABIT Full time

    About the role:AutoRABIT is looking for a Site Reliability/DevSecOps Engineer to help develop, scale and operate our cloud services. In this role you will be an experienced business professional able to implement and execute best practice operations and improvements across teams by providing visibility and recommendations for improved reliability and...


  • Atlanta, United States Rx Savings Solutions Full time

    Site Reliability Engineer McKesson is an impact‑driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well‑being of you and those we serve – we care. Rx Savings...


  • Atlanta, Georgia, United States Wormhole Foundation Full time

    The Wormhole Foundation**Our mission is to empower passionate people in the research and development of blockchain interoperability technologies. We support teams building secure, open-source, and decentralized products within the Wormhole ecosystem.The Role: Site Reliability Engineer**Wormhole Foundation is seeking an experienced Site Reliability Engineer...


  • Atlanta, United States AutoRABIT Holding Inc. Full time

    About the role:AutoRABIT is looking for a Site Reliability/DevSecOps Engineer to help develop, scale and operate our cloud servicesIn this role you will be an experienced business professional able to implement and execute best practice operations and improvements across teams by providing visibility and recommendations for improved reliability and...

  • Site Reliability Engineer

    14 minutes ago


    Atlanta, United States Highbrow LLC Full time

    Job Title :- Site Reliability Engineer (SRE) Employment Type :- W2 Duration :- Long Term Visa Type :- All Visa applicable which are ready for W2 Location :- Atlanta, GA (Onsite) Job Description We are seeking a highly skilled Site Reliability Engineer (SRE)with expertise in Adobe Experience Manager (AEM) 6.x+architecture, including Sling, OSGi, JCR, and...


  • Atlanta, United States Origami Risk LLC Full time

    OverviewThe Site Reliability Engineer is a key force behind improving Origami’s time to resolution and advancing overall site reliability and scalability. This person participates in efforts to identify root causes during post-incident investigations, while also identifying preventative measures to minimize future disruptions. They also assist with...


  • Atlanta, Georgia, United States Florence Healthcare Full time

    What We Do: Florence software advances cures by helping the world's most important research sites do their best work. Our solutions are now used by over 30,000 research teams in 70 countries around the world—we're the most widely deployed site workflow tool in the industry. By the end of the decade, we'll double the pace at which new medicines get to...