Senior Cloud Reliability Engineer

4 weeks ago


Atlanta, Georgia, United States UKG Full time
About the Role:
As a Senior Site Reliability Engineer at UKG, you will be responsible for developing software solutions to enhance, harden, and support our service delivery processes. This includes building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and auto remediation.

  • Engage in and improve the lifecycle of services from conception to EOL, including system design consulting and capacity planning.
  • Define and implement standards and best practices related to system architecture, service delivery, metrics, and the automation of operational tasks.
  • Support services, product, and engineering teams by providing common tooling and frameworks to deliver increased availability and improved incident response.
  • Improve system performance, application delivery, and efficiency through automation, process refinement, postmortem reviews, and in-depth configuration analysis.
  • Collaborate closely with engineering professionals within the organization to deliver reliable services.
  • Identify and eliminate operational toil by treating operational challenges as a software engineering problem.
  • Actively participate in incident response, including on-call responsibilities.

About You:

Basic Qualifications:

  • 3-5+ years of hands-on experience working in Engineering or Cloud.
  • 3-5+ years of experience with public cloud platforms (e.g., GCP, AWS, Azure).
  • Engineering degree, or a related technical discipline, or equivalent work experience.
  • Experience coding in higher-level languages (e.g., Python, JavaScript, C++, or Java).
  • Demonstrated understanding of best practices in metric generation and collection, log aggregation pipelines, time-series databases, and distributed tracing.
  • Demonstrable fundamentals in 2 of the following: Computer Science, Cloud Architecture, Security, or Network Design fundamentals.
  • Working experience with industry standards like Terraform, Ansible.
  • Experience working with automation.

Preferred Qualifications:

  • Experience with distributed system design and architecture.
  • Experience with containerization technologies.
  • Experience in configuration and maintenance of applications and/or systems infrastructure for large-scale customer-facing companies.


  • Atlanta, Georgia, United States Duck Creek Technologies Full time

    Job Title: Senior Cloud Reliability EngineerAbout the Role:We are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at Duck Creek Technologies. As a key member of our engineering organization, you will be responsible for designing and implementing scalable, secure, and highly available cloud solutions. Your expertise in cloud...


  • Atlanta, Georgia, United States ACL Digital Full time

    Job DescriptionWe are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at ACL Digital. As a key member of our Site Reliability Engineering team, you will be responsible for designing and implementing scalable, secure, and highly available cloud infrastructure solutions.Key Responsibilities:Design and implement cloud infrastructure...


  • Atlanta, Georgia, United States IRIS Consulting Corporation Full time

    Job DescriptionWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at IRIS Consulting Corporation. As a key member of our Retail, Site Reliability Engineering team, you will be responsible for establishing and maintaining the reliability of our cloud-based infrastructure and applications.Key Responsibilities:Design and implement...


  • Atlanta, Georgia, United States Pyramid Consulting, Inc. Full time

    Job SummaryWe are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at Pyramid Consulting, Inc. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud infrastructure. Your expertise in cloud computing, automation, and DevOps will enable us to...


  • Atlanta, Georgia, United States Diversity Resource Staffing Inc Full time

    This is an exciting opportunity for a Senior Cloud Reliability Engineer in the Consumer SRE Team at Diversity Resource Staffing Inc, to provide secure, resilient, scalable and maintainable services for mortgage borrowers and lenders. The company operates numerous financial and commodity marketplaces and exchanges, including the New York Stock Exchange...


  • Atlanta, Georgia, United States Jonas Software UK Full time

    About the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Jonas Software UK. As a key member of our technical operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...

  • Senior Cloud Engineer

    3 weeks ago


    Atlanta, Georgia, United States Next Level Business Services, Inc. Full time

    Job DescriptionWe are seeking a highly skilled Senior Cloud Engineer to join our team at Next Level Business Services, Inc. As a key member of our Site Reliability Engineering team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high availability, scalability, and security.Key Responsibilities:Design...

  • Senior Cloud Engineer

    3 weeks ago


    Atlanta, Georgia, United States New Relic Full time

    Job Title: Senior Cloud EngineerJob Summary:We are seeking a highly skilled Senior Cloud Engineer to join our team at New Relic. As a Senior Cloud Engineer, you will be responsible for designing, implementing, and managing secure, scalable, and reliable cloud infrastructure.Key Responsibilities:Design and implement cloud infrastructure on GCP, Azure, and...


  • Atlanta, Georgia, United States STORD Full time

    About the RoleStord is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing and implementing scalable, efficient, and secure infrastructure and platform solutions.You will collaborate with cross-functional teams to deliver high-quality products and services to our...


  • Atlanta, Georgia, United States TalentBridge Full time

    Job Title: Senior Cloud Software EngineerAt TalentBridge, we're seeking a talented Senior Cloud Software Engineer to join our Cloud Services team. As a key member of our team, you will be responsible for building and expanding the services powering our API ecosystem, solving problems for a large community of fellow developers.Key Responsibilities:Design and...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    We are seeking a highly skilled Senior Site Reliability Engineer to join our Windows Servicing and Delivery team at Microsoft Corporation.The ideal candidate will have a strong background in software engineering, network engineering, or systems administration, with a proven track record of delivering high-quality solutions that meet customer needs.As a...

  • Senior Cloud Engineer

    3 weeks ago


    Atlanta, Georgia, United States RIT Solutions, Inc. Full time

    Job Title: Senior Cloud EngineerJob Summary:RIT Solutions, Inc. is seeking a skilled Senior Cloud Engineer to join our engineering team. As a Senior Cloud Engineer, you will be responsible for designing, building, and maintaining cloud-based infrastructure using Azure and AWS services. You will work closely with our team to ensure the smooth operation of our...


  • Atlanta, Georgia, United States Now100 Full time

    Job Title: Site Reliability EngineerNow100 is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our product team, you will be responsible for building and growing the skillsets of junior engineers while maintaining high site uptime and availability.Key Responsibilities:Design and implement scalable and reliable...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer - Cloud ExpertJob Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. The ideal candidate will have strong knowledge in AWS cloud platform and expertise in developing and maintaining monitoring tools, alerts, and dashboards to provide visibility into system health and...


  • Atlanta, Georgia, United States Now100 Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Now100. As a Site Reliability Engineer, you will be responsible for building and supporting the platform/application infrastructure of one of the largest retailers in the world.Key Responsibilities:Maintain high site uptime/availability while embracing rapid change...


  • Atlanta, Georgia, United States Everbridge Full time

    About the Role:We are seeking a highly skilled Senior Data Reliability Engineer to join our team at Everbridge. As a key member of our Database Reliability Engineering team, you will be responsible for ensuring the overall service quality and availability of our data solutions.Key Responsibilities:Own operational availability, security, performance,...


  • Atlanta, Georgia, United States UKG Full time

    About the RoleAs a Site Reliability Engineer at UKG, you will play a critical role in ensuring the reliability and performance of our cloud-based services. You will be responsible for developing software solutions to enhance, harden, and support our service delivery processes.Key ResponsibilitiesDesign and implement scalable and reliable cloud...

  • Senior Cloud Engineer

    4 weeks ago


    Atlanta, Georgia, United States Delta Air Lines, Inc. Full time

    Join Delta Air Lines, Inc. as a Senior Software Development Engineer - AWS MigrationWe are seeking a highly skilled Senior Software Development Engineer to lead our AWS migration from on-prem to the cloud. As a key member of our IT department, you will design, implement, and optimize modernized specialized business applications, deploying to development,...

  • Senior Cloud Engineer

    3 weeks ago


    Atlanta, Georgia, United States Microsoft Corporation Full time

    Job Description:Microsoft Corporation is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for ensuring the high availability and performance of our cloud services.Key Responsibilities:Develop, test, and implement changes to optimize code and improve platforms.Leverage...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    Job SummaryAt SIDEARM Sports, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you'll play a critical role in ensuring the reliability, availability, and performance of our live services, which impact millions of customers across the entertainment space.Key ResponsibilitiesCollaborate with...