Senior Cloud Reliability Engineer

13 hours ago


Atlanta, Georgia, United States Diversity Resource Staffing Inc Full time

This is an exciting opportunity for a Senior Cloud Reliability Engineer in the Consumer SRE Team at Diversity Resource Staffing Inc, to provide secure, resilient, scalable and maintainable services for mortgage borrowers and lenders. The company operates numerous financial and commodity marketplaces and exchanges, including the New York Stock Exchange (NYSE).

Automation is a big part of what we do - we use infrastructure-as-code within our hybrid cloud to bring stability and scalability to Windows, Linux, Docker and Serverless applications in AWS, On-Prem and Azure environments. We reduce toil through scripting and automation of repetitive tasks. You will collaborate with Developers to deliver robust services, build actionable alerts to detect / avoid incidents and to detect performance bottlenecks, as well as automation to remediate issues.

Key Responsibilities

  • Employ deep troubleshooting skills to improve the availability, performance, and security of Ellie Mae Services.
  • Ensure services are designed with 24/7 availability and operational readiness and rigor
  • Implement proactive monitoring, alerting, trend analysis and self-healing systems
  • Define and measure KPIs and SLOs
  • Build automated deployments, automated tests, and operational tools
  • Participate in on-call rotation for Production support
  • Collaborate with Product and Support teams to plan and deploy product releases
  • Partner with other SREs and lead by example

Required Skills and Experience

  • 10+ years of Application/Systems engineering in 24x7 Production Services environments
  • BS in Computer Science, Computer Engineering, Math, or equivalent professional experience
  • Excellent troubleshooter, utilizing a systematic problem-solving approach
  • Demonstrate the ability to lead Incident Response and root cause analysis (RCA)
  • Fluency with one or more current generation scripting language used by SRE/DevOps professionals (Powershell, Python, Perl, PHP, Ruby) + Java/.NET development
  • Experience running a SaaS application in a public cloud, on-prem or hybrid cloud environment

Additional Credit

  • Proficiency in Windows and on-prem environments
  • Experience with Continuous Integration and Continuous Delivery concepts.
  • Automation in RunDeck or Jenkins
  • Infrastructure-as-code or Configuration Management, utilizing tools like Terraform, CloudFormation or Chef/SaltStack/Puppet/DSC
  • Containers/Docker/Micro-Services


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in Atlanta, GA. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based platform.About the RoleThis is a full-time position that requires a minimum of 8 years of...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    Job Title: Senior Site Reliability EngineerMicrosoft is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Office 365 Enterprise Cloud team, you will be responsible for designing, implementing, and maintaining highly scalable and reliable cloud-based systems.Key Responsibilities:Design and implement scalable...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Job Title: Senior Site Reliability Engineer IIAt Motion Recruitment, we are seeking a highly skilled Senior Site Reliability Engineer II to join our team. As a key member of our SRE/Platform team, you will be responsible for ensuring the reliability and scalability of our SaaS-based AI/ML product.About the Role:Work closely with the SRE/Platform team to...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Job Title: Senior Site Reliability EngineerA leading organization in the financial services industry is seeking a highly skilled Senior Site Reliability Engineer to join their team. This is a full-time position based in the Atlanta area, requiring Monday, Tuesday, and Wednesday commutes to the office.The company is a pioneer in creating an AI/ML platform to...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    Job DescriptionMicrosoft Corporation is seeking a highly skilled Senior Cloud Reliability Engineer to join our Cloud+Artificial Intelligence (C+AI) Silver SQL Team. This team is responsible for deploying and operating the Azure SQL family of services within Azure Government clouds.In this role, you will have the opportunity to work with engineers who enable...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in Atlanta, Georgia. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based platform.Key ResponsibilitiesDesign and implement scalable and reliable cloud infrastructure using AWS and...


  • Atlanta, Georgia, United States UKG Full time

    About the Role:As a Senior Site Reliability Engineer at UKG, you will be responsible for developing software solutions to enhance, harden, and support our service delivery processes. This includes building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance analysis, monitoring, alerting, chaos engineering, and auto...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    {"title": "Senior Site Reliability Engineer II", "description": "Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer II to join our team in Atlanta, Georgia. As a key member of our SRE/Platform team, you will work directly on our SaaS-based AI/ML product, focusing on analytics for communications data. With a strong background in AWS,...


  • Atlanta, Georgia, United States Diversity Resource Staffing Inc Full time

    Senior Site Reliability EngineerThis is an exciting opportunity to join our Consumer SRE Team at IMT division, where you will play a key role in providing secure, resilient, scalable, and maintainable services for mortgage borrowers and lenders.Our team operates in a hybrid cloud environment, utilizing infrastructure-as-code to bring stability and...


  • Atlanta, Georgia, United States Bank of America Full time

    Job Title: Cloud Senior Site Reliability EngineerJob Summary:Bank of America is seeking a highly skilled Cloud Senior Site Reliability Engineer to join our team. As a key member of our cloud infrastructure team, you will be responsible for designing, building, and maintaining our next-gen AWS platform.Key Responsibilities:Design and implement scalable and...

  • Senior Cloud Engineer

    4 weeks ago


    Atlanta, Georgia, United States Next Level Business Services, Inc. Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Next Level Business Services, Inc. This role will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications.Key Responsibilities:Design and implement scalable and highly...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Job DescriptionA leading financial services organization is seeking a seasoned Senior Site Reliability Engineer to join their team in Atlanta, GA.About the RoleThis company is a pioneer in AI/ML platform development, specializing in detecting fraudulent communications. They require an SRE with 8-10 years of experience in AWS and EKS to handle mature...


  • Atlanta, Georgia, United States STORD Full time

    About the RoleStord is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our cloud infrastructure team, you will be responsible for designing and implementing scalable, secure, and efficient cloud infrastructure solutions.Key ResponsibilitiesCollaborate with cross-functional teams to design and implement CI/CD...

  • Senior Cloud Engineer

    3 weeks ago


    Atlanta, Georgia, United States Next Level Business Services, Inc. Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Next Level Business Services, Inc. This role will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications.Key Responsibilities:Design and implement scalable and highly...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    Job Title: Senior Site Reliability EngineerMicrosoft is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Cloud Engineering team, you will be responsible for designing, building, and operating large-scale cloud-based systems that meet the needs of our customers.About the Role:Design and implement scalable,...


  • Atlanta, Georgia, United States Bank of America Full time

    Job Title: Cloud Senior Site Reliability EngineerWe are seeking a highly skilled Cloud Senior Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, building, and maintaining our next-gen AWS platform.Key Responsibilities:Collaborate with a diverse set of engineers, architects, and teams to...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in Atlanta, Georgia. As a key member of our Cloud Operations team, you will be responsible for designing, implementing, and maintaining our global distributed cloud environments.Key Responsibilities:Collaborate with SRE and Cloud teams...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in Atlanta, Georgia. As a key member of our Cloud Operations team, you will be responsible for designing, implementing, and maintaining our global distributed cloud environments.Key Responsibilities:Work with members of the SRE and...


  • Atlanta, Georgia, United States STORD Full time

    About StordStord is a leading commerce enablement provider of fulfillment services and technology that powers seamless checkout and delivery experiences for high-volume mid-market and enterprise brands across all channels.Job DescriptionWe are seeking a mission-driven Senior Site Reliability Engineer to be a driving force behind an exceptionally resilient,...


  • Atlanta, Georgia, United States Fox Point Recruitment LLc Full time

    Job Title: Senior Cloud Software EngineerWe are seeking a highly skilled Senior Cloud Software Engineer to join our expanding platform as a service team. Our platform as a service is responsible for providing the foundation for cloud-based products and utilizing a variety of features and services that are found on Google Cloud Platform.Key...