Current jobs related to Senior Site Reliability Engineer - Atlanta, Georgia - Diversity Resource Staffing Inc


  • Atlanta, Georgia, United States Jonas Software UK Full time

    About the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Jonas Software UK. As a key member of our technical operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    We are seeking a highly skilled Senior Site Reliability Engineer to join our Windows Servicing and Delivery team at Microsoft Corporation.The ideal candidate will have a strong background in software engineering, network engineering, or systems administration, with a proven track record of delivering high-quality solutions that meet customer needs.As a...


  • Atlanta, Georgia, United States STORD Full time

    About the RoleStord is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing and implementing scalable, efficient, and secure infrastructure and platform solutions.You will collaborate with cross-functional teams to deliver high-quality products and services to our...


  • Atlanta, Georgia, United States Learfield Full time

    About LearfieldLearfield is a leading media and technology services company in intercollegiate athletics, unlocking the value of college sports for brands and fans through an omnichannel platform with innovative content and commerce solutions for fan engagement.Job Title: Senior Site Reliability EngineerWe are seeking an experienced Senior Site Reliability...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    Job SummaryAt SIDEARM Sports, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you'll play a critical role in ensuring the reliability, availability, and performance of our live services, which impact millions of customers across the entertainment space.Key ResponsibilitiesCollaborate with...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Microsoft Corporation. As a key member of our Windows Servicing and Delivery team, you will be responsible for ensuring the reliability and performance of our product offerings, including Windows client, Windows Update, and Windows Autopatch.Key Responsibilities...


  • Atlanta, Georgia, United States Greenlight Full time

    About GreenlightGreenlight is a leading family fintech company dedicated to empowering parents to raise financially savvy kids. Our mission is to create a better, brighter future for the next generation.Job SummaryWe are seeking a Senior Site Reliability Engineer to join our team. As a key member of our Engineering organization, you will play a critical role...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    About UsSIDEARM Sports is a leading provider of technology solutions for collegiate athletic programs. We're a passionate team of technologists, creatives, and strategists dedicated to delivering exceptional products and services to our partners and their fans.Job SummaryWe're seeking an experienced Senior Site Reliability Engineer to join our team. As a key...


  • Atlanta, Georgia, United States Cox Enterprises Full time

    About the RoleCox Automotive is seeking a highly skilled Senior Site Reliability Engineer to join our Manheim Logistics SRE team. As a key member of our team, you will be responsible for designing and maintaining AWS infrastructure and deployment pipelines for our 15+ development teams.Key ResponsibilitiesDesign and implement scalable and reliable...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    About UsSIDEARM Sports is a leading provider of technology solutions for collegiate athletic programs. We're a passionate team of technologists, creatives, and strategists dedicated to delivering exceptional products and services.Job DescriptionWe're seeking an experienced Senior Site Reliability Engineer to join our team. As a key member of our SRE team,...


  • Atlanta, Georgia, United States Cox Communications Full time

    About the RoleThis is an exciting opportunity to join our team as a Senior Site Reliability Engineer. As a key member of our Manheim Logistics SRE team, you will play a crucial role in designing and maintaining AWS infrastructure and deployment pipelines for our 15+ development teams.We are looking for a highly skilled and experienced engineer who can work...


  • Atlanta, Georgia, United States Pyramid Consulting Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Pyramid Consulting, Inc. This is a contract opportunity with long-term potential and is located in Atlanta, GA.Key ResponsibilitiesDesign and implement SLOs / SLIs / error budgets and manage reliability for infrastructure and applicationsProven experience with...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    About the RoleMicrosoft Corporation is seeking a highly skilled Senior Site Reliability Engineering Manager to lead the delivery of critical features in Office 365 government cloud offerings. As a key member of the Office 365 team, you will be responsible for combining your passion for quality, reliability, and creativity to drive evolution in the continuous...


  • Atlanta, Georgia, United States Greenlight Full time

    Job DescriptionGreenlight is a leading fintech company on a mission to help parents raise financially smart kids. We serve over 6 million parents and kids with our award-winning banking app for families. Our platform allows parents to automate allowance, manage chores, set flexible spend controls, and invest for their family's future. Kids and teens learn to...


  • Atlanta, Georgia, United States Pyramid Consulting Full time

    Pyramid Consulting is seeking a talented Senior Site Reliability Engineer to join our team. This is a contract opportunity with long-term potential and is located in a major US city. The successful candidate will have a strong background in setting SLOs / SLIs / error budgets and managing reliability for infrastructure and applications.Key...


  • Atlanta, Georgia, United States Lorven Technologies Full time

    Job Title: Sr Site Reliability EngineerJob Summary:Lorven Technologies is seeking a highly skilled Sr Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for driving cross-team initiatives that improve Delta engineering practices, increasing accountability, and delivering increased uptime and...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems and applications.Key Responsibilities:Design, implement, and maintain monitoring tools,...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement, and maintain monitoring tools, alerts,...


  • Atlanta, Georgia, United States Lorven Technologies Full time

    Job Title: Site Reliability EngineerLorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and reliable cloud...


  • Atlanta, Georgia, United States Next Level Business Services, Inc. Full time

    Job Title: Site Reliability EngineerNext Level Business Services, Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications.Key Responsibilities:Design, implement, and maintain...

Senior Site Reliability Engineer

2 months ago


Atlanta, Georgia, United States Diversity Resource Staffing Inc Full time
Senior Site Reliability Engineer

This is an exciting opportunity for a skilled Senior Site Reliability Engineer to join our Consumer SRE Team at IMT division, providing secure, resilient, scalable, and maintainable services for mortgage borrowers and lenders. Our client, a division of a leading financial services company, operates numerous financial and commodity marketplaces and exchanges, including the New York Stock Exchange (NYSE).

We leverage automation to bring stability and scalability to our hybrid cloud environment, utilizing infrastructure-as-code to reduce toil and improve efficiency. As a Senior Site Reliability Engineer, you will collaborate with Developers to deliver robust services, build actionable alerts to detect and prevent incidents, and automate issue remediation.

Responsibilities
  • Employ deep troubleshooting skills to improve the availability, performance, and security of Ellie Mae Services.
  • Ensure services are designed with 24/7 availability and operational readiness.
  • Implement proactive monitoring, alerting, trend analysis, and self-healing systems.
  • Define and measure KPIs and SLOs.
  • Build automated deployments, automated tests, and operational tools.
  • Participate in on-call rotation for Production support.
  • Collaborate with Product and Support teams to plan and deploy product releases.
  • Partner with other SREs and lead by example.
Requirements
  • 10+ years of Application/Systems engineering in 24x7 Production Services environments.
  • BS in Computer Science, Computer Engineering, Math, or equivalent professional experience.
  • Excellent troubleshooter, utilizing a systematic problem-solving approach.
  • Demonstrate the ability to lead Incident Response and root cause analysis (RCA).
  • Fluency with one or more current generation scripting language used by SRE/DevOps professionals (Powershell, Python, Perl, PHP, Ruby) + Java/.NET development.
  • Experience running a SaaS application in a public cloud, on-prem, or hybrid cloud environment.

Additional credit for:

  • Proficiency in Windows and on-prem environments.
  • Experience with Continuous Integration and Continuous Delivery concepts.
  • Automation in RunDeck or Jenkins.
  • Infrastructure-as-code or Configuration Management, utilizing tools like Terraform, CloudFormation, or Chef/SaltStack/Puppet/DSC.
  • Containers/Docker/Micro-Services.