Senior Site Reliability Engineer

4 weeks ago


Atlanta, United States Flexton Inc. Full time

Title: Sr. Site Reliability Engineer

Location: Atlanta, GA (Hybrid)

Duration: 12+ Months


Job Description:

  • Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC) tools, like Terraform, GHA, CloudFormation, Ansible; Strong Automation and Scripting Skills, Solid Understanding of CI/CD Pipelines (Jenkins).
  • Lead and mentor a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence.
  • Drive the adoption of SRE best practices and ensure adherence to reliability and performance standards.
  • Design and implement highly available, scalable, and fault-tolerant systems using AWS.
  • Collaborate with software engineering teams and other SREs to influence design and architecture decisions to improve system reliability and performance.
  • Develop and maintain automation scripts and tools to streamline operations, deployments, and monitoring processes.
  • Utilize Infrastructure as Code (IaC) tools such as Terraform, GitHub Actions, and CloudFormation to manage infrastructure. Implement and maintain robust monitoring, alerting, and logging systems using tools like Splunk, Grafana, or New Relic.
  • Lead incident response efforts, conduct root cause analysis, and implement measures to prevent recurrence.
  • Oversee the design and maintenance of CI/CD pipelines using tools like Jenkins, GitLab CI, or CircleCI.
  • Ensure seamless and efficient code deployment processes, reducing time to market and increasing system reliability. Optimization:
  • Conduct performance tuning and capacity planning to ensure systems can handle growing workloads. Troubleshooting experience. Identify and resolve performance bottlenecks in infrastructure and applications.



  • Atlanta, United States Nextlink Full time

    Job Description Senior Site Reliability Engineer Desirable Skills: Experience with additional programming languages and technologies beyond Python and Ruby. Familiarity with cloud platforms such as AWS, Azure, or GCP. Proficiency in additional logging and monitoring tools. Experience with other Infrastructure as Code (IaC) tools and practices. Knowledge of...


  • Atlanta, United States Nextlink Full time

    Job Description Senior Site Reliability Engineer Desirable Skills: Experience with additional programming languages and technologies beyond Python and Ruby. Familiarity with cloud platforms such as AWS, Azure, or GCP. Proficiency in additional logging and monitoring tools. Experience with other Infrastructure as Code (IaC) tools and practices. Knowledge of...


  • Atlanta, United States Boomi Inc Full time

    About Boomi and What Makes Us SpecialAre you ready to work at a fast-growing company where you can make a difference? Boomi aims to make the world a better place by connecting everyone to everything, anywhere. Our award-winning, intelligent integration and automation platform helps organizations power the future of business. At Boomi, you’ll work with...


  • Atlanta, United States Boomi Inc Full time

    About Boomi and What Makes Us SpecialAre you ready to work at a fast-growing company where you can make a difference? Boomi aims to make the world a better place by connecting everyone to everything, anywhere. Our award-winning, intelligent integration and automation platform helps organizations power the future of business. At Boomi, you’ll work with...


  • Atlanta, Georgia, United States Delta Airlines Full time

    Senior Site Reliability Engineer United States, Georgia, AtlantaInformation Technology21-May-2024Ref #: 24931How you'll help us Keep Climbing (overview & key responsibilities)Delta IT is on a journey to becoming the best IT organization in the airline industry, a journey of transformation. We are changing the way we do business from top to bottom as we...


  • Atlanta, Georgia, United States Delta Airlines Full time

    Senior Site Reliability Engineer United States, Georgia, AtlantaInformation Technology21-May-2024Ref #: 24931How you'll help us Keep Climbing (overview & key responsibilities)Delta IT is on a journey to becoming the best IT organization in the airline industry, a journey of transformation. We are changing the way we do business from top to bottom as we...


  • Atlanta, United States NVIDIA Full time

    About NVIDIA: NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that's motivated by outstanding technology and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains...


  • Atlanta, United States Calendly LLC Full time

    What’s so great about working on Calendly’s Engineering team? We make things possible for our customers through impactful innovation. Why do we need you? Well, we are looking for a Senior Site Reliability Engineer who will bring creative problem solving and a keen eye for detail. You will report to a Senior Manager of Engineering and be responsible for...


  • Atlanta, Georgia, United States Delta Airlines Full time

    United States, Georgia, Atlanta Information Technology 21-May-2024 Ref #: 24931How you'll help us Keep Climbing (overview & key responsibilities)Delta IT is on a journey to becoming the best IT organization in the airline industry, a journey of transformation. We are changing the way we do business from top to bottom as we strive to create meaningful and...


  • Atlanta, Georgia, United States Delta Airlines Full time

    United States, Georgia, Atlanta Information Technology 21-May-2024 Ref #: 24931How you'll help us Keep Climbing (overview & key responsibilities)Delta IT is on a journey to becoming the best IT organization in the airline industry, a journey of transformation. We are changing the way we do business from top to bottom as we strive to create meaningful and...


  • ATLANTA, Georgia, United States Delta Airlines Full time

    Senior Site Reliability Engineer United States, Georgia, Atlanta Information Technology 21-May-2024 Ref #: 24931 How you'll help us Keep Climbing (overview & key responsibilities) Delta IT is on a journey to becoming the best IT organization in the airline industry, a journey of transformation. We are changing the way we do business from top to bottom...


  • Atlanta, United States VySystems Full time

    Skills Required:s∙5 or more years of experience as an application developer or SRE. ∙2 or more years of experience with ops automation using a scripting language such as Python or Ansible. ∙Experience with an APM tool such as Dynatrace, New Relic, AppDynamics, or Datadog is preferred. ∙Site Reliability Engineering: Knowledge of the theories and...


  • Atlanta, United States VySystems Full time

    Skills Required:s∙5 or more years of experience as an application developer or SRE. ∙2 or more years of experience with ops automation using a scripting language such as Python or Ansible. ∙Experience with an APM tool such as Dynatrace, New Relic, AppDynamics, or Datadog is preferred. ∙Site Reliability Engineering: Knowledge of the theories and...


  • Atlanta, United States VySystems Full time

    Skills Required:s∙5 or more years of experience as an application developer or SRE. ∙2 or more years of experience with ops automation using a scripting language such as Python or Ansible. ∙Experience with an APM tool such as Dynatrace, New Relic, AppDynamics, or Datadog is preferred. ∙Site Reliability Engineering: Knowledge of the theories and...


  • Atlanta, United States VySystems Full time

    Skills Required:s∙5 or more years of experience as an application developer or SRE. ∙2 or more years of experience with ops automation using a scripting language such as Python or Ansible. ∙Experience with an APM tool such as Dynatrace, New Relic, AppDynamics, or Datadog is preferred. ∙Site Reliability Engineering: Knowledge of the theories and...


  • Atlanta, United States Delta Airlines Full time

    United States, Georgia, Atlanta Information Technology 21-May-2024 Ref #: 24931How you'll help us Keep Climbing (overview & key responsibilities)Delta IT is on a journey to becoming the best IT organization in the airline industry, a journey of transformation. We are changing the way we do business from top to bottom as we strive to create meaningful and...


  • Atlanta, United States Hermeus Full time

    Hermeus is an aerospace and defense technology company founded to radically accelerate air travel by delivering hypersonic aircraft. The company aims to develop hypersonic aircraft quickly and cost-effectively by integrating hardware-rich, iterative development with modern computing and autonomy. This approach has been validated through design, build, and...


  • Atlanta, United States Flexton Full time

    Title: Sr. Site Reliability Engineer Location: Atlanta, GA (Hybrid) Duration: 12+ Months Job Description: Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC)...


  • Atlanta, United States Hermeus Full time

    Hermeus is an aerospace and defense technology company founded to radically accelerate air travel by delivering hypersonic aircraft. The company aims to develop hypersonic aircraft quickly and cost-effectively by integrating hardware-rich, iterative development with modern computing and autonomy. This approach has been validated through design, build, and...


  • Atlanta, United States Hermeus Full time

    Hermeus is an aerospace and defense technology company founded to radically accelerate air travel by delivering hypersonic aircraft. The company aims to develop hypersonic aircraft quickly and cost-effectively by integrating hardware-rich, iterative development with modern computing and autonomy. This approach has been validated through design, build, and...