Current jobs related to Sr. Site Reliability Engineer - Atlanta - Flexton Inc.


  • atlanta, United States Advansys Full time

    Job Title: Site Reliability Engineer Location: Alpharetta, GA (Locals Candidates only) Duration: Long term We seek a highly skilled Site Reliability Engineer and dynamic – Consultant In this role you will Maintain and improve the reliability, performance, and availability of software systems. Act as a bridge between traditional IT operations and...


  • Atlanta, United States Advansys Full time

    Job Title: Site Reliability Engineer Location: Alpharetta, GA (Locals Candidates only) Duration: Long term We seek a highly skilled Site Reliability Engineer and dynamic – Consultant In this role you will Maintain and improve the reliability, performance, and availability of software systems. Act as a bridge between traditional IT operations and...


  • Atlanta, United States Advansys Full time

    Job Title: Site Reliability Engineer Want to make an application Make sure your CV is up to date, then read the following job specs carefully before applying. Location: Alpharetta, GA (Locals Candidates only) Duration: Long term We seek a highly skilled Site Reliability Engineer and dynamic – Consultant In this role you will Maintain and improve the...


  • Atlanta, United States ACL Digital Full time

    Title:: Site Reliability EngineerLocation:: Atlanta, GA (Hybrid role, 3x days onsite/week)Type of Hire:: Contract (c2c/w2)Duration:: 12 months with possible extension Site Reliability Engineer (SRE) with AWS Cloud and Application Monitoring Experience** We are seeking a skilled Site Reliability Engineer (SRE) with expertise in AWS cloud infrastructure and...


  • Atlanta, United States ACL Digital Full time

    Title:: Site Reliability EngineerLocation:: Atlanta, GA (Hybrid role, 3x days onsite/week)Type of Hire:: Contract (c2c/w2)Duration:: 12 months with possible extension Site Reliability Engineer (SRE) with AWS Cloud and Application Monitoring Experience** We are seeking a skilled Site Reliability Engineer (SRE) with expertise in AWS cloud infrastructure and...

  • Sr. Software Engineer

    3 weeks ago


    Atlanta, United States Comcast Corporation Full time

    FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...


  • Atlanta, United States Insight Global Full time

    Must Haves:5+ years of C# .NET Development ExperienceExperience building automated deploymentsIIS application pool experience Plusses:Splunk Scrum Experience Cloud knowledge and experience Day-to-Day Responsibilities:A Fortune 500 client of Insight Global is seeking a Site Reliability Engineer (SRE) to join their team on a hybrid basis. As the sole SRE, you...


  • Atlanta, United States Insight Global Full time

    Must Haves:5+ years of C# .NET Development ExperienceExperience building automated deploymentsIIS application pool experience Plusses:Splunk Scrum Experience Cloud knowledge and experience Day-to-Day Responsibilities:A Fortune 500 client of Insight Global is seeking a Site Reliability Engineer (SRE) to join their team on a hybrid basis. As the sole SRE, you...


  • Atlanta, United States Insight Global Full time

    Position Title: Site Reliability EngineerLocation: Atlanta, GA; Portland, ME; or Chattanooga, TN (3 days/week onsite)Compensation: $130-150k Duration: Full-Time, Direct Hire Job Overview:A Fortune 500 client of Insight Global is seeking a dedicated Site Reliability Engineer (SRE) to join their team. As the sole SRE, you will play a crucial role in...


  • Atlanta, United States Tata Consultancy Services Full time

    Job DescriptionAutomating work including infrastructure needs, testing, failover solutions, failure mitigation, and much moreDebugging complex problems across an entire stack and creating solid solutionsDeveloping and building CI/CD processes to improve cadenceUsing Chaos Engineering to test what you build under real-world conditionsTriage product or system...


  • Atlanta, United States Tata Consultancy Services Full time

    Job DescriptionAutomating work including infrastructure needs, testing, failover solutions, failure mitigation, and much moreDebugging complex problems across an entire stack and creating solid solutionsDeveloping and building CI/CD processes to improve cadenceUsing Chaos Engineering to test what you build under real-world conditionsTriage product or system...


  • Atlanta, United States Hermeus Full time

    Hermeus is an aerospace and defense technology company founded to radically accelerate air travel by delivering hypersonic aircraft. The company aims to develop hypersonic aircraft quickly and cost-effectively by integrating hardware-rich, iterative development with modern computing and autonomy. This approach has been validated through design, build, and...


  • Atlanta, United States Datum Technologies Group Full time

    Job Details:Site Reliability EngineerLong term contractAtlanta, GAQualifications:Must have Skills:Deep understanding of AWS services (Lambda, S3, SQS, IAM, Route 53 etc.) and proficiency in infrastructure as code (e.g., Terraform, CloudFormation).Hands-on experience with monitoring tools such as CloudWatch, Sumo Logic, Dynatrace, Grafana, or similar for...


  • Atlanta, United States Datum Technologies Group Full time

    Job Details:Site Reliability EngineerLong term contractAtlanta, GAQualifications:Must have Skills:Deep understanding of AWS services (Lambda, S3, SQS, IAM, Route 53 etc.) and proficiency in infrastructure as code (e.g., Terraform, CloudFormation).Hands-on experience with monitoring tools such as CloudWatch, Sumo Logic, Dynatrace, Grafana, or similar for...


  • Atlanta, United States Hermeus Full time

    Hermeus is an aerospace and defense technology company founded to radically accelerate air travel by delivering hypersonic aircraft. The company aims to develop hypersonic aircraft quickly and cost-effectively by integrating hardware-rich, iterative development with modern computing and autonomy. This approach has been validated through design, build, and...


  • Atlanta, Georgia, United States Advansys Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Advansys. As a key member of our infrastructure team, you will be responsible for maintaining and improving the reliability, performance, and availability of our software systems.Key Responsibilities:Maintain and improve the reliability, performance, and availability...


  • Atlanta, United States STORD Full time

    Stord is the leading commerce enablement provider of fulfillment services and technology that powers seamless checkout and delivery experiences for high-volume mid-market and enterprise brands across all channels. Stord manages over $5 billion of commerce annually through its fulfillment, warehousing, transportation, and operator-built software suite...


  • Atlanta, United States Cox Communications Full time

    This role is for an opening for a Senior Site Reliability Engineer (SRE) on the Manheim Logistics SRE team. The SRE team is tasked with designing and maintaining AWS infrastructure and deployment pipelines for Manheim Logistics 15 development teams. Reliability Engineer, Liability, Reliability, Engineer, Reliability, Monitoring, Technology


  • Atlanta, United States BeVera Solutions LLC Full time

    Job DescriptionJob DescriptionDescription:Company DescriptionBeVera Solutions, LLC is a fast-growing Data Science Consulting provider focused on delivering high-value solutions to its Federal Government customers. BeVera places a high premium on Integrity and Respect for all employees. Our CEO values every employee and fosters that attitude throughout the...


  • Atlanta, United States Motion Recruitment Full time

    A prominent insurance firm located in Atlanta is seeking skilled professionals to join their engineering team. They are currently in search of a DevOps/Senior Site Reliability Engineer for a full-time position, offering a hybrid work model at their Atlanta office. This company is at the cutting edge of innovation in content and presentation software designed...

Sr. Site Reliability Engineer

2 months ago


Atlanta, United States Flexton Inc. Full time

Position Title: Sr. SRE Engineer

Location: Atlanta, GA

Pay rate- $72-$75

Min.5 years of experience

Unable to offer sponsorship. USC or GC only


Position Description:


Top 5 Must Haves:

  • Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure.
  • Software Engineering background/experience---Python, Javascript, Bash, etc.
  • In-depth knowledge of infrastructure as code (IaC) tools, like Terraform, GHA, CloudFormation, Ansible.
  • Strong Automation and Scripting Skills, Solid Understanding of CI/CD Pipelines (Jenkins)


Job Description - Key Responsibilities:

  • Lead and mentor a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence.
  • Drive the adoption of SRE best practices and ensure adherence to reliability and performance standards.
  • Design and implement highly available, scalable, and fault-tolerant systems using AWS. Collaborate with software engineering teams and other SREs to influence design and architecture decisions to improve system reliability and performance.
  • Develop and maintain automation scripts and tools to streamline operations, deployments, and monitoring processes.
  • Utilize Infrastructure as Code (IaC) tools such as Terraform, GitHub Actions, and CloudFormation to manage infrastructure. Implement and maintain robust monitoring, alerting, and logging systems using tools like Splunk, Grafana, or New Relic.
  • Lead incident response efforts, conduct root cause analysis, and implement measures to prevent recurrence.
  • Oversee the design and maintenance of CI/CD pipelines using tools like Jenkins, GitLab CI, or CircleCI.
  • Ensure seamless and efficient code deployment processes, reducing time to market and increasing system reliability optimization:


Conduct performance tuning and capacity planning to ensure systems can handle growing workloads.

Troubleshooting experience.

Identify and resolve performance bottlenecks in infrastructure and applications.