Site Reliability Engineer

3 weeks ago


Atlanta, United States iScale Solutions Full time

Job Description This is a remote position.

Key Responsibilities:

Design, implement, and maintain highly available and scalable infrastructure on AWS cloud platform.

Develop and manage Infrastructure as Code (IaC) using Terraform for provisioning and managing cloud resources.

Implement containerization strategies using Docker for packaging and deploying applications.

Deploy and manage container orchestration platforms such as Kubernetes for automating deployment, scaling, and management of containerized applications.

Monitor system performance, troubleshoot issues, and implement solutions to ensure optimal performance and reliability.

Automate repetitive tasks and workflows to improve efficiency and reduce manual intervention.

Collaborate with development teams to ensure applications are designed with scalability, reliability, and operability in mind.

Participate in on-call rotation and respond to incidents in a timely manner, ensuring minimal downtime and disruption to services.

Requirements:

Bachelor’s degree in Computer Science, Engineering, or related field.

Proven experience as a Site Reliability Engineer or similar role.

Strong understanding of AWS services and best practices for cloud infrastructure.

Proficiency in Infrastructure as Code (IaC) using Terraform.

Hands-on experience with containerization using Docker.

Experience with container orchestration platforms such as Kubernetes.

Solid understanding of networking, security, and system administration concepts.

Excellent problem-solving skills and attention to detail.

Strong communication and collaboration skills.

Ability to work independently and as part of a team in a fast-paced environment.

#J-18808-Ljbffr



  • Atlanta, United States Tech Providers Inc. Full time

    Site Reliability EngineerAtlanta GA (Hybrid) 06+ Months Contract to HireSkills: Top 5 Must Haves: Extensive/Strong AWS experience, experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC) tools, like...


  • Atlanta, United States Hermeus Full time

    Hermeus is an aerospace and defense technology company founded to radically accelerate air travel by delivering hypersonic aircraft. The company aims to develop hypersonic aircraft quickly and cost-effectively by integrating hardware-rich, iterative development with modern computing and autonomy. This approach has been validated through design, build, and...


  • Atlanta, United States CNA Search Full time

    Functions and Responsibilities Manage production environments by monitoring availability and taking a holistic view of system health Automate reliability, quality, and repeatability of cloud environments Proactively ensure the highest levels of systems and infrastructure availability Responsible for maintaining tools/systems/platforms for cloud service...


  • Atlanta, United States ClifyX, INC Full time

    Senior Specialist - Software Engineering Requisition ID : 1289381 Posting Start Date : Apr 24, 2024 Posting End Date : Apr 25, 2024 Recruiter : Ramachandra Reddy Job Code: 1289381 Job Title: Site Reliability Engineer Work Location : Location: Atlanta, GA Job Description: "Provisioning cloud infrastructure (AWS, GCP) using infrastructure as...


  • Atlanta, United States Hermeus Full time

    Hermeus is seeking a Senior Software or Site Reliability Engineer to join the Information Team and take charge of building infrastructure for our business-critical development and test facilities to help accelerate our iteration speed on hardware development and testing. You will work closely with avionics and aerospace engineering teams to design and...


  • Atlanta, United States Flexton Full time

    Title: Sr. Site Reliability Engineer Location: Atlanta, GA (Hybrid) Duration: 12+ Months Job Description: Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC)...


  • Atlanta, United States Motion Recruitment Full time

    A proven performer in HealthTech and Digital Recording is looking to continue to expand their DevOps team with an ambitious and skilled Site Reliability Engineer.  The ideal candidate will, be working on large-scale projects, primarily involving automation expansion and cloud migration. You will be developing in an environment that is primarily AWS, and...


  • Atlanta, United States Flexton Inc. Full time

    Title: Sr. Site Reliability Engineer Location: Atlanta, GA (Hybrid)Duration: 12+ Months Job Description:Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC)...


  • Atlanta, United States iTech Solutions Full time

    Lead and mentor a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence.Drive the adoption of SRE best practices and ensure adherence to reliability and performance standards.Design and implement highly available, scalable, and fault-tolerant systems using AWS. Collaborate with software engineering teams and...


  • Atlanta, United States InfoPeople Corp Full time

    Potential to Extend? PossiblyPotential to Convert FTE?Only interested in candidates who are interest in converting Target Years of Exp: 5 yearsTop 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, Javascript, Bash,...


  • Atlanta, United States Flexton Inc. Full time

    Title: Sr. Site Reliability Engineer Location: Atlanta, GA (Hybrid)Duration: 12+ Months Job Description:Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC)...


  • Atlanta, United States Apex Systems Full time

    We are seeking a proactive and dynamic Site Reliability Engineer to support our multiple engineering teams by providing robust infrastructure solutions. The ideal candidate will be responsible for patching, building pipelines, and must possess a strong understanding of Terraform. As an AWS and a GitHub Actions (GHA) shop, experience with both AWS and GHA is...


  • Atlanta, United States Tech Providers, Inc. Full time

    Sr. Site Reliability EngineerAtlanta GA (Hybrid)06+ Months Contract with possibility of extension  Note: we are onsite by mandate 4-10 days per month. Our teams typically work Wednesdays each week. The day may change based on the needs of the teams. Skills: Top 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying managing...


  • Atlanta, United States Tech Providers Full time

    Sr. Site Reliability Engineer Atlanta GA (Hybrid) 06+ Months Contract with possibility of extension   Note: we are onsite by mandate 4-10 days per month. Our teams typically work Wednesdays each week. The day may change based on the needs of the teams.   Skills: Top 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying...


  • Atlanta, United States Apex Systems Full time

    We are seeking a proactive and dynamic Site Reliability Engineer to support our multiple engineering teams by providing robust infrastructure solutions. The ideal candidate will be responsible for patching, building pipelines, and must possess a strong understanding of Terraform. As an AWS and a GitHub Actions (GHA) shop, experience with both AWS and GHA is...


  • Atlanta, United States Apex Systems Full time

    We are seeking a proactive and dynamic Site Reliability Engineer to support our multiple engineering teams by providing robust infrastructure solutions. The ideal candidate will be responsible for patching, building pipelines, and must possess a strong understanding of Terraform. As an AWS and a GitHub Actions (GHA) shop, experience with both AWS and GHA is...


  • Atlanta, United States Apex Systems Full time

    We are seeking a proactive and dynamic Site Reliability Engineer to support our multiple engineering teams by providing robust infrastructure solutions. The ideal candidate will be responsible for patching, building pipelines, and must possess a strong understanding of Terraform. As an AWS and a GitHub Actions (GHA) shop, experience with both AWS and GHA is...


  • Atlanta, United States 3i People, Inc. Full time

    We have a position for a Sr. Site Reliability Engineer with one of our clients in Atlanta, GA for an initial contract duration of 6 months. s and all those authorized to work in the US are encouraged to apply.Interview Type is Video. Local Candidates Preferred.Key Responsibilities:Lead and mentor a team of SREs, fostering a culture of collaboration,...


  • Atlanta, United States Resource Informatics Group Full time

    Job Description Job Description Role: Site Reliability Engineer Location: Atlanta, GA Duration: 12 months Rate: $market All Inclusive Job Description: This Software Engineer will be part of the Site Reliability Engineering (SRE) team. The SRE team is an innovative team devoted to providing automated solutions and services for Cox Automotive to measure,...


  • Atlanta, United States Navtech Inc Full time

    Hi Folks, I have an open opportunity that you may be a good fit for. If this sounds like something you would be interested in, please get in touch with me as soon as possible at with your most recent resume, your ideal time and number for communication, and the expected pay rate for C2C/1099/W2. Job Description: Job Title : Sr. Site Reliability...