Sr. Site Reliability Engineer II

3 weeks ago


Atlanta GA, United States Smarsh Full time

Atlanta / PortlandDivisions – Enterprise Engineering /Full-Time /HybridWe are seeking a Sr. Site Reliability Engineer II to join our growing Enterprise Engineering Team. What will you do?Attend and actively participate in team ceremonies (stand-ups, retros, and planning meetings). Coordinate efforts with globally dispersed teams. Respond to incidents coordinated by SRE and Incident Response teams. Document decisions regarding technology choices, best practices and process. Manage code bases using Smarsh engineering practices. Creatively solve problems in the SaaS Operations space, collaborating with SRE, Platform, Delivery, and Engineering team members. Develop tools and libraries for broader use by SaaS Operations and Engineering teams. Actively change code within current production systems to resolve incidents and/or enhance operational performance, following Engineering process for code change. Contribute to architectural conversations and plans. Actively identify areas for improvement. Other duties as assignedWhat will you bring?Broad range of programming/scripting experience (i.e. Java, Python  etc.). Strong background in managing code with Git.Strong analytical and problem-solving skills.Versed in infrastructure as code (IaC) practices using Terraform.Kubernetes and/or EKS expertiseProven experience with AWS cloud Platforms, PaaS, SaaS experience.Working knowledge and hands-on experience of VPC, AWS Transit VPC, Transit Gateway, Direct Connect methodologies.Experience managing continuous integration and deployment systems (jenkins, Concourse, Argo). Experience working with software development tools (Jira) and documentation collaboration platforms (Confluence, wiki, knowledge base, etc.).Basic SQL querying knowledge (MySQL/PostgreSQL/MSSQL).Experience with builds and packaging in a Linux/Java environment strongly preferred (ISO, rpm, etc.). Experience with containerization (Docker, Kubernetes, etc.). Strong networking knowledge (routing, firewall rules, nat policies, load balancing).Minimum 5+ years of industry experience.Bachelor’s degree in Computer Science Other combination of education and work experience$150,000 - $170,000 a yearThe above salary range represents Smarsh's good faith and reasonable estimate of the range of possible base compensation at the time of posting. Any applicable bonus programs will be discussed during the recruiting process. The salary for this role will be set based on a variety of factors, including but not limited to, internal equity, experience, education, location, specialty and training. Local cost of living assessments are done for each new hire at the time of offer.



  • Atlanta, United States iTech Solutions Full time

    Lead and mentor a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence.Drive the adoption of SRE best practices and ensure adherence to reliability and performance standards.Design and implement highly available, scalable, and fault-tolerant systems using AWS. Collaborate with software engineering teams and...


  • Atlanta, United States InfoPeople Corp Full time

    Potential to Extend? PossiblyPotential to Convert FTE?Only interested in candidates who are interest in converting Target Years of Exp: 5 yearsTop 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, Javascript, Bash,...


  • Atlanta, United States Tech Providers Full time

    Sr. Site Reliability Engineer Atlanta GA (Hybrid) 06+ Months Contract with possibility of extension   Note: we are onsite by mandate 4-10 days per month. Our teams typically work Wednesdays each week. The day may change based on the needs of the teams.   Skills: Top 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying...


  • Atlanta, United States Tech Providers, Inc. Full time

    Sr. Site Reliability EngineerAtlanta GA (Hybrid)06+ Months Contract with possibility of extension  Note: we are onsite by mandate 4-10 days per month. Our teams typically work Wednesdays each week. The day may change based on the needs of the teams. Skills: Top 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying managing...


  • Atlanta, United States WIVERSE Full time

    SRE- Sr. Site Reliability Engineer Contract: 6+ Months with possible covert or hire Pay rate: W2 Hourly USC/GC Only Local candidates can be considered only Location: Atlanta, GA - onsite by mandate 4-10 days per month. Client teams typically work Wednesdays each week. The day may change based on the needs of the teams the SRE supports. Target Years of Exp: 5...


  • Atlanta, United States 3i People, Inc. Full time

    We have a position for a Sr. Site Reliability Engineer with one of our clients in Atlanta, GA for an initial contract duration of 6 months. s and all those authorized to work in the US are encouraged to apply.Interview Type is Video. Local Candidates Preferred.Key Responsibilities:Lead and mentor a team of SREs, fostering a culture of collaboration,...


  • Atlanta, United States Navtech Inc Full time

    Hi Folks, I have an open opportunity that you may be a good fit for. If this sounds like something you would be interested in, please get in touch with me as soon as possible at with your most recent resume, your ideal time and number for communication, and the expected pay rate for C2C/1099/W2. Job Description: Job Title : Sr. Site Reliability...


  • Atlanta, United States Xoriant Full time

    Looking for Sr. Site Reliability Engineer - Atlanta, GA Top 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, Javascript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC) tools, like Terraform, GHA,...


  • Atlanta, United States Xoriant Full time

    Looking for Sr. Site Reliability Engineer - Atlanta, GATop 5 Must Haves: • Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; • Software Engineering background/experience---Python, Javascript, Bash, etc.; • In-depth knowledge of infrastructure as code (IaC) tools, like Terraform,...


  • Atlanta, United States Xoriant Full time

    Looking for Sr. Site Reliability Engineer - Atlanta, GATop 5 Must Haves: • Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; • Software Engineering background/experience---Python, Javascript, Bash, etc.; • In-depth knowledge of infrastructure as code (IaC) tools, like Terraform,...


  • Atlanta, United States Xoriant Full time

    Looking for Sr. Site Reliability Engineer - Atlanta, GATop 5 Must Haves: • Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; • Software Engineering background/experience---Python, Javascript, Bash, etc.; • In-depth knowledge of infrastructure as code (IaC) tools, like Terraform,...


  • Atlanta, United States Xoriant Full time

    Looking for Sr. Site Reliability Engineer - Atlanta, GATop 5 Must Haves: • Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; • Software Engineering background/experience---Python, Javascript, Bash, etc.; • In-depth knowledge of infrastructure as code (IaC) tools, like Terraform,...


  • Atlanta, United States Xoriant Corporation Full time

    Looking for Sr. Site Reliability Engineer - Atlanta, GATarget Years of Exp: 5 years Top 5 Must Haves: Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure;Software Engineering background/experience---Python, Javascript, Bash, etc.;In-depth knowledge of infrastructure as code (IaC) tools,...


  • Atlanta, United States Flexton Full time

    Title: Sr. Site Reliability Engineer Location: Atlanta, GA (Hybrid) Duration: 12+ Months Job Description: Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC)...


  • Atlanta, GA, United States T-Mobile USA, Inc. Full time

    T-Mobile USA, Inc. seeks Sr. Engineers, System Reliability in Atlanta, GA Research, design, and develop computer and network software or specialized utility programs. Analyze user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis. Telecommuting is permitted, but applicants...


  • Atlanta, United States Flexton Inc. Full time

    Title: Sr. Site Reliability Engineer Location: Atlanta, GA (Hybrid)Duration: 12+ Months Job Description:Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC)...


  • Atlanta, United States Flexton Inc. Full time

    Title: Sr. Site Reliability Engineer Location: Atlanta, GA (Hybrid)Duration: 12+ Months Job Description:Extensive/Strong AWS experience---experience in designing, deploying managing scalable/reliable cloud-based infrastructure; Software Engineering background/experience---Python, JavaScript, Bash, etc.; In-depth knowledge of infrastructure as code (IaC)...


  • Atlanta, United States Home Depot Management Company, LLC Full time

    Position Purpose: The Software Engineer II is responsible for independently developing and assisting in the design of a product that our customers and associates love. As a Software Engineer II, you will be part of a dynamic team with engineers of all experience levels who help each other build and grow technical and leadership skills while creating,...


  • Atlanta, United States Planet DDS, Inc Full time

    About Us: Planet DDS is a dynamic and rapidly growing dental software company, serving over 13,000 practices across the United States with over 118,000 users. The company delivers a complete platform of cloud-based SaaS solutions for dental practices, including Denticon Practice Management, Apteryx Imaging, Cloud 9 Ortho Practice Management, and Legwork...


  • Atlanta, United States MethodHub Full time

    Job Title: Site Reliablity Engineer (Performance Monitoring)Location: RemoteDuration: Long Term (W2 Only)Client: DirectJob Description:Experience of 6-8 Professional experience as a Site Reliability Engineer (SRE)Software development “hands on” engineer with excellent understanding of SDLC Application delivery.Ability to translate functional and...