Current jobs related to Site Reliability Engineer - Atlanta - Incident IQ


  • Atlanta, United States Engineer Up Full time

    About Us: Engineer Up is on a mission to disrupt how good, hard-working people advance their careers in tech. We partner with Fortune 500 companies to deliver customized IT consulting services spanning from software development to digital transformation. Position: Senior Principal Software Engineer Employer: Engineer Up Location: Remote Role: Consultant Position...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems and applications. Key Responsibilities: Design, implement, and maintain monitoring tools,...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems. Key Responsibilities: Design, implement, and maintain monitoring tools, alerts,...


  • Atlanta, Georgia, United States Lorven Technologies Full time

    Job Title: Site Reliability Engineer Lorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, implement, and maintain scalable and reliable cloud...


  • Atlanta, Georgia, United States Next Level Business Services, Inc. Full time

    Job Title: Site Reliability Engineer Next Level Business Services, Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications. Key Responsibilities: Design, implement, and maintain...


  • Atlanta, Georgia, United States ACL Digital Full time

    Job Title: Site Reliability Engineer We are seeking a skilled Site Reliability Engineer to join our team at ACL Digital. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems and applications. Key Responsibilities: Implement and improve monitoring, alerting, and logging...


  • Atlanta, Georgia, United States GeorgiaTEK Systems Inc. Full time

    Job Title: Site Reliability Engineer We are seeking a highly skilled Site Reliability Engineer to join our team at GeorgiaTEK Systems Inc. in Minneapolis, MN or Atlanta, GA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design,...


  • Atlanta, Georgia, United States GeorgiaTEK Systems Inc. Full time

    Job Title: Site Reliability Engineer GeorgiaTEK Systems Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, implement, and maintain scalable and highly...


  • Atlanta, Georgia, United States Navtech Full time

    Job Title: Site Reliability Engineer We are seeking a highly skilled Site Reliability Engineer to join our team at Navtech Inc. in Atlanta or St. Louis. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our production systems. Key Responsibilities: Provide L4 technical support for production...


  • Atlanta, Georgia, United States Geotab Full time

    Job Title: Site Reliability Engineer Geotab is a global leader in IoT and connected transportation, and we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure. You will work closely with our development team to...


  • Atlanta, Georgia, United States bloomreach Full time

    Job Title: Site Reliability Engineer-II We are seeking a highly skilled Site Reliability Engineer-II to join our team at Bloomreach. As a Site Reliability Engineer-II, you will be responsible for improving and managing infrastructure to drive efficiency and scalability. Responsibilities: Improve and manage infrastructure to drive efficiency and...


  • Atlanta, Georgia, United States Tata Consultancy Services Full time

    Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team at Tata Consultancy Services. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our mission-critical services. Key Responsibilities: Automate Infrastructure and Testing: Automate infrastructure needs,...


  • Atlanta, Georgia, United States Ditto Job Board Full time

    Job Title: Site Reliability Engineer At Ditto, we're on a mission to unleash the full power of edge devices by removing all the plumbing required to build amazing applications. As a Site Reliability Engineer, you'll play a critical role in helping us achieve this goal. About the Role: We're seeking a highly skilled Site Reliability Engineer to join our Federal...


  • Atlanta, Georgia, United States Tekwissen Full time

    Job Title: Site Reliability Engineer At TekWissen Group, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, availability, and performance of our cloud-based systems. Key Responsibilities: Provide consulting services to improve system stability,...


  • Atlanta, Georgia, United States T-Mobile Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at T-Mobile. As a Site Reliability Engineer, you will play a critical role in ensuring the availability, scalability, and performance of our IT services. Key Responsibilities: Design and implement scalable and reliable systems and processes to support our IT...


  • Atlanta, Georgia, United States JobRialto Full time

    Job Title: Site Reliability Engineer JobRialto is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design and implement performance indicators in SOA to monitor system...


  • Atlanta, Georgia, United States Geotab Full time

    Job Title: Site Reliability Engineer Geotab is a global leader in IoT and connected transportation, and we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and performance of our core MyGeotab applications. Key Responsibilities: Provide escalated...


  • Atlanta, Georgia, United States JobRialto Full time

    Job Summary: The Site Reliability Engineer is responsible for ensuring the availability, scalability, and performance of critical services and systems. This role requires expertise in OpenShift and CloudFormation, along with a deep understanding of site reliability principles, container technologies, monitoring tools, and automation. Key Responsibilities: Ensure...

Site Reliability Engineer

3 months ago


Atlanta, United States Incident IQ Full time
Job Description

Site Reliability Engineer (SRE)

Company Overview

Atlanta-based Incident IQ is a SaaS service management platform built exclusively for K-12 schools. It is transforming K-12 workflows, including IT asset management, help desk ticketing, facilities maintenance solutions, Human Resources service delivery, and more. Our mission is to revolutionize how school districts manage operational support activities to better serve students and drive instructional efficiencies. Incident IQ is a dynamic, fast-growing company focused on providing innovative cloud-based software. The Incident IQ platform has been rapidly adopted by K-12 school districts. Today, millions of students and teachers in districts across the U.S. rely on the Incident IQ platform to manage and deliver mission-critical services.

Since the company's founding, Incident IQ has built a culture focused on customer success and product leadership; we are passionate about helping school districts achieve operational efficiency. Incident IQ's environment is inclusive and transparent, and our team members are respected and valued contributors who consistently exhibit openness, integrity, collaboration, enthusiasm, and effort.

Position Overview:

We are seeking a Mid-Level Site Reliability Engineer (SRE) to join our growing cloud engineering team. The ideal candidate will have a strong background in software engineering, system administration, and operations, with a focus on building reliable, scalable, and efficient systems. As an SRE, you will work across our various software development teams to automate operational tasks, participate in emergency escalation and troubleshooting, and build new infrastructure.

SRE Responsibilities:

  • Design, develop, and maintain scalable and reliable infrastructure.
  • Implement monitoring, logging, and alerting solutions to ensure the health and performance of our systems in support of our service level objectives (SLOs).
  • Contribute to cross-department infrastructure projects.
  • Work with development teams on projects related to application reliability.
  • Participate in periodic on-call duties to prevent, solve, and automate the response to problems in mission-critical services.
  • Continuously improve the reliability and performance of our systems through proactive monitoring and capacity planning.
  • Document processes, systems, and configurations to ensure knowledge sharing and continuous improvement.
  • Implement security compliance regulations and other business rules.

Key Skills/Experience:

  • 3-5 years of experience in a Site Reliability Engineer, DevOps, or related role.
  • Strong experience with cloud platforms such as Azure, AWS, or Google Cloud.
  • Experience with configuration automation in ephemeral environments.
  • Familiarity with containerization and orchestration tools such as Docker and Kubernetes.
  • Strong understanding of networking, security, and system administration.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills.

Preferred Qualifications:

  • Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, Helm, and ArgoCD.
  • Knowledge of CI/CD pipelines and tools such as GitHub Actions and Azure DevOps YAML Pipelines.
  • Experience with monitoring and logging tools such as Prometheus, Grafana, Kibana, and Azure Insights.
  • Familiarity with deploying and monitoring database technologies such as Microsoft SQL Server, MongoDB, and Redis.
  • Familiarity with Agile and DevOps methodologies.
  • Familiarity with building and deploying C# code.
  • Experience with Azure and AKS is a plus.

What makes Incident IQ different:

  • We facilitate whole-person growth where employees can develop personally as well as professionally.
  • We offer an energetic and collaborative environment; everyone's opinion matters.
  • We produce software that empowers K-12 schools to run efficiently, allowing for a better classroom experience for students to THRIVE.
  • We provide excellent work/life balance. Work from one of our two amazing offices, in Downtown Atlanta or at Halcyon in Alpharetta, or work fully remote.

Incident IQ offers a competitive salary based on experience, with a benefits package for full-time employees that includes medical, dental, vision, life insurance, 401(k), and paid time off (PTO).

Incident IQ is an Equal Opportunity Employer.