Current jobs related to Site Reliability Engineer - Atlanta - Tata Consultancy Services


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems and applications.Key Responsibilities:Design, implement, and maintain monitoring tools,...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement, and maintain monitoring tools, alerts,...


  • Atlanta, Georgia, United States Lorven Technologies Full time

    Job Title: Site Reliability EngineerLorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and reliable cloud...


  • Atlanta, Georgia, United States Next Level Business Services, Inc. Full time

    Job Title: Site Reliability EngineerNext Level Business Services, Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications.Key Responsibilities:Design, implement, and maintain...


  • Atlanta, Georgia, United States GeorgiaTEK Systems Inc. Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at GeorgiaTEK Systems Inc. in Minneapolis, MN or Atlanta, GA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design,...


  • Atlanta, Georgia, United States GeorgiaTEK Systems Inc. Full time

    Job Title: Site Reliability EngineerGeorgiaTEK Systems Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Atlanta, Georgia, United States Navtech Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Navtech Inc. in Atlanta or St. Louis. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our production systems.Key Responsibilities:Provide L4 technical support for production...


  • Atlanta, Georgia, United States Geotab Full time

    Job Title: Site Reliability EngineerGeotab is a global leader in IoT and connected transportation, and we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure. You will work closely with our development team to...


  • Atlanta, Georgia, United States bloomreach Full time

    Job Title: Site Reliability Engineer-IIWe are seeking a highly skilled Site Reliability Engineer-II to join our team at Bloomreach. As a Site Reliability Engineer-II, you will be responsible for improving and managing infrastructure to drive efficiency and scalability.Responsibilities:Improve and manage infrastructure to drive efficiency and...


  • Atlanta, Georgia, United States Tata Consultancy Services Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Tata Consultancy Services. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our mission-critical services.Key ResponsibilitiesAutomate Infrastructure and Testing: Automate infrastructure needs,...


  • Atlanta, Georgia, United States Ditto Job Board Full time

    Job Title: Site Reliability EngineerAt Ditto, we're on a mission to unleash the full power of edge devices by removing all the plumbing required to build amazing applications. As a Site Reliability Engineer, you'll play a critical role in helping us achieve this goal.About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our Federal...


  • Atlanta, Georgia, United States GeorgiaTEK Systems Inc. Full time

    Job Title: Site Reliability EngineerGeorgiaTEK Systems Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Atlanta, Georgia, United States T-Mobile Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at T-Mobile. As a Site Reliability Engineer, you will play a critical role in ensuring the availability, scalability, and performance of our IT services.Key ResponsibilitiesDesign and implement scalable and reliable systems and processes to support our IT...


  • Atlanta, Georgia, United States JobRialto Full time

    Job Title: Site Reliability EngineerJobRialto is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement performance indicators in SOA to monitor system...


  • Atlanta, Georgia, United States Geotab Full time

    Job Title: Site Reliability EngineerGeotab is a global leader in IoT and connected transportation, and we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and performance of our core MyGeotab applications.Key Responsibilities:Provide escalated...


  • Atlanta, Georgia, United States JobRialto Full time

    Job SummaryThe Site Reliability Engineer is responsible for ensuring the availability, scalability, and performance of critical services and systems. This role requires expertise in OpenShift and CloudFormation, along with a deep understanding of site reliability principles, container technologies, monitoring tools, and automation.Key ResponsibilitiesEnsure...


  • Atlanta, Georgia, United States Navtech Full time

    Job Title: Site Reliability EngineerJob Description:We are seeking a highly skilled Site Reliability Engineer to join our team at Navtech. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our production systems.Key Responsibilities:Provide L4 technical support for production 24x7Design and...


  • Atlanta, Georgia, United States JobRialto Full time

    Job SummaryThe Site Reliability Engineer is a critical role that ensures the availability, scalability, and performance of our cloud-based services and systems. This position requires expertise in OpenShift and CloudFormation, as well as a deep understanding of site reliability principles, container technologies, monitoring tools, and automation.Key...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerLocation: RemoteDuration: ContractContact: SUNDARRAJAN MURALIJob Requirements: We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. The ideal candidate will have strong knowledge in AWS cloud platform and expertise in developing and maintaining monitoring tools, alerts, and...


  • Atlanta, Georgia, United States Navtech Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Navtech Inc. in Atlanta or St. Louis. As a Site Reliability Engineer, you will be responsible for ensuring the availability, performance, and security of our production systems.Key Responsibilities:Provide L4 technical support for production...

Site Reliability Engineer

2 months ago


Atlanta, United States Tata Consultancy Services Full time

Job Description


Job Type: Fulltime

Location: Atlanta GA (Onsite)

Experience: 6+years


  • Automating work including infrastructure needs, testing, failover solutions, failure mitigation, and much more
  • Debugging complex problems across an entire stack and creating solid solutions
  • Developing and building CI/CD processes to improve cadence
  • Using Chaos Engineering to test what you build under real-world conditions
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
  • Experience with an APM tool such as Dynatrace, New Relic, AppDynamics, or Datadog.
  • Performance Measurement and Tuning: Knowledge of system performance, testing and programming; ability to monitor, measure, and optimize system performance and network communication.
  • Site Reliability Engineering: Knowledge of the theories and methodologies of reliability engineering; ability to design, develop and support various tools, services and applications to maintain a reliable site environment.
  • Support capacity planning, availability, scalability, security and latency considerations for new infrastructure and service provisioning as appropriate
  • Responsible for improvements to end-to-end availability and performance of mission critical services and build automation to prevent problem recurrence.
  • Strong experience setting SLOs / SLIs / error budgets and managing of reliability for infrastructure and applications
  • Partner with other SREs to bring best practices or learnings from across the organization to them
  • Scale and optimize existing infrastructure and services sustainably through mechanisms, including automation, and evolve them by improving reliability and efficiency
  • Manage end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence
  • Maintain infrastructure and services by measuring, and monitoring system metrics to proactively identify operational efficiencies, potential outages and security threats in Development, UAT, Staging and Production environments
  • Practice sustainable incident response and blameless postmortems
  • Develop and maintain solution and operational documentation and designs for all infrastructure and services within the scope of SRE

Other Skills

  • AWS SysOps Administrator OR AWS DevOps Engineer certification
  • Experience with Akamai or related WAF application preferred.
  • Experience with OpenShift, Kubernetes.
  • Experience with setting up synthetic monitors and tracking SLAs.
  • Experience with airline applications and infrastructure technology is a plus.
  • Experience developing applications and/or automation runn ing in Red Hat OpenShift is a plus.