Site Reliability Engineer - Atlanta - Tata Consultancy Services
Site Reliability Engineer
2 months ago
Job Description
Job Type: Full-time
Location: Atlanta GA (Onsite)
Experience: 6+ years
- Automating work, including infrastructure needs, testing, failover solutions, and failure mitigation
- Debugging complex problems across an entire stack and creating solid solutions
- Developing and building CI/CD processes to improve cadence
- Using Chaos Engineering to test what you build under real-world conditions
- Triage product or system issues, then debug, track, and resolve them by analyzing their sources and their impact on hardware, network, or service operations and quality.
- Participate in or lead design reviews with peers and stakeholders to decide among available technologies.
- Experience with an APM tool such as Dynatrace, New Relic, AppDynamics, or Datadog.
- Performance Measurement and Tuning: Knowledge of system performance, testing and programming; ability to monitor, measure, and optimize system performance and network communication.
- Site Reliability Engineering: Knowledge of the theories and methodologies of reliability engineering; ability to design, develop and support various tools, services and applications to maintain a reliable site environment.
- Support capacity planning, availability, scalability, security and latency considerations for new infrastructure and service provisioning as appropriate
- Improve the end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence.
- Strong experience setting SLOs, SLIs, and error budgets and managing reliability for infrastructure and applications
- Partner with other SREs to share best practices and learnings from across the organization
- Scale and optimize existing infrastructure and services sustainably through mechanisms, including automation, and evolve them by improving reliability and efficiency
- Manage end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence
- Maintain infrastructure and services by measuring and monitoring system metrics to proactively identify operational efficiencies, potential outages, and security threats in Development, UAT, Staging, and Production environments
- Practice sustainable incident response and blameless postmortems
- Develop and maintain solution and operational documentation and designs for all infrastructure and services within the scope of SRE
Other Skills
- AWS SysOps Administrator OR AWS DevOps Engineer certification
- Experience with Akamai or a related WAF application is preferred.
- Experience with OpenShift, Kubernetes.
- Experience with setting up synthetic monitors and tracking SLAs.
- Experience with airline applications and infrastructure technology is a plus.
- Experience developing applications and/or automation running in Red Hat OpenShift is a plus.