Site Reliability Engineer

2 weeks ago


Charlotte, North Carolina, United States Regions Bank Full time
Job Description:

At Regions Bank, we are seeking a skilled Site Reliability Engineer to join our team. This role is responsible for ensuring the dependability of our firm's most critical system applications.

This position will be called upon to solve major issues, understand and remediate the points of system failure, as well as work with internal teams to optimize system applications' reliability, performance, and capacity.

Key Responsibilities:
  • Ensures a holistic view is taken of our system applications' overall health
  • Solves problems relating to mission critical services and creates solutions to prevent problem recurrence
  • Collaborates with team members to improve the company's engineering tools, systems and procedures
  • Designs and deploys applications and enhancements to improve reliability, scalability, latency, and efficiency
  • Tests installed software for malfunction detection
  • Identifies and resolves business systems issues
  • Measures effectiveness and efficiency of existing systems
  • Develops and implements strategies to improve systems
  • Monitors and tests system performance
  • Assists management with auditing team processes and procedures on a routine basis
  • Provides high-level training, guidance, and advice to junior engineers
  • Manages complex projects
  • Serves as a subject-matter expert
Requirements:
  • High School Diploma or GED and eleven (11) years of related experience
  • Or Bachelor's degree in Computer Science, Computer Engineering or a related field and seven (7) years of related experience
  • Skills and Competencies
  • Ability to collaborate with programmers, developers, and other technology professionals to achieve a common objective
  • Ability to conduct system analysis to detect issues with performance
  • Ability to develop and implement technology solutions to resolve technical challenges
  • Ability to manage multiple projects simultaneously
  • Knowledge of software testing techniques, code optimization and software debugging
  • Strong communication, analytical and problem-solving skills
  • Strong executional capabilities
  • Thorough understanding of software structures, hardware, computing systems and how to integrate them
Desired Skills:
  • Cloud technologies: Prior experience supporting hybrid environments with one or more Cloud providers (AWS, Azure)
  • Observability: Prior experience implementing one or more Commercial Observability/APM solutions (Dynatrace, New Relic, Datadog, AppDynamics, Honeycomb)
  • Monitoring and Logging: Solid familiarity with Splunk, Elastic, OpenSearch, Prometheus, Grafana
  • Implementing Site Reliability Engineering (SRE) principles SLO/SLI
  • Experience troubleshooting and resolving issues with critical business apps.
  • Solid knowledge of servers, infrastructure, load balancers, storage etc.
  • Operating Systems competency: solid understanding of Unix/Linux and windows
  • Technologies: Kubernetes, Containers, serverless
  • Languages/Programming: One or more of the following: Bash or ksh, Powershell, Java, Python, JavaScript, or any other common computer language competency
  • Configuration management and IaC: Prior experience writing and utilizing Ansible and Terraform
  • Cloud cert(s): AWS Solutions Architect Associate or AWS DevOps Associate
Benefits:
  • Paid Vacation/Sick Time
  • 401K with Company Match
  • Medical, Dental and Vision Benefits
  • Disability Benefits
  • Health Savings Account
  • Flexible Spending Account
  • Life Insurance
  • Parental Leave
  • Employee Assistance Program
  • Associate Volunteer Program


  • Charlotte, North Carolina, United States Oraapps Inc Full time

    Job Title: Site Reliability EngineerAt Oraapps Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Charlotte, North Carolina, United States Digital Technology Solutions Llc Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Digital Technology Solutions LLC. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our production environment.Key Responsibilities:Monitor and maintain the production...


  • Charlotte, North Carolina, United States V2soft Full time

    About V2SoftV2Soft is a global company with a strong presence in multiple regions, including North America, Europe, and Asia. Our headquarters is located in Bloomfield Hills, Michigan, and we have a diverse team of professionals working together to deliver high-quality technology solutions to our clients.The RoleWe are seeking a skilled Site Reliability...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability EngineerCapgemini is seeking a seasoned Site Reliability Engineer to join our Trade Distribution System (TDS) software development team. As a Site Reliability Engineer, you will be responsible for advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system performance.Key...


  • Charlotte, North Carolina, United States Insight Global Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our production environment.Key Responsibilities:Monitor availability and take a holistic view of system healthSupport...


  • Charlotte, North Carolina, United States Digital Technology Solutions Llc Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a skilled Site Reliability Engineer to join our team at Digital Technology Solutions LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and performance of our production environment.Key Responsibilities:• Run the production environment...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability EngineerCapgemini is seeking a seasoned Site Reliability Engineer to join our Trade Distribution System (TDS) software development team. As a Site Reliability Engineer, you will play a critical role in advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system performance.Key...


  • Charlotte, North Carolina, United States Matlen Silver Full time

    Job Title: Site Reliability EngineerMatlen Silver is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, performance, and reliability of our Fulfillment Technology solutions.Key Responsibilities:Partner with application engineering, observability, and other...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud-based systems.Key Responsibilities:Design and implement monitoring systems...


  • Charlotte, North Carolina, United States Digital Technology Solutions Full time

    Job Title: Site Reliability EngineerAbout the Role:At Digital Technology Solutions, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our production environment.Key Responsibilities:1. Monitoring and Incident...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability EngineerCapgemini is seeking a seasoned Site Reliability Engineer to join our Trade Distribution System (TDS) software development team. As a Site Reliability Engineer, you will play a critical role in advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system performance.Key...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability EngineerCapgemini is seeking a seasoned Site Reliability Engineer to join our Trade Distribution System (TDS) software development team. As a Site Reliability Engineer, you will play a critical role in advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system performance.Key...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-native applications.Key Responsibilities:Design and implement monitoring systems to...


  • Charlotte, North Carolina, United States SERC Reliability Corporation Full time

    Job Title: Senior Reliability EngineerAt SERC Reliability Corporation, we are seeking a highly skilled Senior Reliability Engineer to join our team. As a key member of our Reliability Assessment, Performance Analysis, and Technical Services group, you will play a critical role in ensuring the reliability and security of the electric grid.Job Summary:The...


  • Charlotte, North Carolina, United States Matlen Silver Full time

    Job Title: Site Reliability Engineer (SRE)Duration: 6+ monthsLocation: Charlotte, NCRequired Pay Scale: $67-$70/hour W2** No C2CJob Description/Requirements:True SRE with 6+ years of experienceMust have AWS/Cloud expertiseTriage, incident response, root cause analysis, application improvement, reliabilityLamda, ECS, APIs, Dynatrace/Datadog knowledge, gitlab,...


  • Charlotte, North Carolina, United States Matlen Silver Full time

    Job Title: Site Reliability Engineer (SRE)Duration: 6+ monthsLocation: Charlotte, NCRequired Pay Scale: $67-$70/hour W2** No C2CJob Description/Requirements:True SRE with 6+ years of experienceAWS/Cloud expertiseTriage, incident response, root cause analysis, application improvement, reliabilityLamda, ECS, APIs, Dynatrace/Datadog knowledge, gitlab,...


  • Charlotte, North Carolina, United States Tandym Group Full time

    Job Title: Site Reliability EngineerTandym Group is seeking a skilled Site Reliability Engineer to support a financial client based in Charlotte. The ideal candidate will have a strong background in DevOps tools and technologies, as well as experience with infrastructure as code tools such as Terraform.Responsibilities:Monitor the production environment to...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability EngineerLocation: Atlanta, USJob Type: PermanentJob Description:We are seeking a seasoned Site Reliability Engineer to join our team at Capgemini. As a Site Reliability Engineer, you will be responsible for advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering organization, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and infrastructure.Key...