Current jobs related to Principal Site Reliability Engineer - Charlotte, North Carolina - Brightspeed


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud-based systems.Key Responsibilities:Design and implement monitoring systems...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-native applications.Key Responsibilities:Design and implement monitoring systems to...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering organization, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job Title: Principal Site Reliability EngineerAt Brightspeed, we are reimagining how people live, work, play and connect by providing fast, reliable internet connections and an awesome customer experience in twenty states throughout the Midwest and South.We are seeking a highly skilled Principal Site Reliability Engineer to join our growing team. As a key...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job Title: Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and infrastructure.Key...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job SummaryWe are seeking a highly skilled Principal Site Reliability Engineer to lead our team in ensuring the reliability and scalability of our business-critical systems and infrastructure.Key ResponsibilitiesDesign and implement monitoring systems to track performance and availability of critical systems and infrastructure.Develop and maintain scripts...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job Title: Principal Site Reliability EngineerAt Brightspeed, we are reimagining how people live, work, play, and connect by providing fast, reliable internet connections and an exceptional customer experience in twenty states throughout the Midwest and South.We are seeking a highly skilled Principal Site Reliability Engineer to join our team. As a key...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job Title: Principal Site Reliability EngineerAt Brightspeed, we are reimagining how people live, work, play, and connect by providing fast, reliable internet connections and an exceptional customer experience in twenty states throughout the Midwest and South.We are backed by funds managed by Apollo Global Management, and our vision is to accelerate the...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job DescriptionWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Implement and maintain monitoring systems to track the performance and...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    We are seeking a Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our team, you will be responsible for implementing and maintaining monitoring systems to track the performance and availability of business-critical systems and infrastructure.Key responsibilities include:Implementing and maintaining monitoring systems to...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job DescriptionAt Brightspeed, we are reimagining how people live, work, play and connect by providing fast, reliable internet connections and an awesome customer experience in twenty states throughout the Midwest and South.We are currently looking for a Principal Site Reliability Engineer to join our growing team. In this role, you will be responsible for...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job DescriptionWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Implement and maintain monitoring systems to track the performance and...


  • Charlotte, North Carolina, United States City National Bank Full time

    Job SummaryCity National Bank is seeking a highly skilled Site Reliability Principal Engineer to join our team. As a Site Reliability Principal Engineer, you will be responsible for designing, building, and managing large-scale, fault-tolerant systems. Your role will be to ensure the reliability, scalability, and maximum uptime of CNB systems in the Data...


  • Charlotte, North Carolina, United States BrightSpeed Full time

    Job DescriptionWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and infrastructure.Key Responsibilities:Implement and...


  • Charlotte, North Carolina, United States City National Bank Full time

    Job Title: Site Reliability Principal EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our engineering organization, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our cloud-based systems.Key Responsibilities:Design and...


  • Charlotte, North Carolina, United States Oraapps Inc Full time

    Job Title: Site Reliability EngineerAt Oraapps Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Charlotte, North Carolina, United States Digital Technology Solutions Llc Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Digital Technology Solutions LLC. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our production environment.Key Responsibilities:Monitor and maintain the production...


  • Charlotte, North Carolina, United States V2soft Full time

    About V2SoftV2Soft is a global company with a strong presence in multiple regions, including North America, Europe, and Asia. Our headquarters is located in Bloomfield Hills, Michigan, and we have a diverse team of professionals working together to deliver high-quality technology solutions to our clients.The RoleWe are seeking a skilled Site Reliability...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability EngineerCapgemini is seeking a seasoned Site Reliability Engineer to join our Trade Distribution System (TDS) software development team. As a Site Reliability Engineer, you will be responsible for advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system performance.Key...


  • Charlotte, North Carolina, United States Digital Technology Solutions Llc Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a skilled Site Reliability Engineer to join our team at Digital Technology Solutions LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and performance of our production environment.Key Responsibilities:• Run the production environment...

Principal Site Reliability Engineer

2 months ago


Charlotte, North Carolina, United States Brightspeed Full time
Job Description

We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. In this role, you will be responsible for implementing and maintaining monitoring systems to track the performance and availability of business-critical systems and infrastructure.

Key Responsibilities:

  • Implement and maintain monitoring systems to track the performance and availability of business-critical systems and infrastructure.
  • Respond to system outages and performance issues, performing root cause analysis to prevent recurrence.
  • Develop scripts and tools to automate repetitive tasks, such as deployment, scaling, and monitoring.
  • Work closely with development teams, operations, and other stakeholders to ensure that new services and features are reliable and scalable.
  • Work on reducing latency and improving the speed of data transmission across the network.
  • Define and measure Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure services meet required performance and availability targets.
  • Conduct postmortems after incidents to identify what went wrong and what can be improved.
  • Work with Lead Application owners and internal Change Management to review code changes and support deployments.
  • Lead the team of site reliability engineers onshore/offshore, mentor them for support activities required for system reliability.
  • Must have ability to communicate and abstract the messaging to multiple target audiences including Sr business & IT leadership, technology, and business teams.

Requirements:

  • Master's degree in computer science, telecommunications, or similar areas, with a minimum of 10 years software engineering experience, including a minimum of 5 years as a site reliability engineer.
  • Proven track record of managing mission critical customer facing applications for reliability.
  • 5+ years of experience supporting operations and maintenance for cloud-native applications in production that are fault-tolerant, self-healing, scalable and high available.
  • Excellent troubleshooting and problem-solving skills, with a keen attention to detail to identify and resolve complex production issues.
  • Deep understanding of cloud computing platforms (GCP) and containerization technologies (Docker, Kubernetes).
  • Solid experience with core Kubernetes concepts such as Pods, Workloads, Services, Ingress/Egress, Deployments, ConfigMaps, HPA, Liveliness Probe, and Secrets.
  • Strong knowledge of infrastructure as code tools (Terraform, Ansible, ArgoCD) and CI/CD pipelines.
  • Strong experience working with integration of code quality tool (SonarQube or Checkmarx) with CI/CD pipeline.
  • Strong experience with monitoring, logging, and observability tools like Splunk, GCP log, Dynatrace etc.
  • Ability to work independently and as part of a collaborative team, effectively communicating technical concepts to both technical and non-technical stakeholders.
  • Must have proven written and verbal communication skills, including presentations using tools like PowerPoint.
  • Must have ability to communicate and abstract the messaging to multiple target audiences including Sr business & IT leadership, technology and business teams.

Bonus Points:

  • Certifications such as Google Professional Cloud DevOps Engineer or AWS Certified DevOps Engineer.