Principal Site Reliability Engineer

2 months ago


Charlotte, North Carolina, United States Brightspeed Full time
Job Summary

We are seeking a highly skilled Principal Site Reliability Engineer to lead our team in ensuring the reliability and scalability of our business-critical systems and infrastructure.

Key Responsibilities
  • Design and implement monitoring systems to track performance and availability of critical systems and infrastructure.
  • Develop and maintain scripts and tools to automate repetitive tasks, such as deployment, scaling, and monitoring.
  • Collaborate with development teams, operations, and stakeholders to ensure new services and features are reliable and scalable.
  • Reduce latency and improve data transmission speed across the network.
  • Define and measure Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure services meet performance and availability targets.
  • Conduct postmortems after incidents to identify areas for improvement.
  • Lead a team of site reliability engineers, mentoring them on support activities required for system reliability.
Requirements
  • Master's degree in computer science, telecommunications, or a similar field, with a minimum of 10 years of software engineering experience, including 5 years as a site reliability engineer.
  • Proven track record of managing mission-critical customer-facing applications for reliability.
  • 5+ years of experience supporting operations and maintenance for cloud-native applications in production.
  • Excellent troubleshooting and problem-solving skills, with keen attention to detail.
  • Deep understanding of cloud computing platforms (GCP) and containerization technologies (Docker, Kubernetes).
  • Strong knowledge of infrastructure as code tools (Terraform, Ansible, ArgoCD) and CI/CD pipelines.
  • Strong experience integrating code quality tools (SonarQube or Checkmarx) into CI/CD pipelines.
  • Strong experience with monitoring, logging, and observability tools (Splunk, GCP logging, Dynatrace).
  • Ability to work independently and as part of a collaborative team, effectively communicating technical concepts to both technical and non-technical stakeholders.
Preferred Qualifications
  • Certifications such as Google Professional Cloud DevOps Engineer or AWS Certified DevOps Engineer.


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud-based systems. Key Responsibilities: Design and implement monitoring systems...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-native applications. Key Responsibilities: Design and implement monitoring systems to...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering organization, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability Engineer. At Brightspeed, we are reimagining how people live, work, play and connect by providing fast, reliable internet connections and an awesome customer experience in twenty states throughout the Midwest and South. We are seeking a highly skilled Principal Site Reliability Engineer to join our growing team. As a key...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and infrastructure. Key...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability Engineer. At Brightspeed, we are reimagining how people live, work, play, and connect by providing fast, reliable internet connections and an exceptional customer experience in twenty states throughout the Midwest and South. We are seeking a highly skilled Principal Site Reliability Engineer to join our team. As a key...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Title: Principal Site Reliability Engineer. At Brightspeed, we are reimagining how people live, work, play, and connect by providing fast, reliable internet connections and an exceptional customer experience in twenty states throughout the Midwest and South. We are backed by funds managed by Apollo Global Management, and our vision is to accelerate the...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Description: We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will play a critical role in ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Implement and maintain monitoring systems to track the performance and...


  • Charlotte, North Carolina, United States Brightspeed Full time

    We are seeking a Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our team, you will be responsible for implementing and maintaining monitoring systems to track the performance and availability of business-critical systems and infrastructure. Key responsibilities include: Implementing and maintaining monitoring systems to...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Description: At Brightspeed, we are reimagining how people live, work, play and connect by providing fast, reliable internet connections and an awesome customer experience in twenty states throughout the Midwest and South. We are currently looking for a Principal Site Reliability Engineer to join our growing team. In this role, you will be responsible for...


  • Charlotte, North Carolina, United States City National Bank Full time

    Job Summary: City National Bank is seeking a highly skilled Site Reliability Principal Engineer to join our team. As a Site Reliability Principal Engineer, you will be responsible for designing, building, and managing large-scale, fault-tolerant systems. Your role will be to ensure the reliability, scalability, and maximum uptime of CNB systems in the Data...


  • Charlotte, North Carolina, United States Brightspeed Full time

    Job Description: We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Brightspeed. As a key member of our engineering team, you will be responsible for designing and implementing monitoring systems to track the performance and availability of our business-critical systems and infrastructure. Key Responsibilities: Implement and...


  • Charlotte, North Carolina, United States City National Bank Full time

    Job Title: Site Reliability Principal Engineer. At City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our engineering organization, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our cloud-based systems. Key Responsibilities: Design and...


  • Charlotte, North Carolina, United States Oraapps Inc Full time

    Job Title: Site Reliability Engineer. At Oraapps Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, implement, and maintain scalable and highly...


  • Charlotte, North Carolina, United States V2Soft Full time

    About V2Soft: V2Soft is a global company with a strong presence in multiple regions, including North America, Europe, and Asia. Our headquarters is located in Bloomfield Hills, Michigan, and we have a diverse team of professionals working together to deliver high-quality technology solutions to our clients. The Role: We are seeking a skilled Site Reliability...


  • Charlotte, North Carolina, United States Digital Technology Solutions LLC Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Digital Technology Solutions LLC. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our production environment. Key Responsibilities: Monitor and maintain the production...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability Engineer. Capgemini is seeking a seasoned Site Reliability Engineer to join our Trade Distribution System (TDS) software development team. As a Site Reliability Engineer, you will be responsible for advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system performance. Key...


  • Charlotte, North Carolina, United States Digital Technology Solutions LLC Full time

    Job Title: Site Reliability Engineer. About the Role: We are seeking a skilled Site Reliability Engineer to join our team at Digital Technology Solutions LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, scalability, and performance of our production environment. Key Responsibilities: • Run the production environment...


  • Charlotte, North Carolina, United States Capgemini Full time

    Job Title: Site Reliability Engineer. Capgemini is seeking a seasoned Site Reliability Engineer to join our Trade Distribution System (TDS) software development team. As a Site Reliability Engineer, you will play a critical role in advancing and enhancing reliability practices, with a strong focus on testing, monitoring, and maintaining system performance. Key...