![Brightspeed](https://media.trabajo.org/img/noimg.jpg)
Principal Site Reliability Engineer
1 month ago
Job Description
We are currently looking for a Principal Site Reliability Engineer to join our growing team. In this role, you will implement and maintain monitoring systems to track the performance and availability of business-critical systems and infrastructure using metrics to identify trends and potential issues. You will also work closely with development teams, operations, and other stakeholders to ensure that new services and features are reliable and scalable.
As a Principal Site Reliability Engineer, your duties and responsibilities will include:
Implement and maintain monitoring systems to track the performance and availability of Business-critical systems and infrastructure. Use metrics to identify trends and potential issues. Respond to system outages and performance issues, performing root cause analysis to prevent recurrence Develop scripts and tools to automate repetitive tasks, such as deployment, scaling, and monitoring Work closely with development teams, operations, and other stakeholders to ensure that new services and features are reliable and scalable Work on reducing latency and improving the speed of data transmission across the network Define and measure Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure services meet required performance and availability targets+ Conduct postmortems after incidents to identify what went wrong and what can be improved Work with Lead Application owners and internal Change Management to review code changes and support deployments Lead the team of site reliability engineers onshore/offshore, mentor them for support activities required for system reliability Must have ability to communicate and abstract the messaging to multiple target audiences including Sr business & IT leadership, technology, and business teams.Qualifications:
Qualifications
WHAT IT TAKES TO CATCH OUR EYE:
Master’s degree in computer science, telecommunications, or similar areas, with a minimum of 10 years software engineering experience, including a minimum of 5 years as a site reliability engineer Proven track record of managing mission critical customer facing applications for reliability 5+ years of experience supporting operations and maintenance for cloud-native applications in production that are fault-tolerant, self-healing, scalable and high available Excellent troubleshooting and problem-solving skills, with a keen attention to detail to identify and resolve complex production issues Deep understanding of cloud computing platforms (GCP) and containerization technologies (e.g., Docker, Kubernetes) Solid experience with core Kubernetes concepts such as Pods, Workloads, Services, Ingress/Egress, Deployments, ConfigMaps, HPA, Liveliness Probe, and Secrets Strong knowledge of infrastructure as code tools (e.g., Terraform, Ansible, ArgoCD) and CI/CD pipelines Strong experience working with integration of code quality tool (SonarQube or Checkmarx) with CI/CD pipeline Strong experience with monitoring, logging, and observability tools like, Splunk, GCP log, Dynatrace etc. Ability to work independently and as part of a collaborative team, effectively communicating technical concepts to both technical and non-technical stakeholders Must have proven written and verbal communication skills, including presentations using tools like PowerPoint Must have ability to communicate and abstract the messaging to multiple target audiences including Sr business & IT leadership, technology and business teamsBONUS POINTS FOR:
Certifications such as Google Professional Cloud DevOps Engineer or AWS Certified DevOps Engineer#LI-SS1
Additional Information
WHY JOIN US?
** We aspire to contemporary ways of working.**
Recognized as a Top Workplace by the Charlotte Observer, Brightspeed HQ is located on the 7th floor of the new Vantage South End - East Tower in Charlotte, NC. We prioritize hiring talent in the Charlotte area, whenever possible, to make it a truly vibrant destination for our hybrid workforce. At Brightspeed, we have roles that are designated as remote, hybrid, office or field-based, depending on the position, business needs and individual circumstances. We also invest in technology that enables our entire team to stay connected. Why? Because Brightspeed recognizes the value of finding the best talent for the job, wherever they may be.
** We offer competitive compensation and comprehensive benefits.**
Our benefits and paid time off programs reflect our underlying belief in promoting overall wellness through physical, emotional and financial health. Brightspeed offers a comprehensive benefit program, including competitive medical, dental, vision, and life insurance; an employee assistance program; a 401K plan with company match and a host of voluntary benefits.
Diversity, equity and inclusion are at the center of our grounding belief in Being Real.
When we bring our authentic selves to work, everyone is better as a result. A diverse team helps us be fierce advocates for more accessible, inclusive and high-quality internet, because we believe doing so promotes equity in the communities we serve.
Brightspeed is an Equal Opportunity Employer/Veterans/Disabled
For all applicants, please take a moment to review our Privacy Notices:
Brightspeed’s Privacy Notice for California Residents Brightspeed’s Privacy NoticeWe have other current jobs related to this field that you can find below
-
Principal Site Reliability Engineer
2 weeks ago
Charlotte, United States BrightSpeed Full timeJob Description We are currently looking for a Principal Site Reliability Engineer to join our growing team. In this role, you will implement and maintain monitoring systems to track the performance and availability of business-critical systems and infrastructure using metrics to identify trends and potential issues. You will also work closely with...
-
Principal Site Reliability Engineer
1 month ago
Charlotte, United States Brightspeed Full timeJob Description We are currently looking for a Principal Site Reliability Engineer to join our growing team. In this role, you will implement and maintain monitoring systems to track the performance and availability of business-critical systems and infrastructure using metrics to identify trends and potential issues. You will also work closely with...
-
Principal Site Reliability Engineer
1 month ago
Charlotte, North Carolina, United States Brightspeed Full timeJob DescriptionWe are currently looking for a Principal Site Reliability Engineer to join our growing team. In this role, you will implement and maintain monitoring systems to track the performance and availability of business-critical systems and infrastructure using metrics to identify trends and potential issues. You will also work closely with...
-
Principal Site Reliability Engineer
1 month ago
Charlotte, United States Brightspeed Full timeJob DescriptionJob DescriptionCompany DescriptionAt Brightspeed, we are reimagining how people live, work, play and connect by providing fast, reliable internet connections and an awesome customer experience in twenty states throughout the Midwest and South.Backed by funds managed by Apollo Global Management, our vision is to accelerate the upgrade of...
-
Site Reliability Engineer
2 weeks ago
Charlotte, United States Regions Bank Full timeThank you for your interest in a career at Regions. At Regions, we believe associates deserve more than just a job. We believe in offering performance-driven individuals a place where they can build a career --- a place to expect more opportunities. If you are focused on results, dedicated to quality, strength and integrity, and possess the drive to succeed,...
-
Site Reliability Engineer
4 weeks ago
Charlotte, United States JobRialto Full timeJob Description: Looking for a forward-thinking, energetic Site Reliability Engineering Manager to join our team. PDL serves the ecommerce needs of leading and growing grocery retailers with millions of shoppers located throughout the East Coast and Midwest. PDL strives to enable our retailers to be number one in all markets they operate in by: Leading IT...
-
Site Reliability Engineer
2 months ago
Charlotte, United States JobRialto Full timeJob Description: Looking for a forward-thinking, energetic Site Reliability Engineering Manager to join our team. PDL serves the ecommerce needs of leading and growing grocery retailers with millions of shoppers located throughout the East Coast and Midwest. PDL strives to enable our retailers to be number one in all markets they operate in by: Leading IT...
-
Senior Site Reliability Engineer
1 week ago
Charlotte, United States Delta Air Lines Full timeUnited States, Georgia, Atlanta Information Technology 04-May-2024 Ref #: 24745 How you'll help us Keep Climbing (overview & key responsibilities) Delta IT is on a journey of transformation. We are changing the way we do business from top to bottom. As thought-leaders within Delta, we strive to create significant and innovative solutions and are looking...
-
Platform/Site Reliability Engineer
2 months ago
Charlotte, United States Syntricate Technologies Full timePlatform/Site Reliability Engineer 6 Months Contract to Hire Charlotte, NCJOB DESCRIPTION We're looking for a Senior Platform Engineer to come help us automate everything, enable our developer teammates, and create and support world-class platforms. As a Senior Platform Engineer, you will be an integral member of the Platform Engineering team, helping the...
-
Site Reliability Engineer
2 months ago
Charlotte, United States Saxon Global Full timeSite Reliability Engineer JOB SUMMARY This position is responsible for design, development and implementation of cloud based technologies. Provide technical expertise on complex projects and advanced troubleshooting of existing Cloud technology for use by department. Such as guidance and support in the development of progress at all system layers, including...
-
Senior Reliability Engineer, RAPA
2 months ago
Charlotte, United States SERC Reliability Corporation Full timeSERC OVERVIEW: The electric grid is vital to our everyday lives. It is fundamental for the health, safety, and well-being of our communities, and provides the platform for our economy and our societal and technological advances. SERC's mission is to reduce risks to the reliability and security of the electric grid (also known as the bulk power system), not...
-
Senior Reliability Engineer, RAPA
2 months ago
Charlotte, United States SERC Reliability Corporation Full timeJob DescriptionJob DescriptionSERC OVERVIEW:The electric grid is vital to our everyday lives. It is fundamental for the health, safety, and well-being of our communities, and provides the platform for our economy and our societal and technological advances. SERC's mission is to reduce risks to the reliability and security of the electric grid (also known...
-
Site Reliability Engineer
3 weeks ago
Charlotte, United States Recurring Decimal Full timeLocation- Hybrid | Charlotte, NC or Phoenix, AZKey Skills:Experience with one or more Cloud Platforms (Azure, GCP)Experience with Container technologies: Kubernetes, Docker, PKS, Azure Kubernetes Service (AKS)5+ years of experience in Site Reliability engineeringExperience setting up monitoring in applications and database.Experience in ServiceNow, Jira,...
-
Site Reliability Engineer
3 weeks ago
Charlotte, United States Recurring Decimal Full timeLocation- Hybrid | Charlotte, NC or Phoenix, AZKey Skills:Experience with one or more Cloud Platforms (Azure, GCP)Experience with Container technologies: Kubernetes, Docker, PKS, Azure Kubernetes Service (AKS)5+ years of experience in Site Reliability engineeringExperience setting up monitoring in applications and database.Experience in ServiceNow, Jira,...
-
Site Reliability Engineer
3 weeks ago
Charlotte, United States Cedent Consulting Full timeSite Reliability Engineer (Charlotte, NC) Role: Site Reliability Engineer Location: Charlotte, NC Client: Healthcare client Position Responsibilities: Code strategies and languages by leveraging knowledge while working with customers on configuration management initiatives. Coordinate and assist teams in building competencies with infrastructure using object...
-
Senior Software Site Reliability Engineer
4 weeks ago
Charlotte, United States Credit Karma Full timeIntuit Credit Karma is a mission-driven company, focused on championing financial progress for our more than 130 million members globally. While we're best known for pioneering free credit scores, our members turn to us for everything related to their financial goals, including identity monitoring, applying for credit cards, shopping for insurance and loans...
-
Senior Software Site Reliability Engineer
4 weeks ago
Charlotte, United States Credit Karma Full timeIntuit Credit Karma is a mission-driven company, focused on championing financial progress for our more than 130 million members globally. While we're best known for pioneering free credit scores, our members turn to us for everything related to their financial goals, including identity monitoring, applying for credit cards, shopping for insurance and loans...
-
Site Reliability Engineer
4 weeks ago
Charlotte, United States Cedent Consulting Full timeSite Reliability Engineer (Charlotte, NC) Role: Site Reliability Engineer Location: Charlotte, NC Client: Healthcare client Position Responsibilities: Code strategies and languages by leveraging knowledge while working with customers on configuration management initiatives. Coordinate and assist teams in building competencies with infrastructure using object...
-
Reliability Engineer
4 weeks ago
Charlotte, United States JLL Full timeJLL is seeking a Reliability Engineer to join our team! This exciting opportunity is responsible for providing reliability engineering support for operations and maintenance of buildings, infrastructure, and equipment assets. In coordination and full collaboration with the Engineering Services Reliability & Asset Management COE, the Reliability Engineer is...
-
Digital One
7 days ago
Charlotte, United States Jobs for Humanity Full timeJob Description A variety of soft skills and experience may be required for the following role Please ensure you check the overview below carefully.Position Type :Full time Type Of Hire :Experienced (relevant combo of work and education) Education Desired :Bachelor of Computer Science Travel Percentage :5 - 10%Job DescriptionAs the world works and lives...