SRE Team Lead

3 days ago


Charlotte, North Carolina, United States Maintec Technologies Full time
Job Description - SRE Team Lead

At Maintec Technologies, we're seeking a highly skilled SRE Team Lead to join our team. As a Site Reliability Engineer, you'll play a critical role in building and maintaining our engineering discipline, combining software and systems to develop engineering solutions to operations problems.

Responsibilities:
  • Lead the development and implementation of Service Level Agreements (SLA), Service Level Objectives (SLO), and associated metrics for our Critical Java applications deployed in Cloud.
  • Provide Cloud operations management, Cloud services deployments, and ensure Cloud Services availability.
  • Develop and maintain strong knowledge of automation and scripting language (Python) to automate tasks and improve efficiency.
  • Collaborate with the SRE team to maintain the integrity of cloud services deployments and ensure seamless integration with other teams.
  • Responsible for managing incidents, problems, change management, release management, analytics on previous incidents, and usage patterns.
  • Manage new development, new enhancement, and operationalize changes to ensure smooth deployment and minimal downtime.
  • Develop and maintain the On Call staffing plan, roster, allocation of team members, internal and external communication, and reporting.
  • Plan and implement patching and upgrades to ensure our systems remain secure and up-to-date.
  • Analyze system health metrics to identify areas for improvement and optimize system performance.
  • Enforce best practices for security, reliability, resiliency, self-healing, HA, automation, and quality of service.
  • Establish and follow SRE Principles to ensure our team operates efficiently and effectively.
  • Coordinate and manage operational schedules and priorities to ensure seamless execution.
  • Monitor and report on infrastructure performance metrics to ensure our systems are running optimally.
Requirements:
  • 12+ years of overall experience with 5+ years in SRE Technical Manager role handling IaaS, PaaS, and Microservices on PCF/Azure.
  • 4+ years of experience as SRE Engineer in DevOps, DataOps, SecOps, or InfraOps.
  • 2+ years of experience as Level 1, 1.5, or 2 support/operations with 24x7 support across onsite/offshore/nearshore model.
  • Experience managing a large global cloud organization working in multiple locations and time zones.
  • Brings the best of the industry and the organization along in the journey.
  • Good knowledge of Information Technology Infrastructure Library processes.
  • Experience managing SLI, SLO, Toil management, Error budget, and metrics.
  • Experience in cloud reliability standards, observability, security, performance, disaster recovery, and reporting requirements.
  • Experience with identifying Manual, repetitive, automatable tasks and automating them.
  • Experience with IT and Cloud security standards and compliance.
  • Hands-on experience working on Java, PCF, or Azure Platforms.
  • Hands-on experience in working Azure AD.
  • Hands-on experience in automation and scripting using Python.
  • Strong expertise in Cloud concepts like Infrastructure as Code, Cloud Computing, Cloud Networking, Cloud Storage & Backup, Containerization, SSO, sFTP, and SRE.
  • Experience in understanding and implementing SecOps needs.
  • Experience in release, deployment of patches across the spectrum of scope.
Process Skills:
  • Having sound knowledge of ITIL practices like Change Management, Incident Management, Problem management, release management, etc.
  • Exceptional communication skills.
  • Self-starter, ambitious, willing to take on difficult problems.
  • Collaborative, team player attitude.
  • Practical exposure & knowledge in existing/emerging cloud Database technologies.
  • Has worked in Metrix role with an ability to work independently with multiple managers with dotted line hierarchies.
  • Keeping abreast of industry trends, technology innovation, and changing customer requirements to help with the continual service improvement process.
  • Participate in on-call rotations and be responsible for infrastructure and platform level escalations.
  • Work with the DevOps team on planning and implementation of infrastructure capacity planning, upgrades, and monitoring.
  • Participate in Daily (Standup) Production Reviews.
  • Contribute to the design and improvement of deployment architecture of new and existing applications based on the principles of reliability, high availability, efficiency, and observability.
  • Research, learn, adapt, customize, and create tools to improve the observability, resilience, and usability of applications in scope.
  • Create and maintain SRE-related documentation (solution repository, Root Cause Analysis Reports, etc).
Certification:
  • Certification in PCF, Java mandatory.

  • Principal SRE

    7 days ago


    Charlotte, North Carolina, United States Apex Systems Full time

    Principal SRE Job DescriptionWe are seeking a highly skilled Principal SRE to join our dynamic SRE team at Apex Systems. As a subject matter expert and SRE professional, you will be responsible for analyzing complex data and distributed systems, anticipating problems, and finding ways to mitigate risks to the environment.Main Responsibilities:Optimize...

  • Principal SRE

    2 days ago


    Charlotte, North Carolina, United States Apex Systems Full time

    Job Title: Principal SREApex Systems is seeking a highly skilled Principal SRE to join our dynamic SRE team. As a subject matter expert and SRE professional, you will be responsible for analyzing complex data and distributed systems, anticipating problems, and finding ways to mitigate risks to the environment.Key Responsibilities:Optimize day-to-day...

  • Principal SRE

    3 days ago


    Charlotte, North Carolina, United States Apex Systems Full time

    Job Title: Principal SREWe are seeking a highly skilled Principal SRE to join our dynamic team at Apex Systems. As a Principal SRE, you will be responsible for leading the design, build, and implementation of orchestration and tooling solutions to optimize workflows and tasks. You will also establish operational best practices for structuring, automating,...

  • Principal SRE

    1 day ago


    Charlotte, North Carolina, United States Apex Systems Full time

    Principal Site Reliability EngineerWe are seeking a highly skilled Principal Site Reliability Engineer to join our dynamic SRE team. As a subject matter expert, you will be responsible for analyzing complex data and distributed systems, anticipating problems, and finding ways to mitigate risks to the environment.Key Responsibilities:Lead the design, build,...

  • SRE Architect

    8 hours ago


    Charlotte, North Carolina, United States CapB InfoteK Full time

    Job Title: SRE Architect - Enterprise Platform ExpertWe are seeking an experienced SRE Architect to lead our multiyear project, driving the transformation of clients through technology and innovation.Key Responsibilities:Develop and implement SRE strategies to improve system reliability and scalabilityLead client DevOps capability assessments and provide...


  • Charlotte, North Carolina, United States Motion Recruitment Full time

    Job Title: SRE/Site Reliability EngineerJoin a leading financial services company as a SRE/Site Reliability Engineer and be part of a team that drives innovation and excellence in the industry.About the Role:We are seeking a highly skilled SRE/Site Reliability Engineer to join our team in Chandler, AZ, Charlotte, NC, Iselin, NJ, Irving, TX, and/or New York,...


  • Charlotte, North Carolina, United States Jobot Full time

    Job Title: Azure SREJobot is seeking a highly skilled Azure SRE Engineer to join our fully remote team. As a key member of our team, you will be responsible for driving the implementation of Site Reliability Engineering (SRE) practices across our enterprise-level systems.Key Responsibilities:Architect and manage Azure Cloud infrastructure to ensure secure...


  • Charlotte, North Carolina, United States Jobot Full time

    Remote Azure SRE WantedThis is a unique opportunity to join a growing tech consulting firm as a Senior Cloud SRE Engineer. In this role, you will be instrumental in driving the implementation of Site Reliability Engineering (SRE) practices across our enterprise-level systems.Key Responsibilities:Architect and manage Azure Cloud infrastructure, ensuring...


  • Charlotte, North Carolina, United States CapB InfoteK Full time

    Job Title: SRE DevOps Telemetry EngineerWe are seeking a highly skilled SRE DevOps Telemetry Engineer to join our team at CapB InfoteK. As a key member of our team, you will be responsible for designing and implementing critical software components on our telemetry platform.Key Responsibilities:Design and code critical software components on the telemetry...


  • Charlotte, North Carolina, United States CapB InfoteK Full time

    Job Title: SRE DevOps Telemetry EngineerWe are seeking a highly skilled SRE DevOps Telemetry Engineer to join our team at CapB InfoteK. As a key member of our team, you will be responsible for designing and implementing critical software components on our telemetry platform.Key Responsibilities:Design and code critical software components on the telemetry...

  • SRE Engineer

    3 days ago


    Charlotte, North Carolina, United States 1 Point System Full time

    Job Title: SRE - W2Job Summary:We are seeking a highly skilled SRE Engineer to join our Platform team. As an SRE Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our payment processing systems.Key Responsibilities:Support a suite of applications across varying technologies enabling wires and payment...


  • Charlotte, North Carolina, United States True Team Medical Full time

    Job Title: Community Support Team LeadWe are seeking a highly skilled and compassionate Community Support Team Lead to join our team at True Team Medical. As a key member of our community-based therapy and case management team, you will provide essential support and services to adults with serious and persistent mental health issues.Key...


  • Charlotte, North Carolina, United States Wells Fargo Full time

    About this RoleWells Fargo is seeking a highly skilled Lead Software Engineer to join our Home Lending group as part of Consumer Technology. This role offers a unique opportunity to lead complex technology initiatives and develop standards and best practices for engineering complex and large-scale technology solutions.Key ResponsibilitiesLead complex...


  • Charlotte, North Carolina, United States Wells Fargo Full time

    About this role:Wells Fargo is seeking a highly skilled Lead Software Engineer to join our Home Lending group as part of Consumer Technology. This role requires a strong background in software engineering, with a focus on cloud and DevOps technologies.Key Responsibilities:Lead complex technology initiatives, including those with broad impact across the...


  • Charlotte, North Carolina, United States Wells Fargo Full time

    About this role:Wells Fargo is seeking a highly skilled Lead Software Engineer to join our Home Lending group as part of Consumer Technology. This role requires a strong background in software engineering, with a focus on cloud and DevOps technologies.Key Responsibilities:Lead complex technology initiatives, including those with broad impact across the...


  • Charlotte, North Carolina, United States Wells Fargo Full time

    About this role:Wells Fargo is seeking a highly skilled Lead Software Engineer to join our Home Lending group as part of Consumer Technology. This role requires a strong background in software engineering, site reliability engineering, and cloud computing.Key Responsibilities:Lead complex technology initiatives, including those with broad impact across the...


  • Charlotte, North Carolina, United States Jobot Full time

    Remote Azure SRE WantedWe are seeking a highly skilled Senior Cloud SRE Engineer to join our fully remote team at Jobot. As a key member of our team, you will be responsible for driving the implementation of Site Reliability Engineering (SRE) practices across our enterprise-level systems.Key Responsibilities:Architect and manage Azure Cloud infrastructure to...


  • Charlotte, North Carolina, United States GEICO Full time

    Senior Engineer Position at GEICOGEICO is seeking a skilled Senior Engineer to contribute to the transformation of our insurance business.Key Responsibilities:Provide technical leadership to engineering teamsOwn complete solutions lifecycleInfluence technical vision with product managers and other teamsLead design sessions and code reviewsMentor junior team...


  • Charlotte, North Carolina, United States Wells Fargo Full time

    About this RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Wells Fargo. As a key member of our Application Support and SRE team, you will play a critical role in introducing and advancing SRE discipline across multiple applications and vertical lines of business.Key ResponsibilitiesInstantiate Site Reliability...


  • Charlotte, North Carolina, United States Matlen Silver Full time

    Job Title: Site Reliability Engineer (SRE)Duration: 6+ monthsLocation: Charlotte, NCRequired Pay Scale: $67-$70/hour W2** No C2CJob Description/Requirements:True SRE with 6+ years of experienceMust have AWS/Cloud expertiseTriage, incident response, root cause analysis, application improvement, reliabilityLamda, ECS, APIs, Dynatrace/Datadog knowledge, gitlab,...