Lead Manager of Site Reliability Engineering

1 week ago


Reston, Virginia, United States Microsoft Full time

Are you driven by a commitment to excellence in large-scale service delivery? We are seeking a Lead Manager of Site Reliability Engineering who possesses a blend of software development expertise, online service experience, and a dedication to quality. This role involves envisioning, designing, and executing cloud service offerings tailored for government clients.

About Our Services: Our cloud solutions are pivotal to our strategy, integrating trusted communication and collaboration tools with desktop and mobile applications. The Enterprise Cloud team collaborates with major enterprise and government clients to develop features that align with their unique requirements, facilitating seamless cloud adoption. Our clients maintain the highest standards for quality, security, reliability, availability, and performance.

Role Overview: The Site Reliability Engineering (SRE) team is responsible for providing leadership, guidance, and accountability in application architecture, system design, and comprehensive implementation. As a Lead SRE Manager, you will cultivate a team focused on identifying and delivering software enhancements, leveraging your expertise in software development, complexity analysis, and scalable system architecture. Strong collaboration skills are essential for working alongside other engineering teams to ensure that services and systems are stable and performant, meeting the rigorous expectations of our government clientele.

Key Attributes of the Ideal Candidate:

  • Enthusiastic about distributed systems and scalable service architectures.
  • Finds fulfillment in mentoring others and fostering a positive, collaborative team environment.
  • Thrives on tackling new technological challenges and is motivated to find solutions.
  • Passionate about enhancing software quality and continuously refining development, integration, and deployment processes.
  • A proactive, highly motivated self-starter who excels in a dynamic, technical landscape.
  • Skilled collaborator with a proven track record of building technical partnerships across teams.
Qualifications

Essential Qualifications:

  • 6+ years of technical experience in software engineering, network engineering, or systems administration, or equivalent educational background.
  • 7+ years of experience in Software, Site Reliability, Systems, or Service Engineering.
  • Proficiency in multiple programming languages (C#, C++, Python, Java, etc.).
  • Demonstrated ability to drive improvements and deliver solutions in collaboration with stakeholders at all organizational levels.

Additional Requirements:

  • Security Clearance: Candidates must meet security screening requirements, including an active TS clearance, with the potential for upgrades.
  • Background Check: Successful completion of a Microsoft Cloud background check is required.
  • Citizenship Verification: This position necessitates verification of U.S. citizenship due to legal restrictions.

Preferred Qualifications:

  • 7+ years of technical experience in relevant fields, with advanced degrees considered.
  • Experience with large-scale cloud or distributed systems.
  • People management experience of 3+ years.
  • Proven success in enhancing the reliability and performance of cloud services.
  • Technical knowledge of Office 365 and Exchange architecture.
  • Prior experience requiring government screening and clearance.

Responsibilities:

  • Provide strategic technical leadership to a team of dedicated engineers.
  • Recruit, onboard, and develop a team of Software Engineers focused on Site Reliability.
  • Oversee the operation and enhancement of critical public-sector service environments.
  • Coordinate planning and execution with internal engineering teams and business partners.
  • Take ownership of deployment, availability, reliability, performance, and customer escalation targets.
  • Proactively identify and mitigate issues through effective design, testing, and software implementation.
  • Maintain high standards of employee and team satisfaction within the organization.


  • Reston, Virginia, United States Microsoft Full time

    Are you driven by a commitment to excellence in large-scale service delivery? We are seeking a Lead Manager of Site Reliability Engineering who possesses a unique blend of software development expertise, experience in online services, and a dedication to quality. This role is pivotal in conceptualizing, designing, and executing government cloud service...


  • Reston, Virginia, United States Microsoft Full time

    About the Role: Microsoft is seeking a Principal Site Reliability Engineer to join our dynamic Office 365 team, which is dedicated to delivering exceptional communication and collaboration solutions. In this pivotal role, you will leverage your expertise in ensuring the reliability and quality of our services, particularly within the government cloud sector....


  • Reston, Virginia, United States Microsoft Full time

    Microsoft is seeking a Senior Site Reliability Engineer to join our Cloud and Artificial Intelligence Silver Team. This team plays a crucial role in deploying and managing a Secure Work Area, which includes the infrastructure necessary for collaboration within a highly secure environment. In this position, you will collaborate with engineers who facilitate a...


  • Reston, Virginia, United States Microsoft Full time

    About the RoleWe are seeking a highly skilled and experienced Senior Site Reliability Engineering Manager to join our team at Microsoft. As a key member of our Cloud Services organization, you will be responsible for providing technical leadership and direction to a team of engineers focused on ensuring the reliability, availability, and performance of our...


  • Reston, Virginia, United States Microsoft Full time

    About the RoleWe are seeking a highly skilled and experienced Senior Site Reliability Engineering Manager to join our team at Microsoft. As a key member of our engineering organization, you will be responsible for providing technical leadership to a team of highly passionate and skilled engineers.Key Responsibilities:Recruit, on-board, and grow a team of...


  • Reston, Virginia, United States Microsoft Full time

    Unlock the Power of Cloud Services with MicrosoftAs a leader in cloud innovation, Microsoft is revolutionizing the business world with cutting-edge solutions. We're seeking skilled Site Reliability Engineers to design and implement top-notch solutions for our customers.Contribute to Shaping the Future of Cloud Computing3+ years of experience in software...


  • Reston, Virginia, United States Microsoft Full time

    About the Role: Join the Office 365 team as a Principal Site Reliability Engineer, where you will play a pivotal role in enhancing the delivery of essential features within our government cloud offerings. Your expertise in quality, reliability, and innovation will be crucial in advancing the continuous delivery of services that enhance the Teams Phone...


  • Reston, Virginia, United States Microsoft Full time

    About the RoleWe are seeking highly skilled Site Reliability Engineers I/II to join our team at Microsoft. As a Site Reliability Engineer, you will play a critical role in designing and implementing scenarios for our customers, ensuring the reliability and scalability of our cloud services.Key ResponsibilitiesDesign and implement solutions to ensure the...


  • Reston, Virginia, United States Red Gate Group Full time

    Company DescriptionAt RED GATE we do everything we can to serve our clients:Using the right technical skills, unique methodologies, best practices, and integrated technology, we help clients implement bold solutions. New approaches to emerging and evolving threats. Non-traditional ways to overcome entrenched obstacles. Advantage through opportunity. If you...


  • Reston, Virginia, United States Microsoft Corporation Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our cloud-first team at Microsoft Corporation. As a key member of our team, you will play a critical role in designing and implementing scalable and reliable cloud services for our customers.About the RoleDesign and implement cloud infrastructure solutions that meet the reliability...

  • Maintenance Manager

    4 days ago


    Reston, Virginia, United States RL Enterprise & Associates: Recruiting & Staffing Full time

    About the Role:The Maintenance Manager will be responsible for developing and leading safe and effective maintenance and reliability activities. This role will also serve as a liaison for third-party factory maintenance services.Key Responsibilities:Ensure compliance with all Environmental, Safety & Health (ESH) requirements.Manage all external maintenance...


  • Reston, Virginia, United States Amalgamated Sugar Company Full time

    Job SummaryThe Amalgamated Sugar Company is seeking a highly skilled Reliability Engineer to join our team. As a key member of our maintenance and reliability department, you will be responsible for ensuring the mechanical integrity and safety of our production equipment.About the RoleThis is a challenging and rewarding role that requires a strong background...


  • Reston, Virginia, United States Microsoft Full time

    Microsoft is seeking a Senior Site Reliability Engineer to join our Cloud and Artificial Intelligence Silver Team. This team is tasked with the deployment and management of a Secure Work Area, which includes the infrastructure necessary for collaboration within a highly secure environment. In this position, you will collaborate with engineers who facilitate...


  • Reston, Virginia, United States SAIC Full time

    Position OverviewSAIC is in search of a Reliability Engineer to become a vital part of our dynamic Engineering Innovation Factory Team, which consists of solution architects and digital engineers. This team is responsible for defining and constructing the infrastructure that drives the Digital Engineering Transformation across various sectors. Our...


  • Reston, Virginia, United States SAIC Full time

    Position OverviewSAIC is in search of a Reliability Engineer to become a vital part of our dynamic Engineering Innovation Factory Team. This team comprises solution architects and digital engineers dedicated to shaping and constructing the infrastructure that drives the Digital Engineering Transformation across various sectors. Our initiatives in creating...


  • Reston, Virginia, United States SAIC Full time

    Position OverviewSAIC is in search of a Reliability Engineer to become a vital member of our dynamic Engineering Innovation Factory Team. This team of solution architects and digital engineers is dedicated to defining and constructing the infrastructure that drives the Digital Engineering Transformation across various sectors. Our initiatives in creating...


  • Reston, Virginia, United States SAIC Full time

    Position OverviewSAIC is on the lookout for a Reliability Engineer to become a vital part of our dynamic Engineering Innovation Factory Team. This team of solution architects and digital engineers is dedicated to shaping and constructing the infrastructure that drives the Digital Engineering Transformation across various sectors. Our initiatives in creating...


  • Reston, Virginia, United States SAIC Full time

    Position OverviewSAIC is in search of a Reliability Engineer to be a vital part of our dynamic Engineering Innovation Factory Team. This team of solution architects and digital engineers is dedicated to defining and constructing the infrastructure that drives the Digital Engineering Transformation across various sectors. Our initiatives encompass a diverse...


  • Reston, Virginia, United States FEDERAL HOME LOAN BANKS OFFICE OF FINANCE Full time

    Position OverviewROLE: Lead Product Engineering ManagerDEPARTMENT: Information Technology FLSA: ExemptREPORTS TO: Director of Product EngineeringPosition SummaryThe Lead Product Engineering Manager is tasked with overseeing the product engineering division within the Office of Finance. This role encompasses a broad spectrum of business technology, focusing...


  • Reston, Virginia, United States Comcast Full time

    Job SummaryWe are seeking a highly skilled Senior Software Engineer - Site Reliability Engineering to join our team at Comcast. As a key member of our engineering team, you will be responsible for ensuring the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for our FreeWheel...