Principal Reliability Engineer

2 weeks ago


Reston, Virginia, United States | Microsoft | Full time

Microsoft is seeking a Senior Site Reliability Engineer to join our Cloud and Artificial Intelligence Silver Team. This team is tasked with the deployment and management of a Secure Work Area, which includes the infrastructure necessary for collaboration within a highly secure environment.

In this position, you will collaborate with engineers who facilitate a wide array of Azure services for internal clients in sectors that require stringent security and regulatory compliance. The systems and software you develop will need to adhere to the security policies and assurance standards set forth by both public and private sector clients.

At Microsoft, our mission is to empower every individual and organization on the planet to achieve more. We foster a culture of growth, innovation, and collaboration, striving to achieve our collective objectives. Each day, we build upon our core values of respect, integrity, and accountability to cultivate an inclusive workplace where everyone can succeed.

Essential Qualifications

  • 6+ years of technical experience in software engineering, network engineering, or systems administration.
    • OR a Bachelor's Degree in Computer Science, Information Technology, or a related field with 3+ years of technical experience in software engineering, network engineering, or systems administration.
    • OR a Master's Degree in Computer Science, Information Technology, or a related field with 2+ years of technical experience in software engineering, network engineering, or systems administration.

Additional Requirements

Security Clearance: Candidates must meet Microsoft and government security screening requirements for this role. This includes, but is not limited to, the following specialized security screenings:

  • The successful candidate must possess an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph. Maintaining or obtaining the appropriate U.S. Government clearance is essential for continued employment.
  • Clearance Verification: This role requires successful verification of the stated security clearance to meet federal government customer requirements.
  • Microsoft Cloud Background Check: This position requires passing the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Citizenship Verification: This role requires verification of U.S. citizenship due to legal restrictions associated with the position.

Preferred Qualifications

  • 7+ years of technical experience in software engineering, network engineering, or systems administration.
    • OR a Bachelor's Degree in Computer Science, Information Technology, or a related field with 4+ years of technical experience in software engineering, network engineering, or systems administration.
    • OR a Master's Degree in Computer Science, Information Technology, or a related field with 3+ years of technical experience in software engineering, network engineering, or systems administration.
    • OR a Doctorate Degree in Computer Science, Information Technology, or a related field.
  • 3+ years of experience with PowerShell, C#, or C++.
  • Experience managing large-scale distributed services with on-call responsibilities.
  • Ability to build consensus and influence across teams towards common objectives.
  • Ownership of the complete project lifecycle, demonstrating strong project management and communication skills.

As a Site Reliability Engineer IC4, the typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There are variations applicable to specific work locations.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, gender identity or expression, genetic information, marital status, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations, and ordinances.

Benefits and perks may vary depending on the nature of your employment with Microsoft and the country where you work.

We are looking for individuals who thrive on solving complex problems, developing innovative solutions, and collaborating within focused teams to enhance production reliability.

  • Demonstrates proficiency in distributed systems design, understanding interactions between cloud technology layers and components, and identifying optimal configurations for cloud technology solutions.
  • Develops insights into the code, features, and operations of specific products at scale to contribute to improvements in product availability, reliability, efficiency, observability, and performance.
  • Stays informed on industry trends, advancements in distributed systems and cloud technologies, and new tools or processes for enhancing product performance.
  • Contributes to development and design by leveraging technical expertise in large-scale distributed systems.
  • Engages with product engineering teams through code/design reviews and incident responses to propose enhancements across components and features.

Embody our culture and values.


