Senior Site Reliability Engineering Manager

2 months ago


Redmond, United States Microsoft Corporation Full time

OverviewAre you passionate about hardware and enabling new technology? Do you enjoy complex problem solving and investigation? Azure has one of the largest storage services on the planet, holding Exabytes of data and files not just for our 3rd party customers, but also many of Microsoft's own services. This role will focus on managing an ever growing and changing fleet at scale to maximize efficiency while providing a stable environment for our customers. As a Senior Site Reliability Engineering Manager in Azure Storage team you will be working with a team of engineers focused on optimizing fleet availability and health. Leading a team of engineers to design, develop and improve automation and uptime. You will take lead of planning, investigating complex issues and designing solutions to solve problems at scale. This opportunity will allow you to deepen your knowledge and experience with massive distributed systems. Opportunities to have significant impact on reducing cost to the business. Exposure and visibility at VP and CVP levels. This position is located in Redmond and has a flexible work environment that supports working from home. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesDevelop, test, and implement changes to optimize code and improve scalability. You leverage end-to-end technical expertise and telemetry analysis to identify patterns and opportunities to implement configuration and automation improvments. You review the effect of changes to documents and share development insights within your team. You drive Sprint planning, SCRUM stand ups, code/design reviews, and host regular cross team / org meetings. Investigate hardware and system issues that are impacting available capacity and impacting customers. Understand the long term goals of the organization and understand the steps your team will have to take to achieve those. You respond to incidents during regular on-call rotations and share details related to incidents and their resolution through post-mortem reports and regular review meetings. As a member of the team you willl be expected to help drive bridges for recovery durring major outages. Embody our culture and values.



  • Redmond, Washington, United States Microsoft Full time

    Job Description">We're looking for a Site Reliability Engineer II who can envision, design, and deliver Office 365 government cloud service offerings. The successful candidate will have a passion for high-scale services, working with some of Microsoft's most critical customers, and delivering software improvements using expertise in software development,...


  • Redmond, Washington, United States Top Secret Clearance Jobs Full time

    About the RoleThe Principal Site Reliability Engineering Manager at Top Secret Clearance Jobs will lead the delivery of critical features in Office 365 government cloud offerings. This position requires a passion for quality, reliability, and creativity to drive evolution in the continuous delivery of IC3 services that power Teams.Key ResponsibilitiesProvide...


  • Redmond, United States Microsoft Corporation Full time

    Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified...


  • Redmond, United States Microsoft Corporation Full time

    Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified...


  • Redmond, United States Microsoft Full time

    Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified...


  • Redmond, United States Microsoft Corporation Full time

    Microsoft has an exciting opportunity for a  Senior Site Reliability Engineer  in the Cloud+Artificial Intelligence (C+AI) Silver SQL Team. This team is responsible for deploying and operating the Azure SQL family of services within Azure Government clouds. In this role, you will have the opportunity to work with engineers who enable a broad set of...


  • Redmond, United States Top Secret Clearance Jobs Full time

    About the job Principal Site Reliability Engineering Manager - CTJ - Top Secret Top Secret Clearance Jobs is dedicated to helping those with the most exclusive security clearance find their next career opportunity and get interviews within 48 hours. Microsoft Teams delivers smart communication and seamless collaboration, and the Intelligent Conversation and...


  • Redmond, Washington, United States Microsoft Full time

    Job Requirements:To be successful in this role, you will need strong problem-solving skills, excellent communication skills, both verbal and written, and the ability to work collaboratively as part of a high-performing team. You should have experience with one or more general-purpose programming languages, including but not limited to: Java, C/C++, C#,...


  • Redmond, United States Amazon Full time

    Description Project Kuiper is an initiative to increase global broadband access through a constellation of 3,236 satellites in low Earth orbit (LEO). Its mission is to bring fast, affordable broadband to unserved and underserved communities around the world. Project Kuiper will help close the digital divide by delivering fast, affordable broadband to a wide...


  • Redmond, United States Top Secret Clearance Jobs Full time

    About the job Site Reliability Engineer II - CTJ - Poly Top Secret Clearance Jobs is dedicated to helping those with the most exclusive security clearance find their next career opportunity and get interviews within 48 hours. Join the Commerce and Ecosystems (C+E) team - the next generation of platform and experiences enabling Microsoft and Azure, the...


  • Redmond, United States Microsoft Corporation Full time

    Do you have a passion for high scale services and working with some of Microsoft’s most critical customers? We’re looking for a Site Reliability Engineer with the right mix of software development, on-line services experience and passion for quality to envision, design, and deliver Office 365 government cloud service offerings. Office 365 is at the...


  • Redmond, United States Amazon Full time

    Description Project Kuiper is an initiative to increase global broadband access through a constellation of 3,236 satellites in low Earth orbit (LEO). Its mission is to bring fast, affordable broadband to unserved and underserved communities around the world. Project Kuiper will help close the digital divide by delivering fast, affordable broadband to a wide...


  • Redmond, United States Microsoft Full time

    Microsoft has an exciting opportunity for a Senior Site Reliability Engineer in the Cloud+ArtificialIntelligence (C+AI) Silver SQL Team. This team is responsible fordeploying and operatingthe Azure SQL family of services within Azure Government clouds. In this role, you will have the opportunity to work with engineers who enable a broad set of Azure...


  • Redmond, United States Microsoft Full time

    Do you have a passion for high scale services and working with some of Microsoft’s most critical customers? We’re looking for a Site Reliability Engineer II with the right mix of software development, on-line services experience and passion for quality to envision, design, and deliver Office 365 government cloud service offerings.  Office 365 is at the...


  • Redmond, United States Microsoft Corporation Full time

    Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified...


  • Redmond, Washington, United States PRIME RAYS INC Full time

    About the JobPRIME RAYS INC is seeking a Senior Data Engineering Manager to lead our data engineering team. This is a senior-level role that requires a strong technical background and experience in managing a team of engineers.The successful candidate will be responsible for designing and implementing robust data architectures, developing scalable data...


  • Redmond, United States Microsoft Corporation Full time

    Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified...


  • Redmond, WA, United States Microsoft Full time

    Cloud Operations + Innovation (CO+I) is the team behind one of the world’s largest cloud infrastructures, responsible for powering all Microsoft online Products and Services as well as powering Microsoft’s “Cloud First” mission. Our Global Project Controls team supports the delivery of CO+I’s mission and vision as the trusted advisors that deliver...


  • Redmond, WA, United States Microsoft Full time

    Do you have a passion for high scale services and working with some of Microsoft’s most critical customers? We’re looking for a Site Reliability Engineer II with the right mix of software development, on-line services experience and passion for quality to envision, design, and deliver Office 365 government cloud service offerings.  Office 365 is at the...


  • Redmond, Washington, United States Top Secret Clearance Jobs Full time

    About the JobTop Secret Clearance Jobs is dedicated to helping those with the most exclusive security clearance find their next career opportunity and get interviews within 48 hours. The company requires a Principal Site Reliability Engineering Manager to lead the delivery of critical features in Office 365 government cloud offerings.Job...