Site Reliability Engineer II

15 hours ago


Redmond, Washington, United States Microsoft Corporation Full time
Job Title: Site Reliability Engineer II

Job Summary:
Microsoft's Cloud Operations & Innovation (CO&I) group is seeking a skilled Site Reliability Engineer II to support the Commissioning (Cx) Automation and Global Cx teams in deploying, monitoring, and troubleshooting a distributed test platform. The platform is globally deployed and consists of client and cloud-based applications, custom hardware, wired / wireless networks, and sensor networks that automate the measurement and validation of hardware and electrical components and interconnected systems within large datacenters.

Responsibilities:
Configure, monitor, and support the test platform used by the Global Commissioning Team
Establish and maintain the Cx Automation lab as the environment for training and testing new applications
Perform technical evaluation of new devices and test instruments
Lead projects in the lab to add or update test automation or device simulation capabilities
Establish and oversee the Incident Management processes for the team
Develop an understanding of features and operation of all software products and test equipment
Participate in on-call rotations and alert product teams to major customer-impacting issues
Analyze telemetry data to identify opportunities to improve the reliability and performance of the platform
Leverage and contribute to troubleshooting tools for common problems
Evaluate and test new applications and test equipment prior to global deployments
Develop reporting on quality of service and on usage of the application and test instruments
Troubleshoot and repair test devices or network equipment returned from the field
Develop code or scripts that reduce the setup and overall testing time

Qualifications:
7+ years relevant technical engineering experience
OR Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 3+ years technical engineering experience
OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 2+ years technical engineering experience

Additional Requirements:
Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

About Microsoft:
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day, we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Benefits:
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://www.microsoft.com/en-us/careers/benefits
