Critical Infrastructure Reliability Specialist, Cloud Performance Optimization

6 days ago


Herndon, Virginia, United States Amazon Full time

Amazon Web Services (AWS) is a world-leading cloud platform that offers a comprehensive suite of products and services. As an Infrastructure Reliability Engineer, you will play a vital role in ensuring the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers.

The ideal candidate will have a strong background in reliability engineering, with experience in failure analysis activities and root cause analysis. Additionally, they should be familiar with accelerated life testing, stress analysis, and finite element analysis. A Ph.D. in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering, or a related field, is also preferred.

AWS values diverse experiences and encourages candidates from all backgrounds to apply. If your career has taken a non-traditional path, don't let it stop you from applying. We strive for flexibility as part of our working culture and aim to achieve work-life harmony. Our employee-led affinity groups foster a culture of inclusion, empowering us to be proud of our differences.

The successful candidate will proactively drive reliability risk identification, assessment, and mitigation for datacenter infrastructure and security equipment. They will also lead root cause analysis of critical equipment failures and drive continuous improvements to datacenter availability and security for AWS customers. The estimated annual salary for this position is $175,000-$225,000, depending on location and experience.

A key aspect of this role is working closely with internal and external partners, including suppliers, to drive product specification, risk identification planning, and execution. The ideal candidate will be able to influence development teams, procurement, and external partners. Strong communication skills are essential, as is the ability to manage multiple qualification activities and development schedules.

Becoming a Critical Infrastructure Reliability Specialist, Cloud Performance Optimization requires meeting the requirements of Amazon's leadership principles for this role. Amazon is committed to a diverse and inclusive workplace and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.



  • Herndon, Virginia, United States Amazon Full time

    About the Job: We're seeking a highly skilled Cloud Infrastructure Reliability Expert to join our team at Amazon. In this role, you will be responsible for driving reliability risk identification, assessment, and mitigation for data center infrastructure equipment. Key Responsibilities: Proactively identify and assess reliability risks in data center...


  • Herndon, Virginia, United States ATPCO Full time

    Job Description: We are seeking a highly skilled Senior Platform Engineer to lead the development and optimization of our AWS cloud infrastructure. As a key member of our platform team, you will leverage your expertise in cloud technologies to drive architectural decisions, automate infrastructure, and implement cutting-edge solutions to improve platform...

  • AWS Cloud Engineer

    6 days ago


    Herndon, Virginia, United States Insight Global Full time

    About the Role: We are seeking a highly skilled AWS Cloud Engineer to join our team. As a cloud infrastructure specialist, you will be responsible for maintaining and optimizing our cloud-based infrastructure. Key Responsibilities: Maintain and troubleshoot cloud-based infrastructure, including servers, databases, and applications. Design, implement, and manage...


  • Herndon, Virginia, United States The Swift Group Full time

    Overview: The Swift Group, a forward-thinking organization, is seeking an experienced Cloud Infrastructure Automation Engineer to drive the development and implementation of modern infrastructure solutions. This role plays a critical part in advancing our cloud capabilities and ensuring the security, efficiency, and scalability of our applications and...


  • Herndon, Virginia, United States Fortinet Full time

    About the Role: We are seeking an experienced Senior Site Reliability Engineer to spearhead the development and expansion of our FortiSASE OpenStack infrastructure. This role demands deep expertise in both Networking and SRE practices, with a strong focus on automation and infrastructure as code (Ansible/Terraform). If you're a seasoned professional who...


  • Herndon, Virginia, United States ShorePoint Full time

    Cloud Reliability Specialist: ShorePoint is a renowned cybersecurity services firm with a strong focus on protecting high-profile, high-threat clients. As a Cloud Reliability Specialist, you will be part of our dynamic team, contributing to the growth and development of our company. Key Responsibilities:


  • Herndon, Virginia, United States Navitas Full time

    Job Title: Cloud Infrastructure Specialist with Azure Expertise. At Navitas, we are seeking a skilled Cloud Infrastructure Specialist with expertise in Azure to join our team. This role will be responsible for designing, deploying, and managing cloud-based systems and applications. Responsibilities: Design and implement scalable cloud infrastructure solutions...


  • Herndon, Virginia, United States Amazon Full time

    Overview: Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating; that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. About the Role: We are seeking a...


  • Herndon, Virginia, United States KDA Consulting Inc Full time

    Job Summary: We are seeking an experienced Cloud/Infrastructure Engineer to join our team at KDA Consulting Inc in Herndon, VA. This is a challenging role that requires strong technical expertise and exceptional problem-solving skills. As a Cloud/Infrastructure Engineer, you will be responsible for designing, implementing, and managing secure and scalable cloud...


  • Herndon, Virginia, United States Amazon Full time

    About the Role: We are seeking a highly skilled and experienced Senior Datacenter Reliability Specialist to join our team at Amazon Web Services (AWS). This role will be responsible for proactively identifying, assessing, and mitigating reliability risks in datacenter infrastructure equipment. Responsibilities: Drive the reliability risk identification,...


  • Herndon, Virginia, United States NANA Regional Corp Full time

    About NANA Regional Corp: NANA Regional Corp is an Alaska Native Corporation with a rich history of delivering IT solutions to the US government. As a trusted partner, we provide cutting-edge technology services to implement and evolve IT infrastructures. We are seeking a highly skilled Cloud Infrastructure Deployment Specialist to join our team. The ideal...


  • Herndon, Virginia, United States Peraton Full time

    Cloud Engineering Lead: Peraton is seeking a lead cloud engineer to join our team of qualified, diverse individuals. This position will be working 100% remote and will focus on cloud operations and engineering. Responsibilities: Manage the prioritization of ServiceNow requests, changes, and Jira tasks for provisioning, modifying, troubleshooting, and...


  • Herndon, Virginia, United States Smart Synergies Full time

    Cloud Platforms: AWS, Azure, Google Cloud Platform (GCP). Key Skills: • Infrastructure as Code (IaC): Terraform, CloudFormation • Containerization: Docker, Kubernetes • Continuous Integration/Continuous Deployment (CI/CD): Jenkins, GitLab CI/CD, CircleCI • Configuration Management: Ansible, Puppet, Chef. Advanced Skills: • Monitoring and Logging:...


  • Herndon, Virginia, United States The Swift Group Full time

    Company Overview: The Swift Group is a leading provider of innovative solutions, dedicated to advancing infrastructure capabilities and ensuring the security and efficiency of applications and systems. Salary: The estimated salary for this role is $141,500.40 per year, based on industry standards and the location in Herndon, VA. Job Description: This Senior...


  • Herndon, Virginia, United States Amazon Full time

    About the Role: We're seeking a talented Senior Cloud Infrastructure Engineer to join our team at AWS Global Infrastructure Services. As a key member of our Network Capacity Services team, you'll be responsible for designing, building, and operating the network infrastructure that underpins our cloud services. Your primary focus will be on developing...


  • Herndon, Virginia, United States Knowmadics, Inc Full time

    Job Overview: We are seeking an experienced Cloud Systems Integration Specialist to join our team at Knowmadics, Inc. This is a unique opportunity for a skilled professional to work with cutting-edge machine learning and geospatial data systems. Job Description: This individual contributor role will involve hands-on work to integrate reliable, secure...


  • Herndon, Virginia, United States Red Rock Government Services Full time

    Job Title: Cloud Integration Specialist for AI and Elasticsearch. Company Overview: Red Rock Government Services is a leading software engineering company recognized for its exceptional support to the intelligence community. With a proven track record of delivering innovative and mission-critical solutions, Red Rock specializes in developing secure, scalable,...


  • Herndon, Virginia, United States Amazon Development Center U.S., Inc. Full time

    About the Role: We are seeking an experienced Cloud Infrastructure Leadership Manager to join our team at Amazon Development Center U.S., Inc. This role will be responsible for leading a high-impact systems development and operations team, accountable for the operational performance, customer experience, maintenance, security, and functional parity of Builder...


  • Herndon, Virginia, United States General Dynamics Information Technology Full time

    Job Description: We are seeking a highly skilled Cloud Engineer to join our team at General Dynamics Information Technology (GDIT). As a Cloud Engineer, you will play a critical role in securing our clients' missions and ensuring the success of our cloud-based initiatives. About the Role: You will work closely with our clients to recommend effective methods for...


  • Herndon, Virginia, United States CV Library Full time

    About L2T, LLC: L2T, LLC is a rapidly growing high-tech company based in Northern Virginia. We invest in our employees' growth and provide opportunities for leadership, training, conferences, and mentorship. Job Description: We are seeking a skilled Cloud Infrastructure Solutions Engineer to join our team. This role will focus on delivering stable and secure...