Site Reliability Engineer

1 week ago


Washington, Washington, D.C., United States Booz Allen Hamilton Full time
Job Title: Site Reliability Administrator, Senior

Job Summary:

We are seeking a highly skilled Site Reliability Administrator, Senior to join our team. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our back-end infrastructure. You will work closely with our development team to identify and resolve issues, and collaborate with other teams to implement new technologies and processes.

Key Responsibilities:

  • Manage and maintain our Red Hat Enterprise Linux (RHEL) systems in a multi-tenant enterprise network environment
  • Implement security and compliance frameworks, such as DoD STIGs and NIST 800-53, and remediation strategies in RHEL environments
  • Develop and maintain automation tools, such as Ansible and Red Hat Satellite, for streamlining operational tasks and maintaining compliance of systems
  • Design and implement CI/CD pipelines using DevSecOps principles in Kubernetes environments
  • Support weekly maintenance and respond to outages on a rotational basis, ensuring minimal disruption in multi-tenant operations in accordance with Operations SLAs
  • Hold a TS/SCI clearance
  • Hold a HS diploma or GED
  • Hold Linux+, Linux Foundations, Red Hat Certified Systems Administrator (RHCSA), or Red Hat Certified Engineer (RHCE) Certification
  • Ability to obtain DoD 8570 or 8140 for IAT2 Level Certification, such as Security+ CE within 60 days of start date

Preferred Qualifications:

  • Experience with Kubernetes orchestrations, scaling, and troubleshooting in multi-tenant environments
  • Experience managing and configuring Kubernetes environments, including tools, such as Rancher, Helm, and Harbor
  • Experience with Agile development methodologies and working on Agile teams using collaboration tools, including Jira, Confluence, or Discourse
  • Experience managing and deploying applications in an air-gapped environment
  • Experience working with CM tools, including GitLab, Git-centric, and CI/CD workflows
  • Knowledge of container networking and security best practices within Kubernetes
  • Bachelor's degree in an IT related field
  • Kubernetes Certifications such as Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD) Certification

Clearance:

Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to classified information; TS/SCI clearance is required.

Compensation:

At Booz Allen, we celebrate your contributions, provide you with opportunities and choices, and support your total well-being. Our offerings include health, life, disability, financial, and retirement benefits, as well as paid leave, professional development, tuition assistance, work-life programs, and dependent care. Our recognition awards program acknowledges employees for exceptional performance and superior demonstration of our values. Full-time and part-time employees working at least 20 hours a week on a regular basis are eligible to participate in Booz Allen's benefit programs. Individuals that do not meet the threshold are only eligible for select offerings, not inclusive of health benefits. We encourage you to learn more about our total benefits by visiting the Resource page on our Careers site and reviewing Our Employee Benefits page.

Salary at Booz Allen is determined by various factors, including but not limited to location, the individual's particular combination of education, knowledge, skills, competencies, and experience, as well as contract-specific affordability and organizational requirements. The projected compensation range for this position is $84,600.00 to $193, annualized USD. The estimate displayed represents the typical salary range for this position and is just one component of Booz Allen's total compensation package for employees. This posting will close within 90 days from the Posting Date.

Identity Statement:

As part of the application process, you are expected to be on camera during interviews and assessments. We reserve the right to take your picture to verify your identity and prevent fraud.

Work Model:

Our people-first culture prioritizes the benefits of flexibility and collaboration, whether that happens in person or remotely.

  • If this position is listed as remote or hybrid, you'll periodically work from a Booz Allen or client site facility.
  • If this position is listed as onsite, you'll work with colleagues and clients in person, as needed for the specific role.

EEO Commitment:

We're an equal employment opportunity/affirmative action employer that empowers our people to fearlessly drive change - no matter their race, color, ethnicity, religion, sex (including pregnancy, childbirth, lactation, or related medical conditions), national origin, ancestry, age, marital status, sexual orientation, gender identity and expression, disability, veteran status, military or uniformed service member status, genetic information, or any other status protected by applicable federal, state, local, or international law.



  • Washington, Washington, D.C., United States MetroStar Corporation Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at MetroStar Corporation. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems.Key Responsibilities:Monitor and analyze platform and containerized applications to...


  • Washington, Washington, D.C., United States MetroStar Systems Full time

    Job Title: Site Reliability EngineerAt MetroStar Systems, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Monitor and analyze system performance to identify areas...


  • Washington, Washington, D.C., United States Veterans Enterprise Technology Solutions Full time

    Job Title: Site Reliability EngineerOverview:Veterans Enterprise Technology Solutions is seeking a highly skilled Site Reliability Engineer to join our team. This role will be responsible for ensuring the reliability and performance of our cloud-based infrastructure. The ideal candidate will have a strong understanding of SRE principles and experience with...


  • Washington, Washington, D.C., United States Varada Consulting, LLC Full time

    Job Title: Site Reliability EngineerVarada Consulting, LLC is seeking a highly skilled and experienced Site Reliability Engineer to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and infrastructure improvements.Key...


  • Washington, Washington, D.C., United States MetroStar Systems Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at MetroStar Systems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems.Key Responsibilities:Monitor and analyze platform and containerized applications to identify...


  • Washington, Washington, D.C., United States Alldus Full time

    Site Reliability EngineerAlldus is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems.Key Responsibilities:Perform root cause analysis to identify and resolve system or application issues in a timely and...


  • Washington, Washington, D.C., United States Tik Tok Full time

    About the RoleTikTok is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our software systems.ResponsibilitiesWork with infrastructure, product, and platform engineering teams to operate and deploy software platforms, capacity planning,...


  • Washington, Washington, D.C., United States CloudFit Software Full time

    Job Title: Site Reliability EngineerCloudFit Software is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the quality, performance, and reliability of our CloudFit Managed Applications and Services systems.Key Responsibilities:Collaborate with cross-functional teams...


  • Washington, Washington, D.C., United States Cinder LLC Full time

    About Cinder LLCCinder LLC provides a cutting-edge investigation platform to protect the internet.Our software helps Trust and Safety teams at the world's most influential companies innovate and adapt quickly to emerging threats.Job Title: Site Reliability EngineerWe're seeking an experienced Site Reliability Engineer to lead the development and deployment...


  • Washington, Washington, D.C., United States Microsoft Full time

    Job Title: Site Reliability Engineer IIMicrosoft is seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for designing, developing, and delivering software engineering solutions to serve and protect O365 government clouds.Key Responsibilities:Design, develop, and deploy software...


  • Washington, Washington, D.C., United States Palantir Technologies Full time

    {"title": "Site Reliability Engineer", "description": "Job SummaryPalantir Technologies is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesCollaborate with cross-functional teams...


  • Washington, Washington, D.C., United States Palantir Technologies Full time

    About the RoleWe're looking for a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key ResponsibilitiesMaintain the availability of cloud and physical Linux servers that power...


  • Washington, Washington, D.C., United States MetroStar Systems Full time

    Transforming Government Services with Reliability and PerformanceAs a Site Reliability Engineer at MetroStar Systems, you will play a pivotal role in driving improvements in observability, performance, and reliability across high-level government platforms. Your expertise will be instrumental in making a lasting impact.Key Responsibilities:Monitor and...


  • Washington, Washington, D.C., United States MetroStar Corporation Full time

    MetroStar Corporation is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our organization, you will play a critical role in driving improvements in observability, performance, and reliability across our systems.**Key Responsibilities:*** Monitor and analyze platform and containerized applications to identify...


  • Washington, Washington, D.C., United States MetroStar Systems Full time

    Transforming Government Services with Reliability and PerformanceAs a Site Reliability Engineer at MetroStar Systems, you will play a pivotal role in driving improvements in observability, performance, and reliability across high-level government platforms. Your expertise will be instrumental in making a lasting impact.Key Responsibilities:Monitor and...


  • Washington, Washington, D.C., United States DataRobot Full time

    Job Title: Director of Site Reliability Engineering Job Summary: DataRobot is seeking a highly skilled and experienced Director of Site Reliability Engineering to lead our SRE team. As a key member of our engineering organization, you will be responsible for ensuring the reliability, scalability, and performance of our platform. Key Responsibilities: *...


  • Washington, Washington, D.C., United States Oracle Full time

    Job DescriptionOracle Health Applications & Infrastructure (OHAI) is seeking a highly skilled Site Reliability Engineer to join its OHAI Platform & Production Engineering organization.This is a unique opportunity to work on a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment.As a Site...


  • Washington, Washington, D.C., United States Kansas Action for Children Full time

    Transforming System ReliabilityWe're seeking a seasoned Principal Site Reliability Engineer to spearhead the improvement of system reliability and resilience at T-Mobile USA, Inc. in Overland Park, Kansas, United States.About the RoleAs a key member of our team, you'll apply your expertise to minimize manual effort and prevent operational incidents. Your...


  • Washington, Washington, D.C., United States Palantir Technologies Full time

    About the RoleWe're seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in building, operating, and maintaining high-performance, scalable, and reliable services for our production infrastructure.Key ResponsibilitiesMaintain the availability of cloud and physical...


  • Washington, Washington, D.C., United States Palantir Technologies Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesCollaborate with cross-functional teams to design, implement, and maintain...