Site Reliability Engineer

4 weeks ago


Washington, United States Mount Indie Full time
Job DescriptionJob Description

As aSite Reliability Engineer (SRE), youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background investigations. As a member of this team, you will work onsite at JBAB (Joint Base Anacostia-Bolling) 3 days per week and remotely 2 days.


What youll do:

  • Monitor platform and containerized applications.
  • Identify performance and availability risks and issues.
  • Work on the core platform to create and optimize all functions needed to establish a strong platform infrastructure.
  • Collaborate with the team and the customer daily


What youll need to succeed:

  • Minimum of 8 years of software development experience with a minimum of 2 years with Kubernetes and strong understanding of SRE principles for highly scalable and reliable systems.
  • Experience implementing proactive alert / monitoring workflows and dashboards based on Kubernetes metrics, logs, and traces using Prometheus, Grafana, Loki, Splunk, or similar technologies.
  • Working knowledge of industry best practices with regards to information security.
  • Knowledge of clustering, high-availability, replication, and disaster recovery techniques.
  • Possess a bachelor's degree and an active TS//SCI clearance (T5 or T5R required).
  • Experience working in a DevSecOps environment and with Source Code repositories and CI/CD pipeline solutions such as GitLab, Azure DevOps, GitHub etc.
  • Experience with Infrastructure as Code (IaC), containerization, K8, and CI/CD Automation.
  • Experience with container orchestration tools (Rancher/RKE2, OpenShift, etc.)
  • Ability to work well on a team as well as individually.
  • Ability to work in downtown Washington, DC on client site at least 3 days per week.


Nice to haves:

  • Passion for learning new development concepts, methodologies, and technologies
  • Experience hardening and securing containers
  • Previous experience with commercial cloud (e.g. AWS, Azure)
  • Can establish and maintain a high level of client trust and confidence with your software development skills
  • Can think out of the box to help with troubleshooting issues and providing innovative solutions that fit customers needs





  • Washington, United States ALTA IT Services Full time

    Site Reliability EngineerWashington, DC – 100% ONSITEActive TS/SCI clearance is required to start As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. What you’ll do:• Monitor platform and containerized...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States Mount Indie Full time

    Mount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling. Responsibilities: • Design and implement end-to-end CI/CD pipelines. • Employ extensive...


  • Washington, United States Harbor Compliance Full time

    Job DescriptionJob DescriptionSite Reliability Engineer - Full-time RemoteAdvance Your Career with Cutting-Edge Infrastructure at Harbor ComplianceLocation: Full-time Remote (Excluding CA, CO, MT, NY)About Harbor Compliance:Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology...


  • Washington, United States Allscripts Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today’s healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Washington, United States Mount Indie Full time

    Job DescriptionJob DescriptionMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling.Responsibilities:Design and implement end-to-end CI/CD...


  • Fort Washington, United States JR Technologies Full time

    At JR Technologies, our vision is to create the new customer-centric distribution landscape of tomorrow. Working with us offers many opportunities to experienced professionals who are interested in joining a strong team, learning and mentoring in a dynamic environment, honing professional and technical abilities, and who thrive on new challenges. We provide...


  • Washington, United States OMW Consulting Full time

    Site Reliability Engineer Salary $140k-$200k + Equity Secret Clearance or higher is required My client, a VC-backed organization in the defense tech space, is looking to hire multiple SREs as they build out their DevOps team across the USA. My client has created a modern product which is streamlining processes and saving time in critical areas for the DOD....


  • Washington, United States ALTA IT Services Full time

    Site Reliability Engineering (SRE) Lead100% RemoteUS Citizenship required per government contract Must be able to obtain a DHS Public Trust clearance As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end...


  • Washington, United States Sparibis Full time

    Location: 100% remote Years' Experience: 10+ Year's of experience Education: Bachelor's degree Work Authorization: United States Citizenship is required as part of the eligibility criteria to be able to obtain a security clearance. Clearance: Applicants must be able to obtain and maintain a Public Trust security clearance. Key Skills: Must experience...


  • Washington, United States Mechanicode.io Full time

    We are looking for a Lead Azure Site Reliability Engineer (SRE) to enable efficient monitoring and observability of the CDC Azure infrastructure and and applications. The SRE will lead operations of the cloud environment with observability, IAC, and cloud-native best practices. The engineer will be part of a larger effort to modernize the CDC DevOps...


  • Washington, United States KMS Solutions Full time

    Reliability Engineer KMS Solutions, LLC is a technical management/solutions company that specializes in engineering, analysis, and cyber security. Founded in 2005, KMS is a certified small business with over a decade and a half of experience supporting the Department of Defense as well as many other departments and programs critical to our Nations security...


  • Washington, United States Palantir Technologies Full time

    Site Reliability Engineer - Security Infrastructure Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role Our products support...


  • Washington, United States Marriott Full time

    Job Number 24059351 Job Category Information Technology Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States Schedule Full-Time Located Remotely? Y Relocation? N Position Type Management JOB SUMMARY Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of...


  • Washington, United States MetroStar Full time

     As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end continuous delivery pipelines and experience in AI/ML. You will also use your experience working closely with developers and other engineers to...


  • Washington, United States Jacobs Full time

    Your Impact: Challenging Today. Reinventing Tomorrow. We're invested in you and your success. Everything we do is more than just a project. It's our challenge as human beings, too. That's why we bring a thoughtful and collaborative approach to every one of our partnerships. At Jacobs, we challenge the status quo and redefine how to solve the world's...


  • Washington, United States Knewin Full time

    A World-Changing Company Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role Our products support some of the most important...


  • Washington, Washington, D.C., United States SAIC Career Site Full time

    Description SAIC is seeking a Microsoft Intune Engineer. The position will support a large federal government agency and their mobile environment which includes iOS, Android, and Windows operating systems. The position will have a hybrid telework arrangement with on-site presence one day a week, at a secure government facility. Work schedule will be Monday...


  • Washington, DC, United States Exelon Full time

    DescriptionWe're powering a cleaner, brighter future.Exelon is leading the energy transformation, and we're calling all problem solvers, innovators, community builders and change makers. Work with us to deliver solutions that make our diverse cities and communities stronger, healthier and more resilient.We're powered by purpose-driven people like you who...