Site Reliability Engineer
4 weeks ago
As aSite Reliability Engineer (SRE), youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background investigations. As a member of this team, you will work onsite at JBAB (Joint Base Anacostia-Bolling) 3 days per week and remotely 2 days.
What youll do:
- Monitor platform and containerized applications.
- Identify performance and availability risks and issues.
- Work on the core platform to create and optimize all functions needed to establish a strong platform infrastructure.
- Collaborate with the team and the customer daily
What youll need to succeed:
- Minimum of 8 years of software development experience with a minimum of 2 years with Kubernetes and strong understanding of SRE principles for highly scalable and reliable systems.
- Experience implementing proactive alert / monitoring workflows and dashboards based on Kubernetes metrics, logs, and traces using Prometheus, Grafana, Loki, Splunk, or similar technologies.
- Working knowledge of industry best practices with regards to information security.
- Knowledge of clustering, high-availability, replication, and disaster recovery techniques.
- Possess a bachelor's degree and an active TS//SCI clearance (T5 or T5R required).
- Experience working in a DevSecOps environment and with Source Code repositories and CI/CD pipeline solutions such as GitLab, Azure DevOps, GitHub etc.
- Experience with Infrastructure as Code (IaC), containerization, K8, and CI/CD Automation.
- Experience with container orchestration tools (Rancher/RKE2, OpenShift, etc.)
- Ability to work well on a team as well as individually.
- Ability to work in downtown Washington, DC on client site at least 3 days per week.
Nice to haves:
- Passion for learning new development concepts, methodologies, and technologies
- Experience hardening and securing containers
- Previous experience with commercial cloud (e.g. AWS, Azure)
- Can establish and maintain a high level of client trust and confidence with your software development skills
- Can think out of the box to help with troubleshooting issues and providing innovative solutions that fit customers needs
-
Washington, United States ALTA IT Services Full timeSite Reliability EngineerWashington, DC – 100% ONSITEActive TS/SCI clearance is required to start As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. What you’ll do:• Monitor platform and containerized...
-
REMOTE - Site Reliability Engineer
7 days ago
Washington, United States Harbor Compliance Full timeSite Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...
-
REMOTE - Site Reliability Engineer
1 week ago
Washington, United States Harbor Compliance Full timeSite Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...
-
Lead Site Reliability Engineer
1 week ago
Washington, United States Mount Indie Full timeMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling. Responsibilities: • Design and implement end-to-end CI/CD pipelines. • Employ extensive...
-
REMOTE - Site Reliability Engineer
1 week ago
Washington, United States Harbor Compliance Full timeJob DescriptionJob DescriptionSite Reliability Engineer - Full-time RemoteAdvance Your Career with Cutting-Edge Infrastructure at Harbor ComplianceLocation: Full-time Remote (Excluding CA, CO, MT, NY)About Harbor Compliance:Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology...
-
Expert Site Reliability Engineer
17 hours ago
Washington, United States Allscripts Full timeWelcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today’s healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...
-
Lead Site Reliability Engineer
2 weeks ago
Washington, United States Mount Indie Full timeJob DescriptionJob DescriptionMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling.Responsibilities:Design and implement end-to-end CI/CD...
-
Site Reliability Engineer
3 days ago
Fort Washington, United States JR Technologies Full timeAt JR Technologies, our vision is to create the new customer-centric distribution landscape of tomorrow. Working with us offers many opportunities to experienced professionals who are interested in joining a strong team, learning and mentoring in a dynamic environment, honing professional and technical abilities, and who thrive on new challenges. We provide...
-
Washington, United States OMW Consulting Full timeSite Reliability Engineer Salary $140k-$200k + Equity Secret Clearance or higher is required My client, a VC-backed organization in the defense tech space, is looking to hire multiple SREs as they build out their DevOps team across the USA. My client has created a modern product which is streamlining processes and saving time in critical areas for the DOD....
-
Site Reliability Engineering
3 weeks ago
Washington, United States ALTA IT Services Full timeSite Reliability Engineering (SRE) Lead100% RemoteUS Citizenship required per government contract Must be able to obtain a DHS Public Trust clearance As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end...
-
Senior Site Reliability Engineer
3 weeks ago
Washington, United States Sparibis Full timeLocation: 100% remote Years' Experience: 10+ Year's of experience Education: Bachelor's degree Work Authorization: United States Citizenship is required as part of the eligibility criteria to be able to obtain a security clearance. Clearance: Applicants must be able to obtain and maintain a Public Trust security clearance. Key Skills: Must experience...
-
Lead Azure Site Reliability Engineer
1 month ago
Washington, United States Mechanicode.io Full timeWe are looking for a Lead Azure Site Reliability Engineer (SRE) to enable efficient monitoring and observability of the CDC Azure infrastructure and and applications. The SRE will lead operations of the cloud environment with observability, IAC, and cloud-native best practices. The engineer will be part of a larger effort to modernize the CDC DevOps...
-
Reliability Engineer
1 day ago
Washington, United States KMS Solutions Full timeReliability Engineer KMS Solutions, LLC is a technical management/solutions company that specializes in engineering, analysis, and cyber security. Founded in 2005, KMS is a certified small business with over a decade and a half of experience supporting the Department of Defense as well as many other departments and programs critical to our Nations security...
-
Site Reliability Engineer
2 weeks ago
Washington, United States Palantir Technologies Full timeSite Reliability Engineer - Security Infrastructure Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role Our products support...
-
Sr. Site Reliability Engineer
4 days ago
Washington, United States Marriott Full timeJob Number 24059351 Job Category Information Technology Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States Schedule Full-Time Located Remotely? Y Relocation? N Position Type Management JOB SUMMARY Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of...
-
Site Reliability Engineering
21 hours ago
Washington, United States MetroStar Full timeAs a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end continuous delivery pipelines and experience in AI/ML. You will also use your experience working closely with developers and other engineers to...
-
Maintenance and Reliability Engineer
24 hours ago
Washington, United States Jacobs Full timeYour Impact: Challenging Today. Reinventing Tomorrow. We're invested in you and your success. Everything we do is more than just a project. It's our challenge as human beings, too. That's why we bring a thoughtful and collaborative approach to every one of our partnerships. At Jacobs, we challenge the status quo and redefine how to solve the world's...
-
Site Reliability Engineer
4 days ago
Washington, United States Knewin Full timeA World-Changing Company Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role Our products support some of the most important...
-
Microsoft Intune Engineer
2 weeks ago
Washington, Washington, D.C., United States SAIC Career Site Full timeDescription SAIC is seeking a Microsoft Intune Engineer. The position will support a large federal government agency and their mobile environment which includes iOS, Android, and Windows operating systems. The position will have a hybrid telework arrangement with on-site presence one day a week, at a secure government facility. Work schedule will be Monday...
-
Engineer - Pepco Reliability
2 weeks ago
Washington, DC, United States Exelon Full timeDescriptionWe're powering a cleaner, brighter future.Exelon is leading the energy transformation, and we're calling all problem solvers, innovators, community builders and change makers. Work with us to deliver solutions that make our diverse cities and communities stronger, healthier and more resilient.We're powered by purpose-driven people like you who...