Site Reliability Engineering

4 weeks ago


Washington DC United States ALTA IT Services Full time
Site Reliability Engineering (SRE) Lead
100% Remote
US Citizenship required per government contract
Must be able to obtain a DHS Public Trust clearance
As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users.
As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end continuous delivery pipelines and your experience in AI/ML.
You will also use your experience working closely with developers and other engineers to identify and resolve issues that may impact website or service availability, with the goal of making an impact across the federal government.
What you’ll do:
• Design and implement end-to-end continuous delivery pipelines.
• Leverage extensive AWS cloud experience in a production environment (e.g., network, security, deployment, automation, serverless technologies).
• Utilize a deep understanding of SRE principles for highly scalable and reliable systems.
• Leverage extensive experience with Configuration Management and Infrastructure as Code.
• Serve as a thought leader for agile development teams.
• Establish clarity of direction and a shared vision of success that is championed by team members, stakeholders, and product owners.
• Build relationships, and work in collaboration with team members, stakeholders, product owners, and technical team leads.
• Help enhance processes, communication, and delivery through new norms that improve how work is done, from discovery to delivery.
What you’ll need to succeed:
• A minimum of ten (10) years of software engineering and DevOps experience.
• Experience in designing and implementing end-to-end continuous delivery pipelines.
• Deep AWS cloud experience in a production environment (e.g., network, security, deployment, automation, serverless technologies).
• Experience with and understanding of SRE principles for highly scalable and reliable systems.
• Strong experience with Configuration Management and Infrastructure as Code.

  • Washington DC, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers. Responsibilities: As a Site Reliability Engineer...


  • Washington DC, United States Mount Indie Full time

    Job Description: As a Site Reliability Engineer (SRE), you'll continuously drive improvements in observability, performance, and reliability, with the goal of making an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington DC, United States Cinder Full time

    Site Reliability Engineer at Cinder (United States). Full Time, Remote Work, Stock Options. Cinder provides a cutting-edge investigation platform to protect the internet. Companies rely on our software to investigate and disrupt threats like hate groups, terrorist organizations, and state-sponsored...


  • Washington DC, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote. Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance. About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to grow, we seek a Site Reliability Engineer who is...


  • Washington DC, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal...


  • Washington, United States Talent Discovery Pros Full time

    Job Description. Title: Site Reliability Engineer. Location: Washington DC. Clearance required: TS/SCI. Work authorization: US Citizen. As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal...


  • Washington, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers. Responsibilities: As a Site Reliability Engineer...


  • Washington DC, United States Palantir Technologies Full time

    Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. We’re looking for Site Reliability Engineers who can help us build, operate,...


  • Washington DC, United States Palantir Technologies Full time

    A World-Changing Company: Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role: We’re looking for Site Reliability...


  • Washington DC, United States Mechanicode.io Full time

    We are looking for a Lead Azure Site Reliability Engineer (SRE) to enable efficient monitoring and observability of the CDC Azure infrastructure and applications. The SRE will lead operations of the cloud environment with observability, IaC, and cloud-native best practices. The engineer will be part of a larger effort to modernize the CDC DevOps...


  • Washington DC, United States Mission Box Solutions Full time

    As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. Our client firmly believes that exceptional technology services are built upon exceptional individuals. Monitor platform and containerized...

  • Site Reliability

    4 days ago


    Washington DC, United States Canonical - Jobs Full time

    Job Description: This role is an opportunity for a hands-on, but literally hands-off, technologist with a passion for Linux to build a career with Canonical and drive success for those leveraging Ubuntu and open source products. If you have experience with IT operations automation, Infrastructure as Code and a passion for technology, then...


  • Washington, United States Mount Indie Full time

    Job Description: As a Site Reliability Engineer (SRE), you'll continuously drive improvements in observability, performance, and reliability, with the goal of making an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington, United States ALTA IT Services Full time

    Site Reliability Engineer. Washington, DC - 100% ONSITE. Active TS/SCI clearance is required to start. As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal of making an impact across the federal government. What you’ll do: • Monitor platform and containerized...


  • Washington, United States Alta It Services Full time

    Site Reliability Engineer. Washington, DC - 100% ONSITE. Active TS/SCI clearance is required to start. As a Site Reliability Engineer (SRE), you'll continuously drive improvements in observability, performance, and reliability, with the goal of making an impact across the federal government. What you'll do: Monitor platform and containerized applications. Identify...


  • Washington, United States Mount Indie Full time

    Job Description: As a Site Reliability Engineer (SRE), you'll continuously drive improvements in observability, performance, and reliability, with the goal of making an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote. Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance. About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to grow, we seek a Site Reliability Engineer who is...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote. Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance. Location: Full-time Remote (Excluding CA, CO, MT, NY). About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States Mount Indie Full time

    Mount Indie is searching for a Lead Site Reliability Engineer (SRE) to work remotely, focusing on delivering mission-critical services that empower end users. The role will involve designing and implementing end-to-end CI/CD pipelines using AI/ML tooling. Responsibilities: • Design and implement end-to-end CI/CD pipelines. • Employ extensive...