Lead Azure Site Reliability Engineer
3 weeks ago
We are looking for a Lead Azure Site Reliability Engineer (SRE) to enable efficient monitoring and observability of the CDC Azure infrastructure and and applications.
The SRE will lead operations of the cloud environment with observability, IAC, and cloud-native best practices.
The engineer will be part of a larger effort to modernize the CDC DevOps enterprise framework by joining the team of 20 which is comprised of data scientists, software engineers, product owners, and DevOps engineers.
Mechanicode
is a remote-first company, and this role will be 100% remote.
W2-Salary : 140-160k
Required
Must be a U.S citizen or green-card holder
8+ years of professional experience
Proven leadership track record
Ability to
pass a background check and obtain a public trust security clearance
Essential Skills, Experience, and Competencies :
Proficient with Observability in the cloud, building monitoring & alerting frameworks (grafana, datadog, newrelic etc.)
Has built alert escalation plans, disaster recovery infrastructure, and setup on-call rotations
Proficient with implementing cloud infrastructure on Azure.
Proficient with Terraform
Experience with Linux, and Bash scripting.
Experience with Kubernetes (AKS)
Substantial experience with programming languages like Python
Experience with containerization technologies (e.g.Docker, containerD)
Ability to develop the architecture for continuous integration and deployment as well as continuous monitoring
Experience supporting scalable and elastic applications on distributed architectures.
Strong ability and understanding of securing systems on the application, network, and infrastructure layers.
Experience managing network/compute/database infrastructure with infrastructure-as-code.
Expert in basic git actions like cloning, creating branches, navigating between branches, staging code for commit, committing code, resetting, and merging.
Ability to mentor & support junior members
Proven ability to work under pressure and in fast-paced environments.
Ability to operate and manage work, strategically reason, build relationships and influence others.
Nice to Have
Azure Certifications
Interview Steps
Preliminary Screen
CoderByte Assessment
Technical review
Client Review
Why Mechanicode?
Mechanicodes vision is to bring peace of mind with technology.
We do so by building self-healing cloud infrastructure, resilient enough to withstand failures and sufficiently predictable to resolve issues without human intervention.
We do that by having automation as the cornerstone of our cloud solutions, significantly improving workforce attrition, and introducing agile rapid development conventions that improve the developer's experience.
About Mechanicode
Mechanicode a Cloud Digital services firm providing comprehensive DevSecOps, Cloud Native Engineering, IT Modernization & Automation services.
Founded by a former USDS engineer, Mechanicode has 13 years of experience developing innovative automation solutions improving the feedback loop in the developer experience, and using AWS/Azure Certified best practices for clients.
Mechanicode has experience in both the public and private sectors, providing modernization services that engage Agile best practices, scalable cloud architectures, and continuous integration & deployment standards.
#J-18808-Ljbffr
-
Lead Azure Site Reliability Engineer
4 weeks ago
Washington, United States Mechanicode.io Full timeWe are looking for a Lead Azure Site Reliability Engineer (SRE) to enable efficient monitoring and observability of the CDC Azure infrastructure and and applications. The SRE will lead operations of the cloud environment with observability, IAC, and cloud-native best practices. The engineer will be part of a larger effort to modernize the CDC DevOps...
-
Washington, United States ALTA IT Services Full timeSite Reliability EngineerWashington, DC – 100% ONSITEActive TS/SCI clearance is required to start As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. What you’ll do:• Monitor platform and containerized...
-
Site Reliability Engineer
3 weeks ago
Washington, United States Mount Indie Full timeJob DescriptionJob DescriptionAs aSite Reliability Engineer (SRE), youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...
-
Lead Site Reliability Engineer
1 week ago
Washington, United States Mount Indie Full timeMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling. Responsibilities: • Design and implement end-to-end CI/CD pipelines. • Employ extensive...
-
Lead Site Reliability Engineer
3 days ago
Washington, United States Mount Indie Full timeMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling. Responsibilities: • Design and implement end-to-end CI/CD pipelines. • Employ extensive...
-
Azure DevOps Server Administrator
3 weeks ago
Washington, Washington, D.C., United States SAIC Career Site Full timeDescription SAIC is seeking a motivated, experienced individual to act as an integral part of a client's program. As a member of the engineering team, the Azure DevOps Server Administrator is a critical contributor to the team's mission. We specialize in leveraging Microsoft Azure DevOps to streamline our development processes and enhance collaboration...
-
REMOTE - Site Reliability Engineer
2 days ago
Washington, United States Harbor Compliance Full timeSite Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...
-
REMOTE - Site Reliability Engineer
15 hours ago
Washington, United States Harbor Compliance Full timeSite Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...
-
REMOTE - Site Reliability Engineer
2 days ago
Washington, United States Harbor Compliance Full timeJob DescriptionJob DescriptionSite Reliability Engineer - Full-time RemoteAdvance Your Career with Cutting-Edge Infrastructure at Harbor ComplianceLocation: Full-time Remote (Excluding CA, CO, MT, NY)About Harbor Compliance:Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology...
-
Lead Site Reliability Engineer
7 days ago
Washington, United States Mount Indie Full timeJob DescriptionJob DescriptionMount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling.Responsibilities:Design and implement end-to-end CI/CD...
-
Site Reliability Engineering
2 weeks ago
Washington, United States ALTA IT Services Full timeSite Reliability Engineering (SRE) Lead100% RemoteUS Citizenship required per government contract Must be able to obtain a DHS Public Trust clearance As a Site Reliability Engineering (SRE) Lead, you'll deliver mission-critical services that empower end users. As the ideal candidate, you'll use your extensive experience designing and implementing end-to-end...
-
Site Reliability Engineer
7 days ago
Washington, United States MetroStar Systems Full timeAs a Site Reliability Engineer (SRE), youll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. We know that you cant have great technology services without Reliability Engineer, Liability, Reliability, Engineer, Reliability, Manufacturing, Technology
-
Azure Cloud Engineer
7 days ago
Washington, United States SAIC Full timeDescription SAIChas an opening for a Cloud Engineer with experience designing and testing Azure Hyperconverged Infrastructure (HCI) to include integration with the Enterprise Azure Cloud Services (EACS). The Vanguard 2.2.1 program provides transparent, interconnected systems and security support for the Department of State (DOS) Bureau of Information...
-
Senior Site Reliability Engineer
2 weeks ago
Washington, United States Sparibis Full timeLocation: 100% remote Years' Experience: 10+ Year's of experience Education: Bachelor's degree Work Authorization: United States Citizenship is required as part of the eligibility criteria to be able to obtain a security clearance. Clearance: Applicants must be able to obtain and maintain a Public Trust security clearance. Key Skills: Must experience...
-
Azure DevOps Server Administrator
2 weeks ago
Washington, United States SAIC Full timeDescription SAIC is seeking a motivated, experienced individual to act as an integral part of a client's program. As a member of the engineering team, the Azure DevOps Server Administrator is a critical contributor to the team's mission. We specialize in leveraging Microsoft Azure DevOps to streamline our development processes and enhance collaboration...
-
Azure DevOps Engineer
3 days ago
Washington, United States Mindlance Full timePosition Summary:Title: DevOps and IT Security Engineer Premium IIIDuration: Long Term Location: Washington, DCHybrid Onsite : 4 days onsite per week from Day1The Senior Azure DevOps Engineer will work closely with development teams to automate and streamline our operations and processes, build and maintain tools for deployment, monitoring, and operations,...
-
Azure DevOps Engineer
2 days ago
Washington, United States Mindlance Full timePosition Summary:Title: DevOps and IT Security Engineer Premium IIIDuration: Long Term Location: Washington, DCHybrid Onsite : 4 days onsite per week from Day1The Senior Azure DevOps Engineer will work closely with development teams to automate and streamline our operations and processes, build and maintain tools for deployment, monitoring, and operations,...
-
Washington, United States SAIC Full timeDescription SAIC has an opening for a Cloud Engineer with experience designing and testing Azure Hyperconverged Infrastructure (HCI) to include integration with the Enterprise Azure Cloud Services (EACS). The Vanguard 2.2.1 program provides transparent, interconnected systems and security support for the Department of State (DOS) Bureau of Information...
-
Site Reliability Engineer
6 days ago
Washington, United States Palantir Technologies Full timeSite Reliability Engineer - Security Infrastructure Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more. The Role Our products support...
-
Azure Cloud Engineer
2 days ago
Washington, United States Science Applications International Corporation Full timeSAIC has an opening for a Cloud Engineer with experience designing and testing Azure Hyperconverged Infrastructure (HCI) to include integration with the Enterprise Azure Cloud Services (EACS). The Vanguard 2.2.1 program provides transparent, intercon Cloud Engineer, Azure, Cloud, Engineer, Technical Support, Operations