Site Reliability Engineer

3 weeks ago


Washington, United States Varada Consulting Full time

Site Reliability Engineer Job Location: Washington, DC; Hybrid This position is eligible for a $5,000 Sign-on Bonus and Relocation Assistance, if applicable. Overview: Varada Consulting, LLC is seeking a full-time highly skilled and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and infrastructure improvements. You will work closely with development, operations, and security teams to implement best practices for building and maintaining highly available and secure systems. Job Duties: Implement and maintain Infrastructure as Code (IaC) solutions to automate provisioning, configuration, and management of infrastructure components. Utilize containerization technologies such as Docker and Kubernetes (K8) to deploy and manage microservices-based applications. Employ container orchestration tools like Rancher, OpenShift, etc., to automate deployment, scaling, and management of containerized applications. Collaborate with development and security teams to integrate security practices into the DevOps pipeline and ensure compliance with security standards and policies. Manage Source Code repositories and CI/CD pipelines using tools such as Team Foundation Server/Azure DevOps, Bitbucket, and GitHub to automate build, test, and deployment processes. Apply Site Reliability Engineering (SRE) principles to design, build, and operate highly scalable and reliable systems that meet the needs of our customers. Monitor system performance, availability, and reliability using monitoring and alerting tools, and proactively identify and address issues before they impact users. Participate in on-call rotations, respond to incidents, troubleshoot issues, and implement permanent fixes to prevent recurrence. Continuously improve system reliability, scalability, and performance through capacity planning, performance tuning, and infrastructure optimizations.

Required Qualifications: Bachelors degree in Computer Science, Engineering, or a related field. Minimum of 8 years of experience as a Site Reliability Engineer or similar role. Strong experience with Infrastructure as Code (IaC), containerization, K8, and CI/CD Automation. Proficiency in container orchestration tools such as Rancher, OpenShift, etc. Experience working in a DevSecOps environment and integrating security practices into the development and operations processes. Hands-on experience with Source Code repositories and CI/CD pipeline solutions like Team Foundation Server/Azure DevOps, Bitbucket, and GitHub. Excellent problem-solving skills and ability to troubleshoot complex issues in distributed systems. Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.

Desired: Experience with Prometheus, Grafana or other monitoring tools. Certification in relevant technologies (e.g., Kubernetes, AWS, Azure) is a plus. Experience in scripting and programming languages such as PowerShell, Python, Bash, or Go for automation and tooling.

Clearance Requirements: Active Top Secret clearance/SCI with the ability to obtain and maintain Presidential Support Duty (PSD) approval (Yankee White) prior to employment

Join an Award Winning Team Voted as Most Innovative and Fastest Growing Company, Varada Consulting offers highly customized IT capabilities in the federal civilian and DoD market space in support of the mission objectives of the federal government. Varada provides competitive compensation and benefits packages, including 100% employer paid healthcare premium, matching 401k, and unlimited education/training. Varada Consulting, LLC is an Equal Employment Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or veteran status.

#J-18808-Ljbffr



  • Washington, United States StaffWorthy Inc Full time

    We are a dynamic technology services provider committed to delivering exceptional solutions to government clients. For over two decades, we have been assembling top-tier teams dedicated to innovation and excellence. Our mission revolves around the value we bring to our customers and the unwavering passion we have for our people. Position: Site Reliability...


  • Washington, United States Alldus Full time

    Our client is a Series A startup within the Generative AI space and they are hiring an Site Reliability Engineer to join the team. Backed by one of the leading venture capital firms in the industry, this is an exciting opportunity to join a SaaS company that is revolutionizing their industry. Responsibilities: As the Site Reliability Engineer, you will...


  • Washington, United States Cinder LLC Full time

    [Full Time] Site Reliability Engineer at Cinder (United States) | BEAMSTART Jobs Site Reliability Engineer Cinder United StatesDate Posted31 Oct, 2022Work LocationWashington, DC, United StatesSalary Offered$110 — $220 yearlyJob TypeFull TimeExperience Required1+ yearsRemote WorkYesStock OptionsNoVacancies1 availableAbout Cinder Cinder provides a...


  • Washington, United States Vontier Full time

    We are seeking an energetic, self-motivated Site Reliability Engineer to join our team! The ideal candidate will be highly energetic and committed to an excellent product, culture, and will be a strong communicator with solid problem solving skills. The site reliability engineer should be an IT expert who uses automation tools to monitor and observe...


  • Washington, United States System One Full time

    Site Reliability Engineer Work Location: 3 days onsite DC - JBAB, 2 days remote Clearance: Active TS/SCI with ability to clear PSD As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. What You’ll Do Monitor platform and...


  • Washington, United States Mount Indie Full time

    Job Description Job Description As a Site Reliability Engineer (SRE) , youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington, United States Mount Indie Full time

    Job Description Job Description As a Site Reliability Engineer (SRE) , youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington, United States Kansas Action for Children Full time

    at T-Mobile USA, Inc. in Overland Park, Kansas, United States Job Description Be unstoppable with us! T-Mobile is synonymous with innovation-and you could be part of the team that disrupted an entire industry! We reinvented customer service, brought real 5G to the nation, and now we're shaping the future of technology in wireless and beyond. Our work is as...


  • Washington, United States Mount Indie Full time

    Job DescriptionJob DescriptionAs aSite Reliability Engineer (SRE), youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States Harbor Compliance Full time

    Site Reliability Engineer - Full-time Remote Advance Your Career with Cutting-Edge Infrastructure at Harbor Compliance Location: Full-time Remote (Excluding CA, CO, MT, NY) About Harbor Compliance: Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology solutions. As we continue to...


  • Washington, United States Mount Indie Full time

    Mount Indie is on the search for a Lead Site Reliability Engineering (SRE) to work remotely, focusing on delivering mission critical services that empower end users. The role will involve designing and implementing end to end CI/CD pipelines using AI/ML tooling. Responsibilities: • Design and implement end-to-end CI/CD pipelines. • Employ extensive AWS...


  • Washington, United States Harbor Compliance Full time

    Job DescriptionJob DescriptionSite Reliability Engineer - Full-time RemoteAdvance Your Career with Cutting-Edge Infrastructure at Harbor ComplianceLocation: Full-time Remote (Excluding CA, CO, MT, NY)About Harbor Compliance:Harbor Compliance is committed to simplifying the regulatory challenges of businesses and nonprofits through innovative technology...


  • Washington, United States Veradigm Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today's healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Washington, United States Veradigm Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of todays healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics, and...


  • Washington, United States Evolver Federal Full time

    Job DescriptionJob DescriptionEvolver Federal is seeking a Site Reliability Engineer. This is a senior engineering and technical role that is focused on influencing, shaping, and managing the systems and processes that are relied upon for building and deploying the GovInfo application and constituent parts.ResponsibilitiesWork as a member of the team to...


  • Washington, United States Evolver Federal Full time

    Evolver Federal is seeking a Site Reliability Engineer. This is a senior engineering and technical role that is focused on influencing, shaping, and managing the systems and processes that are relied upon for building and deploying the GovInfo application and constituent parts. Responsibilities Work as a member of the team to support the GovInfo Program,...


  • Washington, United States Allscripts Full time

    Welcome to Veradigm, where our Mission is transforming health, insightfully. Join the Veradigm team and help solve many of today’s healthcare challenges being addressed by biopharma, health plans, healthcare providers, health technology partners, and the patients they serve. At Veradigm, our primary focus is on harnessing the power of research, analytics,...


  • Washington, United States EmergencyMD Full time

    Evolver Federal is seeking a Site Reliability Engineer. This is a senior engineering and technical role that is focused on influencing, shaping, and managing the systems and processes that are relied upon for building and deploying the GovInfo application and constituent parts. Responsibilities Work as a member of the team to support the GovInfo Program, and...


  • Washington, United States Articulate Full time

    Articulate is looking for a Senior Site Reliability Engineer to join our amazing Platform Engineering team. The Senior Site Reliability Engineer I will be responsible for working cross-functionally to deliver and maintain scalable and reliable infrastructure. To be considered for an interview, please make sure your application is full in line with the job...