Current jobs related to Site Reliability Engineer - Washington - Clients First Technologies (CFT)


  • Washington, United States Cinder LLC Full time

    [Full Time] Site Reliability Engineer at Cinder (United States) Site Reliability Engineer Cinder United States Date Posted: 31 Oct, 2022 Work Location: Washington, DC, United States Salary Offered: $110 — $220 yearly Job Type: Full Time Experience Required: 1+ years Remote Work: Yes Stock Options: No Vacancies: 1 available About Cinder Cinder provides a...


  • Washington, United States Varada Consulting Full time

    Site Reliability EngineerJob Location-Washington, DC; Hybrid Overview:Varada Consulting, LLC is seeking a full-time highly skilled and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and...


  • Washington, United States Alldus Full time

    Our client is a Series A startup within the Generative AI space and they are hiring a Site Reliability Engineer to join the team. Backed by one of the leading venture capital firms in the industry, this is an exciting opportunity to join a SaaS company that is revolutionizing their industry. Responsibilities: As the Site Reliability Engineer, you will...


  • Washington, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers. Responsibilities Monitor platform and containerized...


  • Washington, United States System One Full time

    Site Reliability Engineer Work Location: 3 days onsite DC - JBAB, 2 days remote Clearance: Active TS/SCI with ability to clear PSD As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. What You’ll Do Monitor platform and...


  • Washington, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers.ResponsibilitiesMonitor platform and containerized...


  • Washington, United States Mount Indie Full time

    Job DescriptionJob DescriptionAs aSite Reliability Engineer (SRE), youll continuously drive improvements in observability, performance, and reliability,with the goal to make an impact across the federal government. This role requires a current TS/SCI that has been obtained within the last 51 months and the ability to pass additional background...


  • Washington, United States Kansas Action for Children, Inc Full time

    at T-Mobile USA, Inc. in Overland Park, Kansas, United States Job DescriptionBe unstoppable with us!T-Mobile is synonymous with innovation-and you could be part of the team that disrupted an entire industry! We reinvented customer service, brought real 5G to the nation, and now we're shaping the future of technology in wireless and beyond. Our work is as...


  • Washington, United States Kansas Action for Children, Inc Full time

    at T-Mobile USA, Inc. in Overland Park, Kansas, United StatesJob DescriptionBe unstoppable with us!T-Mobile is synonymous with innovation-and you could be part of the team that disrupted an entire industry! We reinvented customer service, brought real 5G to the nation, and now we're shaping the future of technology in wireless and beyond. Our work is as...


  • Washington, United States CruitZi, INC Full time

    Job DescriptionJob DescriptionOur Client is currently hiring a full-time Sr. Site Reliability Engineer (SRE), who will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government.This role is Hybrid, requiring travel to downtown Washington, DC, at...


  • Washington, United States Karsun Solutions Full time

    About the RoleWe are seeking a highly skilled and experienced Site Reliability Engineering Manager to join our team at Karsun Solutions. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our systems and services.Key Responsibilities:Lead a team of engineers in designing, implementing, and maintaining robust...


  • Washington, United States Veterans Enterprise Technology Solutions Full time

    Job Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Veterans Enterprise Technology Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems and applications.Key Responsibilities:Monitor and analyze system performance to identify...


  • Washington, United States Veterans Enterprise Technology Solutions Full time

    Job Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Veterans Enterprise Technology Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our infrastructure.Key Responsibilities:Monitor and Maintain Infrastructure: Continuously monitor our...


  • Washington, United States Kansas Action for Children, Inc Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our team at Kansas Action for Children, Inc. in Overland Park, Kansas, United States.This is an exciting opportunity for a technical professional who is passionate about innovation and wants to be part of a team that is reshaping the future of technology in the wireless...


  • Washington, United States Karsun Solutions Full time

    We are seeking a highly skilled and experienced Site Reliability Manager to join our team. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our systems and services. They will lead a team of engineers in designing, implementing, and maintaining robust infrastructure and automation solutions. The ideal...


  • Washington, United States Red Frog Solutions Full time

    Site Reliability Engineer - SRE - (TS/SCI) Full Time Perm Washington D.C. (Hybrid - 3 days onsite, 2 days remote) $180K - $200K Salary Plus Competitive Benefits As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the...


  • Washington, United States Tik Tok Full time

    About the RoleTikTok is a leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our platform.Key ResponsibilitiesCollaborate with infrastructure, product, and platform engineering teams to operate and...


  • Washington, United States MetroStar Systems Full time

    ***$25k Sign-On Bonus for this role*** As a Site Reliability Engineer (SRE) , you'll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the highest levels of government. If you think you can see yourself delivering our mission and pursuing our goals with us, then check out the job...


  • Washington, United States Veterans Enterprise Technology Solutions Full time

    Overview: Staffing Pros, a division of VETS Inc., is recruiting for a full-time Site Reliability Engineer. This position will work a rotating hybrid schedule- 3 days onsite at JBAB, 2 days remote. An Active Top Secret SCI clearance is required for this role. If you have additional questions not answered by the information contained within this posting,...


  • Washington, United States Palantir Technologies Full time

    About the RolePalantir Technologies is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesCollaborate with cross-functional teams to design, implement, and maintain...

Site Reliability Engineer

1 month ago


Washington, United States Clients First Technologies (CFT) Full time
Job DescriptionJob Description

Client First Technologies (CFT) is seeking a highly skilled and experienced Senior DevOps Engineer / Site Reliability Engineer (SRE) to join our team. The ideal candidate will have experience in DevOps, with a strong background in leading deployment, integration, and version upgrades of containerized commercial off-the-shelf (COTS) products. As a Senior DevOps Engineer, you will be responsible for leading deployment processes, performing root cause analysis to resolve system outages and issues, and writing comprehensive documentation. This is a full-time, remote position. CFT offers a full benefits package, a collaborative work environment and strong company culture. Veterans and military spouses are encouraged to apply.

Description

 

Responsibilities

  • Lead the deployment, integration, and version upgrades of COTS products
  • Ensure seamless integration with existing systems and infrastructure
  • Coordinate with stakeholders/team members to ensure successful implementation and upgrades
  • Ensure secure containerized software deployments by scanning for vulnerabilities, documenting findings, and developing the plan of action
  • Familiarity with tools like: Red Hat Quay/Clair, Red Hat Advanced Cluster Security for Kubernetes, OpenShift ImageStreams, Aqua Security, and Anchore Syft
  • Analyze current deployment processes and identify areas for improvement
  • Familiarity with tools like Argo CD, GitOps, Kubernetes Dashboard, OpenShift Web Console, and Prometheus w/Grafana
  • Develop and implement automation scripts and tools to streamline deployment activities
  • Familiarity with: Bash/Shell, Python, JavaScript, YAML, Linux OS, Docker, Kubectl, OpenShift CLI, Helm, Jenkins, and GitLab CI
  • Continuously monitor and optimize deployment workflows to enhance efficiency and reliability
  • Familiarity with tools like: Dynatrace, Prometheus, Grafana, New Relic, and Datadog
  • Perform root cause analysis of system outages and performance issues
  • Develop and implement effective solutions to prevent recurrence
  • Collaborate with other teams to ensure timely resolution of issues
  • Create and maintain detailed documentation, including physical and logical diagrams, installation guides, backout and rollback plans, test plans, and test scripts
  • Ensure all documentation is up-to-date and accessible to relevant stakeholders
  • Provide training and support to team members on documentation usage and updates

Qualifications

  • High school diploma or equivalent
  • Minimum of nine years of experience in DevOps/DevSecOps or a related field
  • Proven track record of leading deployment, integration, and version upgrades of COTS products
  • Strong expertise in optimizing deployment processes through automation
  • Extensive experience with CI/CD (Continuous Integration/Continuous Deployment) pipelines
  • Experience working with technologies including but not limited to Docker, Kubernetes, Argo, AWS Red Hat Quay/Clair, Red Hat Advanced Cluster Security for Kubernetes, OpenShift ImageStreams, Aqua Security, Anchore Syft, Bash/Shell, Python, JavaScript, YAML, Linux OS, Docker, Kubectl, OpenShift CLI, Helm, Jenkins, and GitLab CI
  • Excellent problem-solving skills with the ability to perform root cause analysis and resolve issues
  • Proficiency in writing comprehensive technical documentation
  • Strong communication and collaboration skills
  • Experience work with VAPO (VA Platform One) is highly desired

Physical Demands

  • Must be able to sit and stand for long periods of time
  • Occasional travel and overtime may be required

Required Clearances and Screenings

  • Must be able to successfully satisfy a Tier 2/Moderate Risk Government Background Investigation

 

COVID-19 Protocols: As a Federal contractor, CFT is required to comply with COVID-19 protocols applicable to the agency, facility, and location. All COVID-19 requirements are in line with government policies and CDC guidance applicable at the time.

 

CFT is a proud equal opportunity employer. All qualified applicants will be considered for employment without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status. Discrimination and harassment are not tolerated.

Company DescriptionClient First Technologies (CFT) provides Strategic Consulting, Technology and Managed Services to commercial, non-profit and government organizations. Our expertise lies in mobilizing the right people, skills and technologies to help organizations with their most pressing challenges.

As a Service Disabled Veteran Owned Small Business (SDVOSB), CFT is committed to excellence and creating innovative and flexible solutions for our clients.Company DescriptionClient First Technologies (CFT) provides Strategic Consulting, Technology and Managed Services to commercial, non-profit and government organizations. Our expertise lies in mobilizing the right people, skills and technologies to help organizations with their most pressing challenges.\r
\r
As a Service Disabled Veteran Owned Small Business (SDVOSB), CFT is committed to excellence and creating innovative and flexible solutions for our clients.