Site Reliability Engineer

4 weeks ago


Washington, D.C., United States | Evolent Health | Full time

About the Role:

Evolent Health is seeking a highly skilled Site Reliability Engineer to join our Platform Engineering organization. As a member of this team, you will play a critical role in managing our large application suite and cloud infrastructure.

Key Responsibilities:

  • Implement and manage observability solutions using OpenTelemetry to monitor and trace application performance.
  • Implement and manage containerization solutions using platforms such as Docker and Kubernetes, with a strong focus on Azure Kubernetes Service (AKS) and Azure Container Apps (ACA).
  • Monitor the health and performance of containers and resolve any issues that arise.
  • Follow security best practices to ensure containerized workloads are fully secure.
  • Partner with DevOps to advance Infrastructure as Code and Configuration as Code practices.
  • Partner with the Platform Architecture team to continuously improve the Internal Developer Platform (IDP).
  • Participate in Root Cause Analysis (RCA) to identify corrective action plans (CAPs).

Requirements:

  • 3+ years of hands-on Azure experience and 5+ years of overall cloud-native experience.
  • Strong understanding of OpenTelemetry and experience in implementing observability solutions.
  • Proven experience in implementing and managing container orchestration platforms such as Kubernetes and Docker.
  • Deep understanding of deployment methodologies for Kubernetes, preferably ArgoCD and Helm.
  • Experience with other Azure services such as Azure Functions, Azure Logic Apps, and Azure Service Fabric.
  • Passion for and creativity in automation, using tools such as Ansible and Terraform.
  • Experience in working with GitHub Actions or Jenkins.
  • Expertise in at least one of these scripting/configuration languages: PowerShell, YAML, HCL, Python.
  • Expertise in at least one of these APM tools: Prometheus, Dynatrace, Datadog.
  • Experience leveraging agile methodology (e.g., Scrumban) to manage project work.
  • Highly effective communicator with a strong commitment to transparency.
  • PostgreSQL and Fast Healthcare Interoperability Resources (FHIR) API experience is preferred.

Technical Requirements:

We require that all employees have the following technical capability at home: high-speed internet over 10 Mbps and, specifically for all call center employees, the ability to plug in directly to the home internet router.

Evolent Health is an equal opportunity employer and considers all qualified applicants equally without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability status.

As part of our total compensation package, Evolent Health is proud to offer comprehensive benefits (including health insurance benefits) to qualifying employees.


