Current jobs related to Site Reliability Engineer - Washington - Alldus
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States System One Full time
Job Title: Site Reliability Engineer
At System One, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, performance, and scalability of our systems.
Key Responsibilities: Monitor and analyze platform and containerized applications to identify...
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States Veterans Enterprise Technology Solutions Full time
Job Title: Site Reliability Engineer
Overview: We are seeking a highly skilled Site Reliability Engineer to join our team at Veterans Enterprise Technology Solutions. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
Responsibilities:
• Monitor and analyze...
-
Site Reliability Engineer
4 weeks ago
Washington, Washington, D.C., United States MetroStar Systems Full time
Job Title: Site Reliability Engineer
At MetroStar Systems, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.
Key Responsibilities: Monitor and analyze system performance to identify areas...
-
Site Reliability Engineer
3 weeks ago
Washington, Washington, D.C., United States Veterans Enterprise Technology Solutions Full time
Job Title: Site Reliability Engineer
Overview: Veterans Enterprise Technology Solutions is seeking a highly skilled Site Reliability Engineer to join our team. This role will be responsible for ensuring the reliability and performance of our cloud-based infrastructure. The ideal candidate will have a strong understanding of SRE principles and experience with...
-
Site Reliability Engineer
4 weeks ago
Washington, Washington, D.C., United States Varada Consulting, LLC Full time
Job Title: Site Reliability Engineer
Varada Consulting, LLC is seeking a highly skilled and experienced Site Reliability Engineer to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and infrastructure improvements.
Key...
-
Site Reliability Engineer
1 week ago
Washington, Washington, D.C., United States Ankura Full time
Job Summary: Ankura is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a pivotal role in ensuring the reliability and scalability of our cloud-based infrastructure.
Key Responsibilities: Design, deploy, and manage cloud infrastructure solutions using leading cloud platforms such as Azure, AWS,...
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States Alldus Full time
Site Reliability Engineer
Alldus is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.
Key Responsibilities: Perform root cause analysis to identify and resolve system or application issues in a...
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States Veterans Enterprise Technology Solutions Full time
Job Title: Site Reliability Engineer
Overview: Veterans Enterprise Technology Solutions is seeking a highly skilled Site Reliability Engineer to join our team. This role will involve working on a rotating hybrid schedule, with 3 days onsite at JBAB and 2 days remote. An Active Top Secret SCI clearance is required for this position.
Responsibilities: Monitor and...
-
Site Reliability Engineer
1 month ago
Washington, United States Varada Consulting Full time
Site Reliability Engineer
Job Location: Washington, DC; Hybrid
Overview: Varada Consulting, LLC is seeking a full-time, highly skilled and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and...
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States MetroStar Corporation Full time
Job Title: Site Reliability Engineer
We are seeking a highly skilled Site Reliability Engineer to join our team at MetroStar Corporation. As a key member of our team, you will be responsible for driving improvements in observability, performance, and reliability of our systems.
Key Responsibilities: Monitor and analyze platform and containerized applications to...
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States Cinder LLC Full time
About Cinder LLC
Cinder LLC is a cutting-edge investigation platform that protects the internet. Our software helps Trust and Safety teams at influential companies innovate and adapt quickly to emerging threats.
We're seeking an experienced Site Reliability Engineer to lead the development and deployment of our robust infrastructure.
Job...
-
Site Reliability Engineer II
4 weeks ago
Washington, Washington, D.C., United States Microsoft Full time
Job Title: Site Reliability Engineer II
Microsoft is seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for designing, developing, and delivering software engineering solutions to serve and protect O365 government clouds.
Key Responsibilities: Design, develop, and deploy software...
-
Site Reliability Engineer
4 weeks ago
Washington, Washington, D.C., United States Palantir Technologies Full time
About the Role
We're looking for a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
Key Responsibilities
Maintain the availability of cloud and physical Linux servers that power...
-
Site Reliability Engineer
2 weeks ago
Washington, DC, USA, United States Mount Indie Full time
Job Title: Site Reliability Engineer
At Mount Indie, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
Key Responsibilities: Monitor and analyze platform and containerized applications...
-
Senior Site Reliability Engineer
3 days ago
Washington, Washington, D.C., United States Verint Systems Full time
About the Role: Verint Systems is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our systems and services.
Key Responsibilities:
Design and implement scalable and reliable systems and services
Collaborate with cross-functional...
-
Site Reliability Engineer
1 month ago
Washington, Washington, D.C., United States MetroStar Systems Full time
Transforming Government Services with Reliability and Performance
As a Site Reliability Engineer at MetroStar Systems, you will play a pivotal role in driving improvements in observability, performance, and reliability across high-level government platforms. Your expertise will be instrumental in making a lasting impact.
Key Responsibilities: Monitor and...
-
Site Reliability Engineer
2 weeks ago
Washington, DC, USA, United States MetroStar Corporation Full time
Job Title: Site Reliability Engineer
At MetroStar Corporation, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems.
Key Responsibilities: Monitor and analyze platform and containerized applications to...
-
Site Reliability Engineer
2 weeks ago
Washington, DC, USA, United States Veterans Enterprise Technology Solutions Full time
Job Title: Site Reliability Engineer
Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Veterans Enterprise Technology Solutions. As a Site Reliability Engineer, you will be responsible for ensuring the optimal performance and availability of our platform and containerized applications.
Responsibilities: Monitor and...
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States Palantir Technologies Full time
About the Role
We're seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the availability, scalability, and reliability of our cloud and on-premises infrastructure.
Key Responsibilities
Maintain the availability of cloud and physical Linux servers that...
Site Reliability Engineer
2 months ago
Our client is a Series A startup in the Generative AI space and is hiring a Site Reliability Engineer to join the team. The company is backed by one of the leading venture capital firms in the industry, making this an exciting opportunity to join a SaaS company that is revolutionizing its industry.
Responsibilities:
• Perform root cause analysis to identify and resolve system or application issues in a timely and effective manner.
• Design and implement a broad range of automated tests to ensure system reliability and performance.
• Build scalable and cost-effective observability patterns in Datadog or other monitoring providers.
• Monitor and analyze SLIs to ensure adherence to SLAs and SLOs.
• Collaborate with development and operations teams to improve system reliability and developer experience.
• Develop and maintain monitoring and alerting systems to proactively address issues.
• Implement best practices for incident management and disaster recovery.
• Plan and implement capacity upgrades, ensuring scalability and performance.
• Define, monitor, and manage SLAs, ensuring service levels meet or exceed expectations.
• Ensure systems comply with security and regulatory requirements.
Skillset:
• Experienced in Kubernetes and Helm.
• Expertise in observability and monitoring tools such as Prometheus, Grafana, Datadog, or ELK.
• Experience in Azure cloud.
• Strong understanding of microservices architecture, including Postgres and AI systems.
• Expertise in automated testing frameworks and tools.
• Experience with monitoring and analytics tools to track SLIs, SLAs, and SLOs.
• Excellent problem-solving skills and attention to detail.
• Tenacious attitude.
• Proficiency in programming languages such as TypeScript and Python.
• Strong scripting skills in Bash, PowerShell, or similar.
• Understanding of networking principles and experience with network troubleshooting.
Additional Information: This is a full-time, remote position open only to US citizens due to potential security clearance requirements.
Benefits:
• Salary: $140k – $175k
• Stock options
• Benefits package
Interested? Email your resume directly to matthew@alldus.com for consideration.