Site Reliability Engineer
2 days ago
Site Reliability Engineer
Job Location-Washington, DC; Hybrid
Overview:
Varada Consulting, LLC is seeking a full-time highly skilled and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and infrastructure improvements. You will work closely with development, operations, and security teams to implement best practices for building and maintaining highly available and secure systems.
Job Duties:
- Implement and maintain Infrastructure as Code (IaC) solutions to automate provisioning, configuration, and management of infrastructure components.
- Utilize containerization technologies such as Docker and Kubernetes (K8) to deploy and manage microservices-based applications.
- Employ container orchestration tools like Rancher, OpenShift, etc., to automate deployment, scaling, and management of containerized applications.
- Collaborate with development and security teams to integrate security practices into the DevOps pipeline and ensure compliance with security standards and policies.
- Manage Source Code repositories and CI/CD pipelines using tools such as Team Foundation Server/Azure DevOps, Bitbucket, and GitHub to automate build, test, and deployment processes.
- Apply Site Reliability Engineering (SRE) principles to design, build, and operate highly scalable and reliable systems that meet the needs of our customers.
- Monitor system performance, availability, and reliability using monitoring and alerting tools, and proactively identify and address issues before they impact users.
- Participate in on-call rotations and respond to incidents, troubleshoot issues, and implement permanent fixes to prevent recurrence.
- Continuously improve system reliability, scalability, and performance through capacity planning, performance tuning, and infrastructure optimizations.
Required Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Minimum of 8 years of experience as a Site Reliability Engineer or similar role.
- Strong experience with Infrastructure as Code (IaC), containerization, K8, and CI/CD Automation.
- Proficiency in container orchestration tools such as Rancher, OpenShift, etc.
- Experience working in a DevSecOps environment and integrating security practices into the development and operations processes.
- Hands-on experience with Source Code repositories and CI/CD pipeline solutions like Team Foundation Server/Azure DevOps, Bitbucket, and GitHub.
- Excellent problem-solving skills and ability to troubleshoot complex issues in distributed systems.
- Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.
Desired:
- Experience with Prometheus and Grafana or other monitoring tools.
- Certification in relevant technologies (e.g., Kubernetes, AWS, Azure) is a plus.
- Experience in scripting and programming languages such as PowerShell, Python, Bash, or Go for automation and tooling.
Clearance Requirements:
- Active Top Secret clearance/SCI with the ability to obtain and maintain Presidential Support Duty (PSD) approval (Yankee White) prior to employment
Join an Award – Winning Team Voted as Most Innovative and Fastest Growing Company, Varada Consulting offers highly customized IT capabilities in the federal civilian and DoD market space in support of the mission objectives of the federal government. Varada provides competitive compensation and benefits packages including 100% employer paid healthcare premium, matching 401k, and unlimited education/training.
Varada Consulting, LLC is an Equal Employment Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or veteran status.
-
Site Reliability Engineer
1 day ago
Washington, Washington, D.C., United States MetroStar Corporation Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at MetroStar Corporation. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems.Key Responsibilities:Monitor and analyze platform and containerized applications to...
-
Site Reliability Engineer
7 days ago
Washington, Washington, D.C., United States MetroStar Systems Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at MetroStar Systems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems.Key Responsibilities:Monitor and analyze platform and containerized applications to identify...
-
Site Reliability Engineer
5 days ago
Washington, Washington, D.C., United States Alldus Full timeSite Reliability EngineerAlldus is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems.Key Responsibilities:Perform root cause analysis to identify and resolve system or application issues in a timely and...
-
Site Reliability Engineer
4 weeks ago
Washington, United States Cinder LLC Full time[Full Time] Site Reliability Engineer at Cinder (United States) Site Reliability Engineer Cinder United States Date Posted: 31 Oct, 2022 Work Location: Washington, DC, United States Salary Offered: $110 — $220 yearly Job Type: Full Time Experience Required: 1+ years Remote Work: Yes Stock Options: No Vacancies: 1 available About Cinder Cinder provides a...
-
Site Reliability Engineer
5 days ago
Washington, Washington, D.C., United States Tik Tok Full timeAbout the RoleTikTok is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our software systems.ResponsibilitiesWork with infrastructure, product, and platform engineering teams to operate and deploy software platforms, capacity planning,...
-
Site Reliability Engineer
4 days ago
Washington, Washington, D.C., United States CloudFit Software Full timeJob Title: Site Reliability EngineerCloudFit Software is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the quality, performance, and reliability of our CloudFit Managed Applications and Services systems.Key Responsibilities:Collaborate with cross-functional teams...
-
Site Reliability Engineer
2 days ago
Washington, United States Varada Consulting Full timeSite Reliability EngineerJob Location-Washington, DC; HybridOverview:Varada Consulting, LLC is seeking a full-time highly skilled and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and...
-
Site Reliability Engineer
6 days ago
Washington, Washington, D.C., United States Cinder LLC Full timeAbout Cinder LLCCinder LLC provides a cutting-edge investigation platform to protect the internet.Our software helps Trust and Safety teams at the world's most influential companies innovate and adapt quickly to emerging threats.Job Title: Site Reliability EngineerWe're seeking an experienced Site Reliability Engineer to lead the development and deployment...
-
Site Reliability Engineer
1 month ago
Washington, United States Alldus Full timeOur client is a Series A startup within the Generative AI space and they are hiring a Site Reliability Engineer to join the team. Backed by one of the leading venture capital firms in the industry, this is an exciting opportunity to join a SaaS company that is revolutionizing their industry. Responsibilities: As the Site Reliability Engineer, you will...
-
Site Reliability Engineer
6 days ago
Washington, Washington, D.C., United States Palantir Technologies Full time{"title": "Site Reliability Engineer", "description": "Job SummaryPalantir Technologies is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications.Key ResponsibilitiesCollaborate with cross-functional teams...
-
Site Reliability Engineer
5 days ago
Washington, Washington, D.C., United States MetroStar Systems Full timeTransforming Government Services with Reliability and PerformanceAs a Site Reliability Engineer at MetroStar Systems, you will play a pivotal role in driving improvements in observability, performance, and reliability across high-level government platforms. Your expertise will be instrumental in making a lasting impact.Key Responsibilities:Monitor and...
-
Site Reliability Engineer
6 days ago
Washington, Washington, D.C., United States MetroStar Corporation Full timeMetroStar Corporation is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our organization, you will play a critical role in driving improvements in observability, performance, and reliability across our systems.**Key Responsibilities:*** Monitor and analyze platform and containerized applications to identify...
-
Site Reliability Engineer
3 weeks ago
Washington, United States StaffWorthy Inc. Full timeWe are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers. Responsibilities Monitor platform and containerized...
-
Site Reliability Engineer
2 weeks ago
Washington, Washington, D.C., United States MetroStar Systems Full timeTransforming Government Services with Reliability and PerformanceAs a Site Reliability Engineer at MetroStar Systems, you will play a pivotal role in driving improvements in observability, performance, and reliability across high-level government platforms. Your expertise will be instrumental in making a lasting impact.Key Responsibilities:Monitor and...
-
Principal Site Reliability Engineer
7 days ago
Washington, Washington, D.C., United States Kansas Action for Children Full timeTransforming System ReliabilityWe're seeking a seasoned Principal Site Reliability Engineer to spearhead the improvement of system reliability and resilience at T-Mobile USA, Inc. in Overland Park, Kansas, United States.About the RoleAs a key member of our team, you'll apply your expertise to minimize manual effort and prevent operational incidents. Your...
-
Site Reliability Engineer
4 months ago
Washington, United States System One Full timeSite Reliability Engineer Work Location: 3 days onsite DC - JBAB, 2 days remote Clearance: Active TS/SCI with ability to clear PSD As a Site Reliability Engineer (SRE), you’ll continuously drive improvements in observability, performance, and reliability, with the goal to make an impact across the federal government. What You’ll Do Monitor platform and...
-
Site Reliability Engineer
5 days ago
Washington, Washington, D.C., United States Palantir Technologies Full timeAbout the RoleWe're seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in building, operating, and maintaining high-performance, scalable, and reliable services for our production infrastructure.Key ResponsibilitiesMaintain the availability of cloud and physical...
-
Site Reliability Engineer
4 weeks ago
Washington, United States StaffWorthy Inc. Full timeWe are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers.ResponsibilitiesMonitor platform and containerized...
-
Principal Site Reliability Engineer
4 weeks ago
Washington, United States Kansas Action for Children, Inc Full timeat T-Mobile USA, Inc. in Overland Park, Kansas, United States Job DescriptionBe unstoppable with us!T-Mobile is synonymous with innovation-and you could be part of the team that disrupted an entire industry! We reinvented customer service, brought real 5G to the nation, and now we're shaping the future of technology in wireless and beyond. Our work is as...
-
Sr. Site Reliability Engineer
2 months ago
Washington, United States CruitZi, INC Full timeJob DescriptionJob DescriptionOur Client is currently hiring a full-time Sr. Site Reliability Engineer (SRE), who will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government.This role is Hybrid, requiring travel to downtown Washington, DC, at...