Current jobs related to Lead Site Reliability Engineer - Washington - Mount Indie


  • Washington, Washington, D.C., United States Capital One Full time

    Job Title: Lead Platform Engineer, Site Reliability Engineering. Capital One is seeking a highly skilled Lead Platform Engineer, Site Reliability Engineering to join our team. As a key member of our engineering organization, you will be responsible for designing, developing, and implementing scalable and reliable cloud-based systems. Key Responsibilities: Design...


  • Washington, United States Cinder LLC Full time

    [Full Time] Site Reliability Engineer at Cinder (United States) Site Reliability Engineer Cinder United States Date Posted: 31 Oct, 2022 Work Location: Washington, DC, United States Salary Offered: $110 — $220 yearly Job Type: Full Time Experience Required: 1+ years Remote Work: Yes Stock Options: No Vacancies: 1 available About Cinder Cinder provides a...


  • Washington, Washington, D.C., United States MetroStar Corporation Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at MetroStar Corporation. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems. Key Responsibilities: Monitor and analyze platform and containerized applications to...


  • Washington, Washington, D.C., United States Cinder LLC Full time

    About Cinder LLC: Cinder LLC provides a cutting-edge investigation platform to protect the internet. Our software helps Trust and Safety teams at the world's most influential companies innovate and adapt quickly to emerging threats. Job Title: Site Reliability Engineer. We're seeking an experienced Site Reliability Engineer to lead the development and deployment...


  • Washington, United States Alldus Full time

    Our client is a Series A startup within the Generative AI space and they are hiring a Site Reliability Engineer to join the team. Backed by one of the leading venture capital firms in the industry, this is an exciting opportunity to join a SaaS company that is revolutionizing their industry. Responsibilities: As the Site Reliability Engineer, you will...


  • Washington, Washington, D.C., United States MetroStar Systems Full time

    Job Title: Site Reliability Engineer. At MetroStar Systems, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems. Key Responsibilities: Monitor and analyze system performance to identify areas...


  • Washington, Washington, D.C., United States Palantir Technologies Full time

    Job Title: Site Reliability Engineer. Job Summary: Palantir Technologies is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications. Key Responsibilities: Collaborate with cross-functional teams...


  • Washington, Washington, D.C., United States MetroStar Systems Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at MetroStar Systems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems. Key Responsibilities: Monitor and analyze platform and containerized applications to identify...


  • Washington, Washington, D.C., United States DataRobot Full time

    Job Title: Director of Site Reliability Engineering Job Summary: DataRobot is seeking a highly skilled and experienced Director of Site Reliability Engineering to lead our SRE team. As a key member of our engineering organization, you will be responsible for ensuring the reliability, scalability, and performance of our platform. Key Responsibilities: *...


  • Washington, Washington, D.C., United States Alldus Full time

    Site Reliability Engineer. Alldus is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems. Key Responsibilities: Perform root cause analysis to identify and resolve system or application issues in a timely and...


  • Washington, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers. Responsibilities: Monitor platform and containerized...


  • Washington, Washington, D.C., United States Tik Tok Full time

    About the Role: TikTok is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our software systems. Responsibilities: Work with infrastructure, product, and platform engineering teams to operate and deploy software platforms, capacity planning,...


  • Washington, Washington, D.C., United States CloudFit Software Full time

    Job Title: Site Reliability Engineer. CloudFit Software is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the quality, performance, and reliability of our CloudFit Managed Applications and Services systems. Key Responsibilities: Collaborate with cross-functional teams...


  • Washington, United States Varada Consulting Full time

    Site Reliability Engineer. Job Location: Washington, DC; Hybrid. Overview: Varada Consulting, LLC is seeking a full-time highly skilled and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and...


  • Washington, United States Varada Consulting Full time

    Site Reliability Engineer. Job Location: Washington, DC; Hybrid. Overview: Varada Consulting, LLC is seeking a full-time highly skilled and experienced Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications through automation, monitoring, and...


  • Washington, United States StaffWorthy Inc. Full time

    We are a leading technology services provider with a rich history of assembling exceptional teams dedicated to delivering outstanding solutions. For over two decades, we have been committed to excellence, with a mission centered around our passion for our people and the value they deliver to our customers. Responsibilities: Monitor platform and containerized...


  • Washington, Washington, D.C., United States Microsoft Full time

    Job Title: Site Reliability Engineer II. Microsoft is seeking a highly skilled Site Reliability Engineer II to join our team. As a Site Reliability Engineer II, you will be responsible for designing, developing, and delivering software engineering solutions to serve and protect O365 government clouds. Key Responsibilities: Design, develop, and deploy software...


  • Washington, United States TEKsystems Full time

    Job Summary: One of the largest financial institutions in Japan is seeking a highly skilled DevOps/Site Reliability Engineer to join a large-scale migration project. As a key member of the team, you will be responsible for designing and implementing the pipeline architecture for the migrations. This is an exciting opportunity to join a leading organization...


  • Washington, Washington, D.C., United States MetroStar Systems Full time

    Transforming Government Services with Reliability and Performance. As a Site Reliability Engineer at MetroStar Systems, you will play a pivotal role in driving improvements in observability, performance, and reliability across high-level government platforms. Your expertise will be instrumental in making a lasting impact. Key Responsibilities: Monitor and...


  • Washington, Washington, D.C., United States MetroStar Corporation Full time

    MetroStar Corporation is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our organization, you will play a critical role in driving improvements in observability, performance, and reliability across our systems. Key Responsibilities: Monitor and analyze platform and containerized applications to identify...

Lead Site Reliability Engineer

4 months ago


Washington, United States Mount Indie Full time
Job Description

Mount Indie is searching for a Lead Site Reliability Engineer (SRE) to work remotely, focusing on delivering mission-critical services that empower end users. The role will involve designing and implementing end-to-end CI/CD pipelines using AI/ML tooling.


Responsibilities:

  • Design and implement end-to-end CI/CD pipelines.
  • Employ extensive AWS cloud experience in a production environment (e.g., network, security, deployment, automation, serverless technologies).
  • Leverage a deep understanding of SRE principles for highly scalable and reliable systems.
  • Utilize extensive experience with Configuration Management and Infrastructure as Code.
  • Serve as a thought leader for agile development teams.
  • Enforce clarity of direction and a shared vision of success that is championed by team members, stakeholders, and product owners.
  • Build relationships and collaborate with team members, stakeholders, product owners, and technical team leads.
  • Enhance processes, communication, and delivery practices to improve how work is done from discovery to delivery.

Required Qualifications:

  • At least 10 years of software engineering and DevOps experience.
  • Proven experience designing and implementing end-to-end continuous delivery pipelines.
  • Strong cloud experience with AWS in a production environment (e.g., network, security, deployment, automation, serverless technologies).
  • Deep understanding of SRE principles for highly scalable and reliable systems.
  • Proven experience with Configuration Management and Infrastructure as Code (IaC).
  • Proven leadership skills, creating a supportive, inclusive, collaborative, and empowering team culture.
  • Ability to lead teams through ambiguity and establish a clear vision and direction.
  • Must have a bachelor's degree in Computer Science, Information Technology Management, or a related engineering field.
  • Ability to obtain and maintain a DHS Suitability clearance.