Reliability Engineering Manager

2 days ago


Washington, DC, USA | Specialized Group | Full time

Unlock Your Potential as a Reliability Engineering Manager

Specialized Group is a leading quantitative hedge fund and financial technology firm that leverages advanced data science and machine learning to drive investment strategies and innovative solutions.

We're known for our cutting-edge research and collaborative environment, attracting top talent passionate about solving complex problems with data-driven approaches.

About the Role:

As a Reliability Engineering Manager, you will provide primary support for multiple large distributed systems, including mission-critical trading systems and in-house algorithms.

You will set team direction by working with stakeholders and prioritizing projects, manage a team of reliability engineers, and offer technical expertise and career development advice.

You will also partner with traders and portfolio managers to deliver business value across various asset classes.

Requirements:

  • 5+ years of professional experience in a technical and leadership position
  • Ability to program with one or more high-level languages (Python, Java, C++)
  • Experience in automating processes using algorithmic problem-solving methods
  • Professional fluency in English

This position offers a highly competitive compensation package, access to advanced technology within the financial industry, generous vacation and unlimited sick days, and the opportunity to work with international teams on global projects.





  • Washington, DC, USA, United States Palantir Technologies Full time

    Site Reliability Engineer. Job Summary: We are seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will be responsible for designing, deploying, and operating high-performance, scalable, and reliable services for our production infrastructure. Key...


  • Washington, DC, USA, United States Northern Star Mining Services Limited Full time

    Reliability Engineer - Mechanical. At Northern Star Mining Services Limited, we are seeking a skilled Reliability Engineer - Mechanical to join our team. As a key member of our reliability team, you will play a pivotal role in upholding our STARR Core Values of Safety, Teamwork, Accountability, Respect, and Results. Key Responsibilities: Conduct analytical work...


  • Washington, DC, USA, United States Karsun Solutions Full time

    About Karsun Solutions: Karsun Solutions is a leading provider of innovative technology solutions to the US Government. Our team is dedicated to delivering high-quality services that transform the way our clients operate. Job Summary: We are seeking a highly skilled Site Reliability Manager to join our team. The ideal candidate will be responsible for ensuring...


  • Washington, DC, USA, United States Kansas Action for Children Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at T-Mobile USA, Inc. in Overland Park, Kansas, United States. About the Role: The Principal Site Reliability Engineer will play a crucial role in improving system reliability and resilience, facilitating faster and more efficient...


  • Washington, DC, USA, United States Mount Indie Full time

    Job Title: Site Reliability Engineer. At Mount Indie, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Monitor and analyze platform and containerized applications...


  • Washington, DC, USA, United States Veterans Enterprise Technology Solutions Full time

    Job Title: Site Reliability Engineer. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Veterans Enterprise Technology Solutions. As a Site Reliability Engineer, you will be responsible for ensuring the optimal performance and availability of our platform and containerized applications. Responsibilities: Monitor and...


  • Washington, DC, USA, United States Kansas Action for Children Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at T-Mobile USA, Inc. in Overland Park, Kansas, United States. About the Role: As a Principal Site Reliability Engineer, you will play a crucial role in improving system reliability and resilience, facilitating faster and more...


  • Washington, DC, USA, United States MetroStar Corporation Full time

    Job Title: Site Reliability Engineer. At MetroStar Corporation, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and scalability of our systems. Key Responsibilities: Monitor and analyze platform and containerized applications to...




  • Washington, DC, USA, United States Radius Networks Inc Full time

    About Radius Networks Inc: Radius Networks Inc is the global leader in location technology solutions, powering some of the world's largest restaurant, grocery, retail, and hospitality brands with its Flybuy platform. Flybuy helps companies deliver a seamless customer experience, boost loyalty, and drive efficient staff operations. Job Summary: We're seeking a...


  • Washington, DC, USA, United States Palantir Technologies Full time

    About the Role: We are seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications. Key Responsibilities: Collaborate with cross-functional teams to design, implement, and maintain...


  • Washington, DC, USA, United States Cape Full time

    About Cape: Cape is a pioneering company that's redefining the boundaries of privacy and national security in the wireless industry. Founded in 2022 by a team of experts from Palantir and Anduril, we're driven by a passion for creating a more secure and private mobile experience. The Role: We're seeking a highly skilled Site Reliability Engineer to join our team....


  • Washington, DC, USA, United States TEKsystems Full time

    Job Summary: TEKsystems is seeking a highly skilled DevOps/Site Reliability Engineer to join a large-scale migration project at one of Japan's largest financial institutions. This is an exciting opportunity to be part of a critical project within the organization, driving the architecture and setup of pipelines for migrations. Key Responsibilities: Design and...


  • Washington, DC, USA, United States Splunk Full time

    About Splunk: Splunk is a leading provider of unified security and observability platforms, helping enterprises build a safer and more resilient digital world. Our mission is to empower organizations to keep their digital systems secure and reliable, and we're committed to creating a culture of belonging and diversity. Job Summary: We're seeking a highly skilled...


  • Washington, DC, USA, United States TEKsystems Full time

    Job Summary: We are seeking an experienced Senior Site Reliability Engineer/DevOps Engineer with a minimum of 8 years of expertise to join our dynamic engineering team. About the Role: As a Senior SRE/DevOps, you will be instrumental in ensuring the availability, performance, and reliability of our systems, with a strong emphasis on security...


  • Washington, DC, USA, United States ASM Research Full time

    Job Title: Site Reliability Engineer. The Site Reliability Engineer will be a key member of our Technical Operations team, responsible for developing and maintaining tools, alerts, and dashboards to support monitoring application health and performance. The ideal candidate will be familiar with monitoring tools such as Splunk, AppDynamics, Dynatrace,...


  • Washington, DC, USA, United States Mount Indie Full time

    Job Overview: Mt. Indie is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems. Key Responsibilities: Monitor and analyze system performance, identifying areas for improvement and implementing...




  • Washington, DC, USA, United States Palantir Technologies Full time

    Job Summary: We are seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications. Key Responsibilities: Collaborate with cross-functional teams to ensure the reliability, scalability, and...