Current jobs related to Cloud Operations Reliability Engineer - New York, New York - CLS Group


  • New York, New York, United States Syndio Full time

    Job Title: Cloud Reliability Engineering ManagerAt Syndio, we're seeking a seasoned Cloud Reliability Engineering Manager to join our team. As a key member of our engineering leadership, you will be responsible for defining and driving a vision for cloud reliability engineering, aligning it with broader engineering and business goals.About the RoleThis is a...


  • New York, New York, United States Citigroup Inc Full time

    Job Title: Cloud Security Site Reliability EngineerCitigroup Inc. is seeking a highly skilled Cloud Security Site Reliability Engineer to join our team. As a key member of our Cloud Security team, you will be responsible for ensuring the security and reliability of our cloud-based systems and applications.Job Summary:The Cloud Security Site Reliability...


  • New York, New York, United States Syndio Full time

    Job Title: Cloud Reliability Engineering ManagerAt Syndio, we're seeking a seasoned Cloud Reliability Engineering Manager to join our team. As a key member of our engineering leadership, you will be responsible for defining and driving a vision for cloud reliability engineering, aligning it with broader engineering and business goals.About the RoleThis is a...


  • New York, New York, United States Celonis GmbH Full time

    About the RoleCelonis is seeking a highly skilled Cloud Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing, implementing, and managing cloud-based applications and platforms that meet our high standards for reliability and scalability.Key ResponsibilitiesDesign and implement cloud-based applications...


  • New York, New York, United States Forhyre Full time

    Job Title: Cloud Service Reliability EngineerWe are seeking a skilled Cloud Service Reliability Engineer to join our team at Forhyre. As a Cloud Service Reliability Engineer, you will be responsible for designing, implementing, and maintaining systems that ensure the reliability and efficiency of our cloud-based services.Key Responsibilities:Develop and...


  • New York, New York, United States Syndio Full time

    Job DescriptionEmpower organizations to achieve fairness and equity in the workplace by joining Syndio, a Series-C technology company committed to creating diverse and inclusive workplaces. As a Cloud and Site Reliability Engineering Manager, you will play a critical role in defining and driving a vision for the organization's cloud platform, ensuring it is...


  • New York, New York, United States Forhyre Full time

    Job Title: Cloud Service Reliability EngineerWe are seeking a skilled Cloud Service Reliability Engineer to join our team at Forhyre. As a Cloud Service Reliability Engineer, you will be responsible for designing, implementing, and maintaining systems on premise or in the cloud, focusing on identity and access management, cloud computing...


  • New York, New York, United States Syndio Full time

    Job Title: Cloud and Site Reliability Engineering ManagerAt Syndio, we're seeking a highly skilled Cloud and Site Reliability Engineering Manager to join our team. As a key member of our engineering leadership, you will be responsible for defining and driving a vision for our cloud platform, ensuring it is scalable, reliable, and secure.About the RoleThis is...


  • New York, New York, United States Syndio Full time

    Job OverviewSyndio is seeking a highly skilled Cloud and Site Reliability Engineering Manager to join our team. As a key member of our engineering leadership team, you will be responsible for leading a team of SREs, PE's, and COE's in designing, implementing, and operating production systems using best practices in automation, monitoring, and...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer - Cloud Expert Job Summary: We are seeking a highly skilled Site Reliability Engineer with expertise in cloud engineering to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems. Responsibilities: *...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: Cloud Operations EngineerAt Diverse Lynx, we're seeking a highly skilled Cloud Operations Engineer to join our team. As a key member of our engineering team, you will be responsible for ensuring the reliability and performance of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and improve system...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: SRE - Site Reliability EngineerLocation:New York, NY (Onsite)Full-time OpportunityMinimum Experience:10+ yearsJob Description:We are seeking a highly skilled Site Reliability Engineer with expertise in cloud engineering to join our team at Diverse Lynx LLC. As a key member of our team, you will be responsible for designing and implementing...

  • Cloud Engineer

    7 days ago


    New York, New York, United States Diverse Lynx Full time

    Job Title: Cloud Site Reliability EngineerLocation: New York, NY (Onsite)Full time OpportunityMinimum Experience: 5-10 YearsJob DescriptionWe are seeking a skilled Cloud Site Reliability Engineer to join our team at Diverse Lynx LLC. The ideal candidate will have experience in cloud engineering, operation automation, and monitoring. They will be responsible...


  • New York, New York, United States AEG Full time

    Job DescriptionWe are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at AEG. As a key member of our Cloud Operations team, you will be responsible for leading and mentoring our SRE and TechOps teams with a focus on automation to drive accountability, efficiency, and continuous improvement.Key ResponsibilitiesBuild and Maintain...


  • New York, New York, United States Palantir Technologies Full time

    Job SummaryWe are seeking a skilled Site Reliability Engineer to join our team at Palantir Technologies. As a Site Reliability Engineer, you will be responsible for designing, deploying, and operating high-performance, scalable, and reliable services for our production infrastructure, across both cloud and on-prem environments.Key ResponsibilitiesMaintain...


  • New York, New York, United States Betterment Full time

    About the RoleWe are seeking a highly skilled Cloud Reliability Engineer to join our team at Betterment. As a Staff Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based systems.Key ResponsibilitiesDesign and implement scalable and reliable cloud native solutions using AWSDevelop...


  • New York, New York, United States Valstro Full time

    Job Title: Site Reliability EngineerValstro is a FinTech partnership working to deliver next-gen, Cloud-First, trading solutions to global, multi-asset-class institutional clients. We are a "people-first" company, and all the value that we bring to clients will come from the efforts of a collaborative, motivated and well-supported team.The applications that...


  • New York, New York, United States City National Bank Full time

    Job SummaryCity National Bank is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key ResponsibilitiesImplement solutions that improve stability, security, scalability,...

  • Reliability Engineer

    2 weeks ago


    New York, New York, United States Capital One Services, LLC Full time

    Job Title: Lead Reliability EngineerCapital One Services, LLC is seeking a highly skilled Lead Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, developing, and implementing technical solutions to ensure the reliability and scalability of our cloud-based systems.Key...


  • New York, New York, United States City National Bank Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Design and implement solutions...

Cloud Operations Reliability Engineer

2 months ago


New York, New York, United States CLS Group Full time

About CLS Group

CLS Group stands as a pivotal entity within the global foreign exchange (FX) ecosystem. Trusted by numerous counterparties, CLS enhances the safety, efficiency, and cost-effectiveness of FX transactions. Each day, trillions of dollars in currency are processed through our advanced systems.

Our globally recognized settlement infrastructure, developed by market participants for market participants, significantly mitigates systemic risk while providing standardization for those engaged in the world's most actively traded currencies. By implementing multilateral netting, we achieve remarkable efficiencies, reducing funding requirements by over 96% on average, allowing clients to allocate their capital and resources more effectively.

CLS's suite of products is tailored to empower clients in managing risk throughout the entire FX lifecycle, whether through streamlined processing tools or insightful market intelligence derived from the largest single source of executed FX data available.

Our commitment to making a positive impact begins with our workforce. Our core values – Protect, Improve, Grow – are the foundation of our operations, fostering a supportive and inclusive workplace where every individual is encouraged to think innovatively and openly.

Position Overview

This role is primarily focused on the application of Site Reliability Engineering (SRE) methodologies within a cloud-hosted environment. Additionally, it serves as a central expertise point for SRE automation within the Platform Operations team.

Key Responsibilities

  • Implement SRE methodologies in the cloud environment, including the automation of repetitive tasks and the definition and implementation of Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
  • Establish SRE as a core practice within the Cloud team, collaborating closely with Infrastructure Engineering to enhance observability and telemetry, ensuring that cloud services are equipped with appropriate service metrics and monitoring.
  • Develop GitOps practices for the cloud environment utilizing tools such as Terraform and Ansible, acting as a liaison between Engineering and Cloud Operations to fully integrate Infrastructure as Code for all new cloud deployments.
  • Provide escalation support for cloud and automation-related issues, prioritizing production stability at all times.
  • Identify and address risks and stability issues in the cloud environment through SRE best practices, contributing to incident postmortems.

Education and Qualifications

  • Bachelor's degree or equivalent experience.
  • Industry-standard IT certifications preferred (e.g., AWS, Microsoft, VMware, Redhat Linux).

Experience Requirements

  • Strong technical operational support experience within an infrastructure services team, ideally with a focus on cloud-hosted environments.
  • Proficient in automation technologies, particularly Terraform and Ansible, with the ability to implement Infrastructure as Code through GitOps methodologies.
  • Familiarity with at least one scripting language, preferably Python or PowerShell.
  • A minimum of 2 years of experience applying SRE methodologies within a support team, with a solid understanding of service level metrics.
  • Experience with Application Performance Monitoring (APM) tools (e.g., Grafana, Datadog, Dynatrace).
  • Background in regulated financial services or banking organizations.

Special Skills and Knowledge

  • Ability to understand and utilize at least one cloud service provider, such as AWS or Azure.
  • Possesses a strong service-oriented mindset, consistently delivering high-quality service to the business.
  • Effective communication skills with both business and technical personnel at all levels.
  • Proactive approach with the ability to provide regular updates to management and stakeholders.