Site Reliability Engineer

4 days ago


Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full time

About the Role

We are seeking a talented Site Reliability Engineer to join our SRE Platforms team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.

Our team is responsible for designing and building highly scalable tools that provide metrics and monitoring, log collection and analysis, and tracing. These tools are used by thousands of engineers every day, and we believe that reliability is the most important feature of any system.

Your Responsibilities

  • Develop, configure, deploy, and maintain large-scale distributed systems to handle various aspects of observability at a global scale.
  • Work with customers, product managers, and SRE experts to define observability product features and drive their requirements and implementation.
  • Run production environments spanning multiple cloud providers and on-prem data centers.

Requirements

  • 5+ years of relevant work experience.
  • In-depth understanding of DevOps and SRE concepts.
  • Expertise in developing and maintaining AWS cloud infrastructure (EC2, NLB, ALB, CLB, VPC, Private link, IAM).
  • Hands-on experience with DevOps tools like Terraform, Ansible, GitLab, and Jenkins.
  • Experience with observability platforms like Splunk or Elasticsearch.
  • Experience in scripting using Shell, PowerShell, and Python.
  • Strong communication skills, both verbal and written.
  • Comfortable with technical ownership, managing multiple stakeholders, and working as part of a global team.

About Goldman Sachs

At Goldman Sachs, we commit our people, capital, and ideas to help our clients, shareholders, and the communities we serve to grow. We believe who you are makes you better at what you do, and we're committed to fostering and advancing diversity and inclusion in our own workplace and beyond.



  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Jersey City, New Jersey, United States CyberTec Full time

    Site Reliability EngineerCyberTec is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructureDevelop and maintain monitoring and...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Your...


  • Jersey City, New Jersey, United States City National Bank Full time

    Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that improve...


  • Jersey City, New Jersey, United States Goldman Sachs Full time

    About This RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our post-execution processing platforms, which handle trade processing, internal firm/firm trades, and client delivery across physical and synthetic...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job Title: Principal Site Reliability EngineerAt Fidelity Investments, we're seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team. As a key member of our team, you'll work closely with our engineering partners to drive initiatives from design to implementation, ensuring the reliability and scalability of our...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    We are seeking a highly skilled AWS Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly our AWS environment.The ideal candidate will have strong experience with AWS, with a focus on SRE principles...


  • Jersey City, New Jersey, United States The Goldman Sachs Group Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at The Goldman Sachs Group. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Key ResponsibilitiesDevelop and...


  • Jersey City, New Jersey, United States Hispanic Technology Executive Council Full time

    Job DescriptionAt Hispanic Technology Executive Council, we are committed to delivering exceptional results through the power of technology. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and observability of our services.Key ResponsibilitiesPartner with engineering and technology teams to improve reliability and...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    We are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly on AWS. Your strong AWS experience and 2-3 years of recent experience will be invaluable in this role.The ideal...


  • Jersey City, New Jersey, United States Bank of America Full time

    Job Title: Site Reliability EngineerAt Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection.As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and observability of our services. You will partner with engineering and technology teams to improve the...


  • Jersey City, New Jersey, United States Royal Bank of Canada Full time

    Job SummaryAt Royal Bank of Canada, we're seeking a highly skilled Lead Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you'll be responsible for designing, implementing, and maintaining scalable, reliable, and efficient systems that meet the needs of our customers.Key ResponsibilitiesDesign and...


  • Jersey City, New Jersey, United States Royal Bank of Canada Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Royal Bank of Canada. As a key member of our Technology and Operations group, you will be responsible for designing, implementing, and maintaining scalable and reliable systems to support our business applications.Key ResponsibilitiesDesign and implement...


  • Jersey City, New Jersey, United States The Dignify Solutions LLC Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at The Dignify Solutions LLC. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our cloud-based infrastructure. Your expertise in cloud platforms, automation tools, and security fundamentals will be crucial in...


  • Jersey City, New Jersey, United States Fidelity TalentSource LLC Full time

    Job Summary:We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Fidelity Digital Assets. As a key member of our TechOps SRE team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation.The Role:As a Principal Site Reliability Engineer, you will be responsible...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job Title: Principal Site Reliability EngineerThe Role:As a member of the TechOps SRE team at Fidelity Investments, you will work closely with our engineering partners to enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes environments are best-in-class and central to our enterprise-grade infrastructure...


  • Jersey City, New Jersey, United States Fidelity TalentSource LLC Full time

    Job Title: Principal Site Reliability EngineerJob Summary:Fidelity Digital Assets is seeking a highly skilled Principal Site Reliability Engineer to join our Technical Operations team. As a key member of our team, you will be responsible for designing, implementing, and maintaining highly available, secure, and scalable cloud infrastructure on AWS. You will...


  • Jersey City, New Jersey, United States The Dignify Solutions LLC Full time

    Job SummaryWe are seeking a highly experienced Site Reliability Engineer Leader to join our team at The Dignify Solutions LLC. The ideal candidate will have a strong background in building and running applications in production with uptime over 99%.Key ResponsibilitiesDesign and implement large-scale Reliability & Observability Programs for complex...