Site Reliability Engineer

2 days ago


Jersey City, New Jersey, United States City National Bank Full time
Site Reliability Engineer

At City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.

Key Responsibilities:
  • Implement solutions that improve stability, security, scalability, and availability of our software platforms
  • Design mechanisms for proactive alerts and responses to identify and address reliability risks
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, planning, and reviews
  • Perform diverse engineering activities for performance tuning, monitoring, deployment, and production support
  • Design, build, and manage SLIs, SLOs, and Error budgets for Availability, Performance/Latency, and Throughput for critical services running in production
  • Be a proponent of using the SRE core principles in driving product velocity
  • Create educational documentation on how-to's and blog about use-cases and architectures that relate to cloud platforms and observability
  • Co-ordinate code reviews with goals of continuous improvement in design, build, and architectural practices
  • Prepare programming specifications from which programs will be written, and designs, codes, tests, debugs, and documents programs
  • Liaise with the team managing our public cloud environments, including setup, management, and troubleshooting
  • Design and create solutions to test application resiliency using chaos engineering, fail-over scenarios, and capacity analysis to reduce MTTR (Mean Time to resolve) and MTBF (Mean Time between Failures) to minimize client impact
  • Create and maintain application system overviews and technical documentation
Requirements:
  • Bachelor's Degree or equivalent
  • Minimum 5 years of experience in an Operational role, DevOps, SRE, or Software Engineering
  • Minimum 5 years of experience developing applications with an active user base, and deploying to production and going through any change management process
  • Minimum 3 years of experience doing development in any of.NET Core, Java, NodeJS, Python
  • Minimum 3 years of experience with development or administration on any cloud platforms (Cloud Foundry, Heroku, AWS, Azure, Google Cloud, IBM Cloud, Bluemix, Kubernetes, and others)
Preferred Skills and Knowledge:
  • Experience with development or administration on any cloud platforms like AWS, Azure, etc.
  • Experience with Elasticsearch/Splunk
  • Experience with Monitoring tools such as Dynatrace, Datadog, AppDynamics, etc.
  • Creativity, energy, and passion for leveraging technology to transform our industry
  • Design & Develop APIs and UIs to help make use of large data sets, infrastructure, and user experience
  • A good understanding of modern, cloud-centric architectures and DevOps principles
  • Experience with the operational aspects of software systems such as monitoring, centralized logging, and alerting
  • Providing standardized offerings to facilitate and ensure operational health of stacks throughout their lifecycle
Compensation:

Starting base salary: $87,027 - $138,965 per year. Exact compensation may vary based on skills, experience, and location. This job is eligible for bonus and/or commissions.



  • Jersey City, New Jersey, United States Open Systems Technologies Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Open Systems Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our distributed systems.Key Responsibilities:Design, implement, and maintain distributed systems to ensure...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Jersey City, New Jersey, United States CyberTec Full time

    Site Reliability EngineerCyberTec is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructureDevelop and maintain monitoring and...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Jersey City, New Jersey, United States Aloden, Inc. Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Aloden, Inc. in Jersey City, New Jersey. As a Site Reliability Engineer, you will be responsible for ensuring the stability and reliability of our Onyx blockchain platform in production.Key Responsibilities:Ensure the stability and reliability of...


  • Jersey City, New Jersey, United States Goldman Sachs Full time

    About This RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our post-execution processing platforms, which handle trade processing, internal firm/firm trades, and client delivery across physical and synthetic...


  • Jersey City, New Jersey, United States Goldman Sachs Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our production systems, handling issues, managing incidents, and providing support to our users.ResponsibilitiesOwn production processes, handling...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job Title: Principal Site Reliability EngineerAt Fidelity Investments, we're seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team. As a key member of our team, you'll work closely with our engineering partners to drive initiatives from design to implementation, ensuring the reliability and scalability of our...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability Engineer - AWSWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly on AWS.Key Responsibilities:Design, implement, and maintain scalable and...


  • Jersey City, New Jersey, United States The Goldman Sachs Group Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at The Goldman Sachs Group. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Key ResponsibilitiesDevelop and...


  • Jersey City, New Jersey, United States Hispanic Technology Executive Council Full time

    Job DescriptionAt Hispanic Technology Executive Council, we are committed to delivering exceptional results through the power of technology. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and observability of our services.Key ResponsibilitiesPartner with engineering and technology teams to improve reliability and...


  • Jersey City, New Jersey, United States Bank of America Full time

    Job Title: Site Reliability EngineerAt Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection.As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and observability of our services. You will partner with engineering and technology teams to improve the...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job Overview:About the RoleFidelity Investments is seeking a highly skilled Principal Site Reliability Engineer to join our Technical Operations team. As a key member of our team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high availability, scalability, and security.Key ResponsibilitiesDesign and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    The RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our Technical Operations team at Fidelity Investments. As a member of this team, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based infrastructure. You will work closely with our engineering partners to design, implement,...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job SummaryWe are seeking a highly skilled Principal Site Reliability Engineer to join our TechOps team at Fidelity Investments. As a key member of our team, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement highly available, secure, and scalable...


  • Jersey City, New Jersey, United States Royal Bank of Canada Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Royal Bank of Canada. As a key member of our Technology and Operations group, you will be responsible for designing, implementing, and maintaining scalable and reliable systems to support our business applications.Key ResponsibilitiesDesign and implement...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    We are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using AWS servicesDevelop and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team at Fidelity Investments. As a member of this team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation.Key ResponsibilitiesDesign and implement highly available, secure, scalable...


  • Jersey City, New Jersey, United States RBC Capital Markets, LLC Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at RBC Capital Markets, LLC. As a key member of our Application Support team, you will be responsible for ensuring the reliability and performance of our applications and infrastructure.Key ResponsibilitiesPerform application production support, including off-hours...