Systems Reliability Engineer

7 days ago


New York, New York, United States mthree Recruiting Portal Full time
SRE - Leading Investment Bank

This role supports Institutional Securities and Wealth Management brokerage Operations platforms which include diverse technologies hosted by on premises and cloud platforms.

The role is expected to perform day to day support for the business alongside reliability engineering tasks.

The role has an emphasis on improving the reliability of our systems by working with the Software developers and Infrastructure engineering teams to develop automated reliability solutions.

Responsibilities:
  • Production management, inclusive of: incident and problem management, capacity management, monitoring, event management, change management, and plant hygiene.
  • Troubleshooting issues across the entire technology stack: hardware, software, application, and network.
  • Participating in on-call rotation and periodic conference calls with other specialists from other time zones.
  • Proactively identifying and addressing system reliability risks.
  • Working closely with development teams to design, build, and maintain systems from a reliability, stability, and resiliency perspective.
  • Identifying and driving opportunities to improve automation for our platforms; scope and create automation for deployment, management, and visibility of our services.
  • Representing the RPE organization in design reviews and operational readiness exercises for new and existing products/services.
Experience:
  • Demonstrated ability to troubleshoot problems and debug to identify root cause on large-scale distributed applications across multiple layers, i.e. software, Infrastructure and database.
  • Hands on experience on enterprise tools such as Prometheus, Grafana, Splunk, Apica
  • Hands-on experience of UNIX / Linux system support and Cloud based services.
  • Experience with Ansible, GitHub or any automation/configuration/release management tools
  • Automation-related experience is particularly valued using scripting languages such as python, bash, perl, ruby. One higher level language is desired.
  • Creating stored procedures and optimising SQL in Sybase or DB2.
  • Experience of Azure Networks, ServiceBus, Azure Virtual Machines and AzureSQL will be an advantage.


  • New York, New York, United States Edward Daniels Group Full time

    Reliability EngineerWe are seeking a skilled Reliability Engineer to join our team at Edward Daniels Group. As a key member of our technology team, you will be responsible for designing, implementing, and maintaining our cloud-based infrastructure.Key Responsibilities:Program with Python, Java, C/C++, or Go to develop and maintain our software...


  • New York, New York, United States mthree Recruiting Portal Full time

    SRE - Leading Investment BankThis role supports Institutional Securities and Wealth Management brokerage Operations platforms which include diverse technologies hosted by on premises and cloud platforms.The role is expected to perform day to day support for the business alongside reliability engineering tasks.The role has an emphasis on improving the...


  • New York, New York, United States Amtex Systems Inc. Full time

    Job Title: Site Reliability EngineerLocation: Midtown, NY (Hybrid)Duration: Full-TimeJob Summary:We are seeking a seasoned Site Reliability Engineer with extensive experience working with AWS to join our team at Amtex Systems Inc. The ideal candidate will have a strong background in architecting, implementing, and managing monitoring tools such as...


  • New York, New York, United States Hudson River Trading Full time

    About the RoleHudson River Trading (HRT) is seeking a highly skilled Senior IT Systems Reliability Engineer to join our IT Solutions Delivery team. This team is responsible for developing and maintaining the corporate productivity stack for the entire firm, both on-prem and in the cloud.Key ResponsibilitiesManage on-premise containerized web services and...

  • Reliability Engineer

    2 weeks ago


    New York, New York, United States Alchemy Full time

    About AlchemyAlchemy is a world-class developer platform that empowers innovators to build on the blockchain. Our mission is to democratize access to blockchain technology, making it easy for developers to create and deploy decentralized applications.The RoleWe're seeking a seasoned Infrastructure Engineer to join our team and drive reliability initiatives...

  • Reliability Engineer

    4 weeks ago


    New York, New York, United States Alchemy Full time

    About the RoleWe're seeking a highly skilled Reliability Engineer to join our Infrastructure team at Alchemy. As a key member of our team, you will play a critical role in designing, deploying, and continuously improving the infrastructure that supports our globally used developer platform.Key ResponsibilitiesDevelop and own company-wide reliability best...


  • New York, New York, United States Tik Tok Full time

    About the RoleWe're seeking a skilled Site Reliability Engineer to join our AML team, where you'll play a critical role in designing, building, and maintaining highly available, scalable, and fault-tolerant systems. Your expertise in analyzing and troubleshooting Linux-based distributed systems will be invaluable in ensuring the reliability and performance...

  • Reliability Engineer

    2 weeks ago


    New York, New York, United States International Flavors and Fragrances Full time

    Job SummaryDemonstrates expertise in multiple technology areas or applications, utilizing technical knowledge to develop value-creating solutions for our products or production assets.Functions independently in area of technical expertise; initiates and drives programs and identifies needed resources; receives limited guidance from supervisor; has...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and...


  • New York, New York, United States Unreal Gigs Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Unreal Gigs. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems.Key Responsibilities:Design, implement, and maintain scalable infrastructure solutions to support...


  • New York, New York, United States ADP Full time

    About ADPADP is a global leader in HR technology, offering the latest AI and machine learning-enhanced payroll, tax, HR, benefits, and more. We believe our people make all the difference in cultivating an inclusive, down-to-earth culture that welcomes ideas, encourages innovation, and values belonging.Job DescriptionWe are seeking a Site Reliability Engineer...


  • New York, New York, United States Motion Recruitment Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Motion Recruitment. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, as well as collaborating with cross-functional teams to drive innovation and improvement.Key Responsibilities:Design,...


  • New York, New York, United States Unreal Gigs Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our tech startup, Unreal Gigs, specializing in infrastructure and authorization solutions.As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, availability, and performance of our systems. Your responsibilities will include designing,...


  • New York, New York, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to millions of users worldwide.Our mission is to provide a secure and reliable platform for users to express themselves, learn, and be entertained.Role OverviewWe are seeking a skilled Site Reliability Engineer to join our U.S....


  • New York, New York, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to millions of users worldwide.Our mission is to provide a secure and reliable platform for users to express themselves, learn, and be entertained.Site Reliability Engineering at TikTokAs a Site Reliability Engineer at TikTok, you...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: SRE - Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and...


  • New York, New York, United States FLOAT LLC Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Float LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure, enabling our engineering teams to focus on delivering high-quality software to our customers.Key...


  • New York, New York, United States Grafbase, Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Engineering team at Grafbase, Inc.As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems and services.Key ResponsibilitiesCollaborate with cross-functional teams to develop and deploy software...


  • New York, New York, United States Alchemy Full time

    About the RoleAlchemy is seeking a highly skilled Site Reliability Engineer to join our Infrastructure team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our globally used developer platform.Our mission is to empower builders with the tools they need to create exceptional on-chain products....


  • New York, New York, United States Diverse Lynx Full time

    About the Role:Diverse Lynx is seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and efficiency of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and improve system reliabilityDevelop and maintain...