Senior IT Site Reliability Engineer

3 months ago


New York, United States Hudson River Trading Full time

Hudson River Trading (HRT) is looking for a Senior IT Site Reliability Engineer to join our growing IT Solutions Delivery team. This team is responsible for developing and maintaining the corporate productivity stack for the entire firm, both on-prem and in the cloud. As a Senior IT SRE, you will ensure the availability and reliability of systems within this stack and grow our engineering practice in alignment with the firm’s larger engineering organization.

This role requires a deep Linux operating system and application administration skill set, proficiency in Python, and solid experience with configuration management/IaC. Successful candidates should also have exceptional organizational, communication, and project management skills, as well as the ability to troubleshoot complex technical issues.

Responsibilities

  • Manage on-premise containerized web services
  • Automate and troubleshoot a broad range of technical infrastructure
  • Design and operate secure, reliable systems
  • Develop and implement monitoring solutions to ensure high system uptime and reliability; utilize tools to detect and resolve issues proactively
  • Document system architecture, processes, and best practices
  • Break down complexity, iterate, and communicate progress to a wide variety of leads and stakeholders
  • Assist with the administration of DHCP and DNS for both on-premise and external systems and applications

Qualifications

  • 5+ years of experience in site reliability engineering or related disciplines
  • Proficiency with Python
  • Experience managing and monitoring containerized infrastructure
  • Experience working with CI/CD tools such as Jenkins, GitHub Actions, or ArgoCD
  • Expert experience with IaC and configuration management tools such as Terraform, SaltStack, Chef, Puppet, or Ansible

Nice-to-haves:

  • Experience building and operating systems on cloud platforms (e.g. AWS, Azure, GCP)
  • OpenLDAP or other directory services management expertise
  • Atlassian Data Center administration experience (on-prem)
  • Web development experience

Annual base salary range of $150,000 to $250,000. Pay (base and bonus) may vary depending on job-related skills and experience. A sign-on and discretionary performance bonus may be provided as part of the total compensation package, in addition to company-paid medical and/or other benefits.

Culture

Hudson River Trading (HRT) brings a scientific approach to trading financial products. We have built one of the world's most sophisticated computing environments for research and development. Our researchers are at the forefront of innovation in the world of algorithmic trading. At HRT we welcome a variety of expertise: mathematics and computer science, physics and engineering, media and tech. We’re a community of self-starters who are motivated by the excitement of being at the cutting edge of automation in every part of our organization, from trading, to business operations, to recruiting and beyond. We value openness and transparency, and celebrate great ideas from HRT veterans and new hires alike. At HRT we’re friends and colleagues – whether we are sharing a meal, playing the latest board game, or writing elegant code. We embrace a culture of togetherness that extends far beyond the walls of our office.

Feel like you belong at HRT? Our goal is to find the best people and bring them together to do great work in a place where everyone is valued. HRT is proud of our diverse staff; we have offices all over the globe and benefit from our varied and unique perspectives. HRT is an equal opportunity employer; so whoever you are we’d love to get to know you.



  • New York, United States Apollo Solutions Full time

    Site Reliability Engineer - Web3. Apollo Solutions have partnered with an innovative web3 start-up backed by top-tier venture capital with a strong runway. They are looking to revolutionize the way we think about the application of web3 and have already made significant inroads into the gaming, entertainment and finance industries. In this role, you will...


  • New York, United States Grafbase, Inc. Full time

    We are looking for a Site Reliability Engineer to join our Engineering team. As an SRE, you will play a crucial role in ensuring the reliability, availability, and performance of our systems and services. You will collaborate, design, implement, and maintain infrastructure and automation solutions, supporting the continuous improvement of our platform's...


  • New York, United States MarketAxess Full time

    About Us: MarketAxess is on a journey to digitally transform one of the world’s largest financial markets, enabling the shift from analog, phone-based trading to a fully electronic marketplace. Why does this matter? Because our platform makes trading fixed-income more accessible, ultimately improving transparency, efficiency, and competition in the...


  • New York, United States Nationstaff Full time

    About This Role We are seeking a talented Site Reliability Engineer with experience in building and maintaining continuous integration, automating programmatic tasks, deploying applications, configuration management, and monitoring and maintaining the uptime of the platform. The Site Reliability Engineer will be an expert in Linux, is passionate about open...


  • New York, United States Cockroach Labs Full time

    Databases are the beating heart of every business in the world. Cockroach Labs is the creator of CockroachDB, the most highly evolved cloud-native, distributed SQL database on the planet that scales fast, survives anything, and thrives anywhere. We created CockroachDB to unshackle teams from the constraints of their database. Join us on our mission to...


  • New York, United States STAND 8 Technology Services Full time

    STAND 8 provides end to end IT solutions to enterprise partners across the United States and with offices in Los Angeles, New York, New Jersey, Atlanta, and more including internationally in Mexico and India. Are you passionate about Building Management Systems (BMS) and creating best-in-class building environments driven by best-in-class software and...


  • New York, New York, United States Oscar Health Full time

    About the Role: Oscar Health is a cutting-edge health insurance company that's revolutionizing the industry. As a Site Reliability Engineer II, you'll be a key member of our SRE team, responsible for building and maintaining scalable, highly reliable software systems. With a focus on bridging the gap between development and operations teams, you'll work closely...


  • New York, United States Transfinder Full time

    The Junior Site Reliability Engineer (SRE) works to ensure Transfinder provides clients with the best-hosted software experience possible. The SRE works collaboratively between Development and Operations to scale, secure, monitor, and maintain cloud infrastructure for running Transfinder products. Utilizing AWS and other cloud technologies, this position...


  • New York, New York, United States Fidelity Information Services Full time

    Company Overview: Fidelity Information Services is a leading provider of financial services and technology solutions. Our mission is to empower our clients with innovative and reliable systems. Salary: The estimated annual salary for this position is $31,200. Job Description: We are seeking an experienced Site Reliability Engineer to join our team. As a key member...


  • New York, United States Insight Global Full time

    Job Description: Our client is looking for 1 remote Site Reliability Engineer to join their engineering organization. They will be responsible for investigating issues within broadcast playout systems and their integration points to find the root cause of problems or systemic issues. They must have at least 2 years of experience...


  • Senior Site Engineer

    4 weeks ago


    New York, New York, United States Axionova Engineering Limited Full time

    Company Overview: Axionova Engineering Ltd. is a leading provider of innovative engineering solutions across various sectors, including structural, civil, environmental, and project management disciplines. Our services are built on a commitment to precision, efficiency, and sustainability, ensuring that every project exceeds client expectations. About the...


  • New York, New York, United States STAND 8 Technology Services Full time

    At STAND 8 Technology Services, we're on a mission to impact the world positively by creating success through PEOPLE, PROCESS, and TECHNOLOGY. We're seeking a dedicated Site Reliability Engineer with a strong focus on Building Management System (BMS) software, specifically Tridium Niagara. The ideal candidate will have a deep level of experience in setting up...


  • New York, United States Capital One Full time

    Plano 2 (31062), United States of America, Plano, Texas. Director, Technical Program Management - Site Reliability Engineering. Are you interested in leading programs that deliver on critical business goals and build large scale products & platforms? About Capital One: At Capital One, we're changing banking for good. We were founded on the belief that no one...


  • New York, United States AKRF Full time

    Founded in 1981, AKRF is an award-winning consulting firm with 400 professionals bringing the value of strategic thinking to our land development, transportation, energy, and water clients. We serve our clients through a commitment to technical excellence across our portfolio of professional services, while always acting with integrity and fostering a...


  • New York, United States Diverse Lynx Full time

    Job Title: SRE - Site Reliability Engineer. Location: New York, NY (Onsite). Full-time opportunity. Minimum Experience: 5-10 years. Job Description: Should have cloud engineering experience and act as the SME on operation automation and monitoring, identifying toil within the team's existing systems and processes, and implementing automated solutions...


  • New York, New York, United States Capital One Full time

    Capital One Reliability Engineer Role: We are seeking a skilled Lead Reliability Engineer to join our team at Capital One. As a key member of our engineering group, you will play a critical role in designing and implementing reliable systems that meet the needs of our customers. About the Job: Collaborate with Agile teams to design, develop, test, implement, and...


  • New York, New York, United States Clear Corporate Services LLC Full time

    At Clear Corporate Services LLC, we're shaping the future of digital identification and authentication. As a Senior Software Engineer, Observability, you'll play a pivotal role in establishing our Site Reliability Engineering (SRE) function. You'll accelerate the development and scaling of innovative systems that support our growing identity platform. You...