Database SRE
1 week ago
Hudson River Trading (HRT) is looking for a Database SRE to join our growing Research & Development team. This team builds and maintains an exceptionally large and growing distributed compute cluster, a petabyte-scale storage layer, operating systems, automation software, and development tools. Much of our hardware layer and operating system layer reflect the state-of-the-art at any given time.
We are looking for an experienced Database SRE with solid Linux systems engineering skills. You’ll be at the forefront of designing, building, and maintaining HRT’s diverse production database infrastructure, focusing on performance, scalability, and reliability. This is a job for an engineer looking to make a real difference working with a small, focused team of like-minded database and systems engineers.
Responsibilities
- Design, build, and maintain HRT’s SQL and NoSQL database infrastructure, supporting a globally distributed trading infrastructure and extending to large-scale research
- Configure and scale PostgreSQL and MySQL servers for trade-critical, real-time workflows
- Investigate and improve performance of PostgreSQL and MySQL queries and supporting infrastructure (Debian Linux)
- Transform the metrics and alerting landscape with Prometheus, VictoriaMetrics, and Grafana
- Architect and deploy Redis based key/value stores for current and future use-cases
- Engineer and apply infrastructure-as-code tools and services (Salt, Terraform, and others)
- Build and test proof of concept datastores to fit consumer use cases
Qualifications
- 5+ years of experience working in database management and infrastructure engineering
- Proven expertise in designing and implementing production database infrastructure, from security and replication to observability (monitoring, alerting, logging)
- Solid familiarity with state-of-the-art DB technologies and tradeoffs (distributed DBs, time-series DBs, relaxed consistency models, columnar storage)
- Experience with dynamic clustering solutions such as Patroni
- Strong knowledge and experience with Linux (especially Debian)
- Extensive experience with PostgreSQL or MySQL
- Solid programming with the ability to write Python with an understanding of data structures and the principles of software design
- Performance troubleshooting, and networking knowledge
- Experience with Redis, Prometheus, VictoriaMetrics and Grafana preferred
- Exceptional communication and project management skills - this role will require cross-collaboration with various stakeholders across HRT
- You are resourceful, have a sense of urgency, and are motivated to make things better
-
SRE/DevOps Engineer
4 months ago
New York, United States Open Systems Technologies Full timeA financial firm is looking for an SRE/DevOps Engineer to join their team in New York, NY.Compensation: $150-200kResponsibilitiesDesign, implement, and manage AWS cloud infrastructure using Terraform and CloudFormationDevelop and maintain CI/CD pipelines using GitLab for seamless code deployment and integrationCollaborate with blockchain engineers to ensure...
-
SRE/DevOps Engineer
1 week ago
New York, United States Open Systems Technologies Full timeA financial firm is looking for an SRE/DevOps Engineer to join their team in New York, NY.Compensation: $150-200kResponsibilitiesDesign, implement, and manage AWS cloud infrastructure using Terraform and CloudFormationDevelop and maintain CI/CD pipelines using GitLab for seamless code deployment and integrationCollaborate with blockchain engineers to ensure...
-
SRE / Applications Support Engineer
2 weeks ago
New York, United States Aloden, Inc. Full timeSRE / Applications Support Engineer Location: New York (Hybrid - 3 days onsite, 2 days remote) Candidate Preference: Local candidates or those willing to relocate to the New York area. Employment Type: W2 only Must-Have Skills: Application Support Experience: Approximately 7 years of experience in application support, ideally in a front-office...
-
SRE/DevOps Engineer
4 weeks ago
new york city, United States Open Systems Technologies Full timeA financial firm is looking for an SRE/DevOps Engineer to join their team in New York, NY.Compensation: $150-200kResponsibilitiesDesign, implement, and manage AWS cloud infrastructure using Terraform and CloudFormationDevelop and maintain CI/CD pipelines using GitLab for seamless code deployment and integrationCollaborate with blockchain engineers to ensure...
-
Platform Owner AIOps SRE
1 week ago
Waltham, MA, United States National Grid Full timeAbout us Every day, we deliver safe and secure energy to homes, communities, and businesses, connecting people to the energy they need for their lives. Our expertise and track record position us uniquely to shape the sustainable future of our industry as the pace of change accelerates.To succeed, we must anticipate customer needs, reduce energy delivery...
-
DevOps Engineer
1 month ago
New York, United States Motion Recruitment Full timeOur client, a groundbreaking company in the rewards and financial technology industry, is looking for a Site Reliability Engineer (SRE) / DevOps Engineer to join their dynamic team. This hybrid role, based in New York City, offers an exciting opportunity to manage and enhance infrastructure for a rapidly scaling platform that bridges renters with rewards and...
-
Site Reliability Engineer
1 week ago
Seattle, WA, United States Apple Inc. Full timeSite Reliability Engineer (SRE) - Object Storage People at Apple don’t just build products — they craft the kind of experience that have revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found...
-
Database Reliability Engineer
1 week ago
Denver, CO, United States Checkr Full timeAbout Checkr Checkr builds people infrastructure for the future of work. We've designed a faster—and fairer—way to screen job seekers. Established in 2014, Checkr puts modern technology powered by machine learning in the hands of hiring teams, helping to hire great new people with an experience that’s fast, smooth, and safe. Checkr has over 100,000...
-
Data Infrastructure Architect
2 weeks ago
New York, New York, United States Cockroach Labs Full timeCockroach Labs is the driving force behind CockroachDB, a groundbreaking cloud-native, distributed SQL database that scales rapidly, withstands any challenge, and thrives anywhere. We developed CockroachDB to free teams from the constraints of their databases. Join us on our mission to simplify how businesses build and operate world-changing...
-
Platform Infrastructure Engineer
6 months ago
New York, NY, United States Open Systems Technologies Full timeA financial firm is looking for a Platform Infrastructure Engineer to join their team in New York, NY.Compensation: $150-200kQualifications:A Bachelor's Degree in Computer Science, Engineering, or related technical field 10 years of DevOps, TechOps, or SRE experience with 5 years of AWS experienceMicroservices (Docker, Kubernetes) experience in a...
-
Highly Available Systems Architect
2 weeks ago
New York, New York, United States Perplexity AI Full timePerplexity AI is revolutionizing the way people search and interact with the internet. We are seeking a skilled Site Reliability Engineer (SRE) to join our team in designing, implementing, and scaling high-performance infrastructure and systems.The ideal candidate should have experience in designing scalable infrastructure, building systems, and performing...
-
Senior Data Reliability Engineer
1 week ago
Chicago, IL, United States CME Group Full timeDescription Position Overview: Data System Reliability Engineer (dSRE) CME Group: Where Futures Are MadeCME Group is the world's leading and most diverse derivatives marketplace. But who we are goes deeper than that, here you can impact markets worldwide, transform industries and build a career shaping tomorrow. We invest in your success and you own it,...
-
Site Reliability Engineer
2 weeks ago
New York, United States Cockroach Labs Full timeDatabases are the beating heart of every business in the world. Cockroach Labs is the creator of CockroachDB, the most highly evolved cloud-native, distributed SQL database on the planet that scales fast, survives anything, and thrives anywhere. We created CockroachDB to unshackle teams from the constraints of their database. Join us on our mission to...
-
Sr. Site Reliability Engineer
1 week ago
McLean, VA, United States GameStop Full timeOverview Design. Disrupt. Repeat. Be an agent of change on a team committed to achieving client-focused, mission-driven excellence. Steampunk is looking for an experienced Site Reliability Engineer with an appetite for taking on new challenges. Who We Are Steampunk is the explosive collision of human-centered design and traditional government...
-
Sr. Site Reliability Engineer
1 week ago
McLean, VA, United States Root Center For Advanced Recovery Full timeOverview Design. Disrupt. Repeat. Be an agent of change on a team committed to achieving client-focused, mission-driven excellence. Steampunk is looking for an experienced Site Reliability Engineer with an appetite for taking on new challenges. Who We Are Steampunk is the explosive collision of human-centered design and traditional government contracting. An...
-
Distributed Systems Reliability Expert
2 weeks ago
New York, New York, United States Cockroach Labs Full timeCockroach Labs is the creator of CockroachDB, a cloud-native, distributed SQL database that scales fast, survives anything, and thrives anywhere. Our mission is to simplify how businesses build and operate world-changing applications.About the RoleYou will oversee our production system, ensuring stable and scalable infrastructure as we deliver CockroachDB to...
-
Sr. Data Engineer
1 month ago
New York, United States Vimeo Full timeJob DescriptionJob DescriptionWe seek an experienced Sr. Data Engineer to manage and optimize our vector databases and data stores, such as Elastic Search and AWS OpenSearch. The ideal candidate will have a strong background in cloud platforms and SRE principles and experience leading data infrastructure migrations. This role is critical in maintaining our...
-
High-Performance Infrastructure Specialist
28 minutes ago
New York, New York, United States Perplexity AI Full timeAbout Perplexity AIWe're a fast-growing company leveraging AI to transform search and internet interaction. Our mission is to provide innovative solutions that enhance user experiences. With 10 million monthly active users and significant funding from top investors, we're expanding rapidly.Salary EstimateThis role offers a salary range of $200,000 to...
-
Junior Site Reliability Engineer
7 months ago
New York, United States Transfinder Full timeThe Junior Site Reliability Engineer (SRE) works to ensure Transfinder provides clients with the best-hosted software experience possible. The SRE works collaboratively between Development and Operations to scale, secure, monitor, and maintain cloud infrastructure for running Transfinder products.? Utilizing AWS and other cloud technologies, this position...
-
Platform Infrastructure Cloud, SRE Engineer
1 month ago
New York, United States iCapital Full timeJob DescriptionJob DescriptioniCapital is powering the world's alternative investment marketplace. Our financial technology platform has transformed how advisors, wealth management firms, asset managers, and banks evaluate and recommend bespoke public and private market strategies for their high-net-worth clients. iCapital services approximately $176...