Site Reliability Engineer
3 days ago
About the role
At Luma, our Site Reliability Engineer (SRE) team keeps our platform reliable, secure, and lightning fast. They own everything from AWS infrastructure and Kubernetes clusters to CI/CD pipelines, monitoring, and alerting. If you're passionate about tackling big challenges, automating at scale, and making systems more resilient, we'd love to have you on the team.
Please note:
- This position is required to work from Luma's Cincinnati, OH or New York, NY office 2-3 days/week
- SPONSORSHIP FOR U.S WORK AUTHORIZATION IS NOT AVAILABLE FOR THIS OPPORTUNITY
What you'll do
- Collaborate with product engineering teams to design and build the infrastructure their services run on.
- Keep our Kubernetes clusters on AWS EKS running smoothly, secure, and ready to scale.
- Design and deliver resilience strategies that cover multi-region architecture, backups, disaster recovery, and failover.
- Automate infrastructure with Terraform and Infrastructure-as-Code, reducing manual effort and human error.
- Help teams ship faster by improving CI/CD pipelines and deployment practices.
- Monitor performance and reliability using modern observability tools.
- Support on-call rotations and lead incident response with a focus on long-term fixes.
What We're Looking For
- You code to solve problems and are comfortable in one of the following languages: Python, Bash, Go, Java, or similar.
- You have strong experience with AWS (RDS, CloudFront, IAM, VPCs), Terraform, and Kubernetes.
- You are resilience focused, with experience designing and running systems that remain dependable during failures and recover seamlessly.
- You have hands-on experience improving and operating CI/CD pipelines (e.g., CircleCI, GitHub Actions, or similar) to help teams ship faster with confidence.
- You stay calm under pressure, bringing incident response expertise and strong root-cause analysis skills.
- Most importantly, you are a team player who brings clear communication, strong collaboration, and a mindset of continuous improvement.
Please note: sponsorship for U.S. work authorization is not available for this opportunity.
-
Site reliability engineer
3 days ago
New York, New York, United States WRITER Full timeAbout WRITERWRITER is where the world's leading enterprises orchestrate AI-powered work. Our vision is to expand human capacity through superintelligence. And we're proving it's possible – through powerful, trustworthy AI that unites IT and business teams together to unlock enterprise-wide transformation. With WRITER's end-to-end platform, hundreds of...
-
Site Reliability Engineer
13 hours ago
New York, New York, United States Talkspace Corporate Full time $88,000 - $131,000At Talkspace, we are committed to fostering a diverse, equitable, inclusive, and belonging-centered workplace where everyone can thrive while making a difference in mental health. Want to help over two million people receive quality mental healthcare? Come join our mission of getting therapy in the hands of everyoneWe are looking for an experienced Site...
-
Site Reliability Engineer
1 week ago
New York, New York, United States Madronich Dr Robert Full timeAt Talkspace, we are committed to fostering a diverse, equitable, inclusive, and belonging-centered workplace where everyone can thrive while making a difference in mental health. Want to help over two million people receive quality mental healthcare? Come join our mission of getting therapy in the hands of everyoneWe are looking for an experiencedSite...
-
Site Reliability Engineer
22 hours ago
New York, New York, United States Justworks Full timeCompany Description We work hard. We're helping businesses get off the ground, by simplifying payroll, benefits, payments and compliance needs, enabling them to focus on what matters. We're data driven and never stop inventing new ways to improve.Working at Justworks, you'll enjoy company happy hours, a welcoming and casual environment, great benefits, and...
-
Lead Site Reliability Engineer
22 hours ago
New York, New York, United States Kraken Full timeHelp us use technology to make a big green dent in the universeKraken powers some of the most innovative global developments in energy.We're a technology company focused on creating a smart, sustainable energy system. From optimising renewable generation, creating a more intelligent grid and enabling utilities to provide excellent customer experiences, our...
-
Director, Site Reliability Engineering
2 weeks ago
New York, New York, United States NBCUniversal Full time $189,592 - $220,000Company Description NBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our theme parks and consumer experiences. We own and operate leading entertainment and news brands, including NBC, NBC News,...
-
Site Reliability Engineer
15 hours ago
New York, New York, United States S&P Global Full timeAbout the Role:Grade Level (for internal use):09Site Reliability Engineer – Datadog Specialist The Team: The IT Operations team at S&P Dow Jones Indices (S&P DJI) is tasked with owning and maintaining the Production IT systems that underpin S&P DJI's index platforms and applications, ensuring their high availability. The team prioritizes service...
-
Staff Site Reliability Engineer, Tech Lead
6 days ago
New York, New York, United States Unify Full timeAbout UnifyUnify was founded January 17th, 2023 by Austin Hughes and Connor Heggie. Prior to Unify, Austin led Ramp's growth product team focused on new customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like Airbnb, Spotify, Bridgewater and LinkedIn.Our mission is to build the...
-
Real Asset Site Reliability Engineer
2 days ago
New York, New York, United States MSCI Inc. Full timeYour Team ResponsibilitiesThe MSCI office is offering an excellent opportunity for a Production Support Engineer to join the Real Estate Site Reliability Engineering team.The successful candidate will join a team of IT production engineers and will report to the Production Team Lead. The role requires close co-operation with the development and...
-
Staff Software Engineer, Reliability
22 hours ago
New York, New York, United States Metropolis Full time $180,000 - $200,000Who we areMetropolis is an artificial intelligence company that uses computer vision technology to enable frictionless, checkout-free experiences in the real world. Today, we are reimagining parking to enable millions of consumers to just "drive in and drive out." We envision a future where people transact in the real world with a speed, ease and convenience...