Staff Site Reliability Engineer

5 days ago


San Francisco, California, United States | Hinge Health | Full time

Hinge Health is moving people beyond pain by transforming the way it is treated and prevented. Connecting people digitally and in person with expert clinical care, we combine advanced technology, AI, and a care team of experts to guide people through personalized care directly from their phone. Our approach is proven to reduce pain by 68%, prevent 42% of new opioid prescriptions, and avoid more than half of joint replacement surgeries. Available to 18M people, Hinge Health is trusted by leading health plans and employers, including Land O'Lakes, L.L. Bean, Salesforce, Self-Insured Schools of California, Southern Company, City of Boston, US Foods, Toyota, and Verizon.

Learn more at

Here at Hinge Health, we welcome all applicants and know a diverse team makes us better and stronger. We look for individuals who embody our leadership principles, and we value varied experiences and skill sets. Beyond specific work experience, we also look for unique capabilities and skill sets that are key indicators an applicant will thrive in our fast-paced, frequently evolving environment. If this sounds like the kind of place you'd like to be part of, please apply - we would love to hear from you.

Hinge Health Hybrid Model:

We believe that remote work and in-person work have their own advantages and disadvantages, and we want to be able to leverage the best of both worlds. Employees in hybrid roles are required to be in the office 3 days/week.

About the Role

Site Reliability Engineers at Hinge Health are software engineers who take responsibility for all operational aspects of the Hinge Health runtime platform, including automation for self-service provisioning and configuration of environments, containers, logging, monitoring, alerting, and the relevant compliance and security automation associated with the platform. The ideal candidate thrives in a highly collaborative, cross-functional environment.

WHAT YOU'LL ACCOMPLISH

  • Leadership: Ensure a culture that values technical excellence together with support and compassion for individuals
  • Enable others by creating automation, tools, and collaborative workflow processes for engineers to ramp up new capabilities
  • Contribute to platform, networking and infrastructure configuration and deployment
  • Ensure all systems are appropriately auditable, with automation for supplying evidence
  • Enhance and automate failover capability and resiliency to fault conditions
  • Implement exhaustive testing, logging and monitoring with relevant alerting
  • Actively manage risk and prioritize vital infrastructure work
  • Assist with automating continuous integration and continuous delivery systems
  • Aid with automated system-wide testing efforts (smoke testing)

BASIC QUALIFICATIONS

  • 6+ years of SRE-focused projects involving process automation, systems administration, and/or general SRE activities
  • 4+ years of experience working with Terraform automating the provisioning, deployment, and scaling of AWS resources
  • Understanding of cloud security best practices and compliance standards
  • Excellent understanding of and experience with EKS, Helm, and GitHub Actions
  • Experience defining and implementing best practices for AWS architecture, including security, performance, and cost optimization.
  • Programming expertise with a high-level language such as Python

PREFERRED QUALIFICATIONS

  • Ability to collaborate and problem solve across teams
  • Work on projects and respond to escalations simultaneously
  • Excellent communication skills, both written and verbal
  • Prior experience with healthcare data (PHI/PII/HIPAA requirements)
  • Experience developing software in Ruby on Rails or Node-based applications
  • Prior experience with PKI and/or cybersecurity

WHAT YOU'LL LOVE ABOUT US

  • Inclusive healthcare and benefits: On top of comprehensive medical, dental, and vision coverage, we offer employees and their family members help with gender-affirming care, tools for family and fertility planning, and travel reimbursements if healthcare isn't available where you live.
  • Planning for the future: Start saving for the future with our traditional or Roth 401(k) retirement plan options, which include a 2% company match.
  • Modern life stipends: Manage your own learning and development

About Hinge Health:

LinkedIn recently named Hinge Health one of the Top 50 Startups. Forbes, Fast Company, and Inc. have also recognized our technology, innovation, and culture.

Since our founding in 2014, we've raised more than $800 million from leading investors, including Coatue and Tiger Global. We work with 1000 customers across every industry and the public sector — including Salesforce, Verizon, and the State of New Jersey — to give more than 23 million people access to the care they need. We're positioned to continue leading the market with unmatched investments in clinical research, care innovation, machine learning, AI, and computer vision.

Diversity and Inclusion:

We're committed to building diverse teams that reflect the communities we serve. Visit to learn more about what moves us.

Hinge Health is an equal opportunity employer and prohibits discrimination and harassment of any kind. We make employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, veteran status, disability status, pregnancy, or any other basis protected by federal, state, or local law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.

We provide reasonable accommodations for candidates with disabilities. If you feel you need assistance or an accommodation due to a disability, let us know by reaching out to your recruiter.

By providing your information through this page or applying for a job at Hinge Health, you acknowledge that Hinge Health will collect, use, and process your information as part of our job application process. For more information on how Hinge Health processes your personal information, click here to view our Applicant and Personnel Privacy Notice.

Disclaimer:

Phishing attempts continue to increase significantly across all industries, with fraudsters impersonating real employees and sending fictitious job offers to applicants in a scheme to obtain sensitive information. Please note that we will never ask for your financial information at any point in the interview process, including the post-offer stage, and will only correspond through domain email addresses.

If you encounter any suspicious activity, we recommend you cease all communication with the individual and consider reporting them to the U.S. FBI Internet Crime Complaint Center. If you would like to verify the legitimacy of an email you received from our recruiting team, please forward it to

*Please do not send resumes via email*


