Staff Site Reliability Engineer

2 months ago


San Francisco, United States CV Library Full time

JOB TITLE: Staff SRE

TOP 3 SKILLS:

  1. GoLang
  2. Kubernetes
  3. Ruby

LOCATION: Remote

DURATION: Direct Hire

RATE RANGE: $160-180K

SUMMARY:

We're looking for a driven software engineer who cares deeply about their craft and who wants to use their skills to bring about positive change in the world while working in a high-performing organization that uses modern software development approaches. The ideal candidate is comfortable with the rapidly changing nature of a startup environment and adept at moving relentlessly forward, doing what needs to be done to unblock projects that truly deliver value to our users.

RESPONSIBILITIES:

  1. Site Reliability Engineer: Collaborate with engineers and cross-functional teams to proactively identify and mitigate risks, ensuring timely and effective solutions. Advocate for reducing complexity and focus on empowering others across the tech stack to drive excellence and innovation.
  2. Lean and Agile Owner: Collaborate with cross-functional teams to distill and synthesize non-functional requirements into discrete and meaningful iterations that can be quickly implemented. Leverage Lean Startup and Agile methodologies along with Continuous Integration and Continuous Deployment infrastructure to rapidly prototype and validate ideas.
  3. Operational Maintainer: As an SRE, you will be responsible for managing the on-call rotation for the engineering squad. When not actively triaging or responding to an incident, you are expected to spend the balance of your time building processes, procedures, and technology that keep our service level indicators (SLIs) aligned with our service level objectives (SLOs).
  4. Problem Solver: Be ready, willing, and able to dive into logs, statsd, and other platform telemetry to identify potential performance, scale, and stability issues before they become bottlenecks.

QUALIFICATIONS:

  1. 5+ years of engineering experience, at least part of which is in a startup environment
  2. Recent and relevant experience with compliance and security regulations and processes
  3. Alignment with the BetterUp mission of enabling self-driven behavior change
  4. Track record of success in a remote work environment
  5. Advanced-level experience with Infrastructure as Code (e.g., Terraform, CloudFormation)
  6. Willingness to participate in a 24x7x365 on-call rotation
  7. Experience identifying and establishing meaningful service level indicators and objectives
  8. Experience with Kubernetes
  9. Experience developing in a high-level programming language (e.g., Python, Ruby, JavaScript)
  10. Experience with Agile product development processes (Scrum, Kanban, Lean Startup, etc.)
  11. Strong verbal and written communication
  12. Impressive track record of maintaining a high bar of quality, stability, and availability
