We have other current jobs related to this field that you can find below


  • Seattle, Washington, United States Flexe Full time

    Flexe solves the hardest omnichannel logistics problems for the world's largest retailers and brands. Integrating technology, open logistics networks, and elastic economic models allows Flexe customers to move fast, at scale, and with precision. Founded in 2013 and headquartered in Seattle, Flexe brings deep logistics expertise and enterprise-grade...


  • Seattle, Washington, United States Apple Full time

    Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our Apple Services Engineering team in Seattle, Washington. As a key member of our dynamic team, you will play a critical role in ensuring the availability, latency, and overall health of our object store orchestration service.Key...


  • Seattle, United States Prodigy Resources Full time

    About Us: Prodigy is seeking an SRE to join our client's organization which is leading the charge in fintech innovation, providing state-of-the-art solutions that drive financial success and empower our clients. As they embark on an exciting Greenfield project, they're seeking an experienced Site Reliability Engineer to join their team. This role is critical...


  • Seattle, United States Prodigy Resources Full time

    About Us: Prodigy is seeking an SRE to join our clients organization which is leading the charge in fintech innovation, providing state-of-the-art solutions that drive financial success and empower our clients. As they embark on an exciting Greenfield project, theyre seeking an experienced Site Reliability Engineer to join their team. This role is critical...


  • Seattle, United States Apple Full time

    Senior Site Reliability Engineer, Object Storage Seattle, Washington, United States Software and Services The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. They...


  • Seattle, United States Prodigy Resources Full time

    About Us:Prodigy is seeking an SRE to join our client's organization which is leading the charge in fintech innovation, providing state-of-the-art solutions that drive financial success and empower our clients. As they embark on an exciting Greenfield project, they're seeking an experienced Site Reliability Engineer to join their team. This role is critical...


  • Seattle, United States Prodigy Resources Full time

    About Us:Prodigy is seeking an SRE to join our client's organization which is leading the charge in fintech innovation, providing state-of-the-art solutions that drive financial success and empower our clients. As they embark on an exciting Greenfield project, they're seeking an experienced Site Reliability Engineer to join their team. This role is critical...


  • Seattle, United States Apple Full time

    To view your favorites, sign in with your Apple ID. Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Join Apple’s Cloud Service Infrastructure team as a site reliability...


  • Seattle, United States West500 Partners Full time

    Our client is a fast-growing downtown Seattle startup developing AI automation for professional services, including legal technology and medical records. They have a great product market fit and rapidly increasing revenues and are currently in need of a local Software Engineering Lead with CI/CD expertise, an AWS background, and a keen interest in innovative...


  • Seattle, United States West500 Partners Full time

    Our client is a fast-growing downtown Seattle startup developing AI automation for professional services, including legal technology and medical records. They have a great product market fit and rapidly increasing revenues and are currently in need of a local Software Engineering Lead with CI/CD expertise, an AWS background, and a keen interest in innovative...


  • Seattle, United States Oracle Full time

    OCI Incident Response is the first line of defense for maintaining the high availability of Oracle’s cloud. We make customer-impacting events shorter, less frequent, and less impactful by providing large-scale incident management. We are front-and-center in driving down event duration by using our operational experience, knowledge of standard processes,...


  • Seattle, Washington, United States F5 Networks Full time

    About F5 NetworksAt F5 Networks, we are dedicated to shaping a superior digital landscape. Our teams empower organizations worldwide to create, secure, and operate applications that enhance our interactions with the ever-evolving digital environment.We are deeply committed to cybersecurity, safeguarding consumers from fraud, and enabling businesses to...


  • Seattle, United States Capgemini Full time

    **Site Reliability Engineer** **FTE with benefits** Our team is looking to add experienced Site Reliability / DevOps Engineer to our team. + Experiencedwith **Python and Shell Scripting.** + **Shouldhave extensive experience with Azure or AWS (Azure preferred)** + **Experiencewith Monitoring and Observability - Datadog** + **Experiencewith Infrastructure as...


  • Seattle, United States Oracle Full time

    OCI Incident Response is the first line of defense for maintaining the high availability of Oracle’s cloud. We make customer-impacting events shorter, less frequent, and less impactful by providing large-scale incident management. We are front-and-center in driving down event duration by using our operational experience, knowledge of standard processes,...


  • Seattle, Washington, United States Oracle Full time

    Overview: The OCI Incident Response team serves as the primary defense mechanism for ensuring the uninterrupted operation of Oracle's cloud services. Our mission is to reduce the frequency and impact of customer-affecting incidents by implementing effective large-scale incident management strategies. We leverage our operational expertise, adherence to...


  • Seattle, United States Oracle Full time

    OCI Incident Response is the first line of defense for maintaining the high availability of Oracle’s cloud. We make customer-impacting events shorter, less frequent, and less impactful by providing large-scale incident management. We are front-and-center in driving down event duration by using our operational experience, knowledge of standard processes,...


  • Seattle, United States Moloco Full time

    About the Role Moloco is a machine learning company that operates at massive scale (we ingest 10 petabytes of training data per day), and our models are blazingly fast (return predictions in 10 milliseconds or less); and a profitable unicorn (we are valued at $2 billion and have been profitable for the last 13+ quarters). We are looking for an exceptional...


  • Seattle, Washington, United States Circle Full time

    About the RoleWe are seeking a highly skilled Cloud Engineer to join our team at Circle, a leading financial technology company. As a Senior Site Reliability Engineer, you will play a critical role in designing, building, and maintaining our cloud infrastructure estate to meet the growing demands of our worldwide customer base.You will be responsible for...


  • Seattle, Washington, United States Apple Full time

    Overview:Position Number: The Apple Services Engineering team exemplifies Apple's dedication to merging creativity with technology. We invite you to join the Apple Services Engineering Cloud Service Infrastructure team as a Site Reliability Engineer, where you will play a pivotal role in supporting and expanding cloud services for millions of Apple users....


  • Seattle, United States Oracle Full time

    We are seeking experienced cloud technologists, interested in solving hard problems on tight schedules, to join our Major Incident Management team. OCI Incident Response is the first line of defense for maintaining the high availability of Oracles c Reliability Engineer, Architect, Liability, Engineer, Principal, Reliability, Technology

Senior Site Reliability Engineer

2 months ago


Seattle, United States Sentry Full time
About Sentry

Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology.

With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies like Disney, Microsoft, and Atlassian spend less time fixing bugs and more time building products. If you like to selfishly build things that make your digital life better, come help us build the next generation of software monitoring tools.

About the role

The Site Reliability Engineering team is responsible for the deployment, configuration, maintenance and monitoring of Sentry's hosted platform. We do this by leveraging automation tools to automatically spin up and scale services to meet the traffic demands of 1,000,000+ developers. Sentry receives over a billion events a day, and processes terabytes of data to return complex aggregations with sub-second latency.

As Senior Site Reliability Engineer, you will work with a multitude of technologies and have a direct impact on how Sentry evolves to handle 100x our current event volume. You’ll contribute to the vision of the SRE team in a world of cloud providers and partner with other engineering teams in their efforts to grow and sustain Sentry.

In this role you will
  • Ensure the uptime and reliability of Sentry's hosted platform
  • Architect and automate services and systems to meet the demand of scale
  • Analyze and tune systems to operate at maximum efficiency
  • Collaborate with other Engineering teams to deploy and scale new and existing services
  • Be a member of the team's on-call rotation - being available to respond and resolve critical issues
You’ll love this job if you
  • Enjoy leading the way
  • Enjoy fiddling with new cloud technologies and services
  • Dig into system internals during the troubleshooting process
  • Have seen networks make and break hosted solutions, and have direct experience with growing and maintaining distributed systems
  • Are familiar with the various SaaS ecosystems and have taken ownership of a service you once knew nothing about
  • Got a story (or two) of royally goofing it and can tell us why it would never happen again under your watch
Qualifications
  • 5+ years of relevant experience
  • Experience with production monitoring and logging tools
  • Experience with some or all of the following tools we leverage:
    • System Administration: Debian, Docker
    • Cloud: Google Cloud Platform
    • Databases: PostgreSQL, ClickHouse, Redis, BigTable
    • Environment Management: Saltstack, Kubernetes, Terraform
    • TCP/HTTP Routing: HAProxy, NGINX, Envoy
    • Data Streaming Platforms: Kafka, RabbitMQ
  • Experience with programming and scripting (Python is a plus)
  • Good written and oral communication skills and ability to articulate technical concepts clearly and succinctly

The base salary range (or hourly wage range, if applicable) that Sentry reasonably expects to pay for this position is $175,000 to $215,000. A successful candidate’s actual base salary (or hourly wage) amount will be determined by a variety of relevant factors including, without limitation, the candidate’s work location, education, work and other relevant experience, skills, and job-related knowledge. A successful candidate will be eligible to participate in Sentry’s employee benefit plans/programs applicable to the candidate’s position (including incentive compensation, equity grants, paid time off, and group health insurance coverage). See Sentry Benefits for more details about the Company’s benefit plans/programs. #LI-DNI

Equal Opportunity at Sentry

Sentry is committed to providing equal employment opportunities to its employees and candidates for employment regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other legally-protected characteristic. This commitment includes the provision of reasonable accommodations to employees and candidates for employment with physical or mental disabilities who require such accommodations in order to (a) perform the essential functions of their jobs, or (b) seek employment with Sentry. We strive to build a diverse team, with an inclusive culture where every teammate can thrive. Sentry is an open-source company because we believe that everyone, everywhere, should have the ability and tools to make great software. Software should be accessible. That starts with making our industry accessible.

If you need assistance or an accommodation due to a disability, you may contact us at accommodations@sentry.io.

Want to learn more about how Sentry handles applicant data? Get the details in our Applicant Privacy Policy.