See more Collapse

Principal Site Reliability Engineer

2 months ago


Seattle, United States Workday Full time

Your work days are brighter here.At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market. And when we began to rise, one thing that really set us apart was our culture. A culture which was driven by our value of putting our people first. And ever since, the happiness, development, and contribution of every Workmate is central to who we are. Our Workmates believe a healthy employee-centric, collaborative culture is the essential mix of ingredients for success in business. That’s why we look after our people, communities and the planet while still being profitable. Feel encouraged to shine, however that manifests: you don’t need to hide who you are. You can feel the energy and the passion, it's what makes us unique. Inspired to make a brighter work day for all and transform with us to the next stage of our growth journey? Bring your brightest version of you and have a brighter work day here.About the TeamThe Database Engineering team at Workday is responsible for ensuring the entire Workday’s Data related needs are met with high performance and scale, while providing utmost high availability that our customers expect from Workday. This team takes pride in ensuring seamless operation of 1000s of production and non-production databases across multiple data centers, public clouds and geographies. Are you passionate about database technologies?Do you love to tackle sophisticated, large-scale database challenges in the world today? If yes, then give us a shout

About the RoleWorkday is looking for a highly skilled Senior SRE with a focus on Open Source database technologies and Cloud Native solutions.The ideal candidate will have hands-on expertise in crafting, developing, and running enterprise-level database systems with a keen emphasis on resiliency, high availability, security, and scalability. Proficiency in MySQL, PostgreSQL, and other Cloud Native database technologies is crucial, along with hands-on experience in deploying and maintaining MySQL topology using tools such as Orchestrator.We are open to hiring a Senior or a Principal Site Reliability EngineerAbout YouBasic Qualifications:

For Senior SRE

7+ yrs of experience managing databases for enterprise cloud applications at scale.3+ yrs of working in MySQL and/or Postgres database environments.3+ yrs in deploying and maintaining the MySQL topology management tool, Orchestrator.Expertise with Python/GO.have worked with Infrastructure automation (Terraform, Ansible, etc.), CI/CD pipelines (GIT, Jenkins, Argo etc), and configuration management tools( Ansible, Chef etc).For Principal SRE

10+ yrs of experience managing databases for enterprise cloud applications at scale5+ yrs of working in MySQL and/or Postgres database environments.5+ yrs in deploying and maintaining the MySQL topology management tool, Orchestrator.Expertise with Python/GO.Have worked with Infrastructure automation (Terraform, Ansible, etc.), CI/CD pipelines (GIT, Jenkins, Argo etc), and configuration management tools( Ansible, Chef etc).Other Qualifications:

Experience working with private and public clouds (IAAS, AWS, etc) and capacity management principlesDeep knowledge in MySQL GTID based replication, group replication and Galera high availability solutionsUsing technologies like kubernetes/dockerGreat teammate with excellent interpersonal skills as well as the ability to prioritize multiple tasks in a fast-paced environment.Available for on-call support on a rotating basisYou have a BS/MS or equivalent experience in Computer Science or a related technical field#LI-RS

Workday Pay Transparency StatementThe annualized base salary ranges for the primary location and any additional locations are listed below.

Workday pay ranges vary based on work location. As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission/bonus, as well as annual refresh stock grants. Recruiters can share more detail during the hiring process. Each candidate’s compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things. For more information regarding Workday’s comprehensive benefits, please click here.Primary Location: USA.CA.Pleasanton

Primary Location Base Pay Range: $188,000 USD - $282,000 USD

Additional US Location(s) Base Pay Range: $154,300 USD - $282,000 USD

Our Approach to Flexible Work

With Flex Work, we’re combining the best of both worlds: in-person time and remote. Our approach enables our teams to deepen connections, maintain a strong community, and do their best work. We know that flexibility can take shape in many ways, so rather than a number of required days in-office each week, we simply

spend at least half (50%) of our time each quarter in the office or in the field

with our customers, prospects, and partners (depending on role). This means you'll have the freedom to create a flexible schedule that caters to your business, team, and personal needs, while being intentional to make the most of time spent together. Those in our remote "home office" roles also have the opportunity to come together in our offices for important moments that matter.Pursuant to applicable Fair Chance law, Workday will consider for employment qualified applicants with arrest and conviction records.Workday is an Equal Opportunity Employer including individuals with disabilities and protected veterans.Are you being referred to one of our roles? If so, ask your connection at Workday about our Employee Referral process #J-18808-Ljbffr


We have other current jobs related to this field that you can find below


  • Seattle, United States Oracle Full time

    OCI Incident Response is the first line of defense for maintaining the high availability of Oracle’s cloud. We make customer-impacting events shorter, less frequent, and less impactful by providing large-scale incident management. We are front-and-center in driving down event duration by using our operational experience, knowledge of standard processes,...


  • Seattle, United States Oracle Full time

    OCI Incident Response is the first line of defense for maintaining the high availability of Oracle’s cloud. We make customer-impacting events shorter, less frequent, and less impactful by providing large-scale incident management. We are front-and-center in driving down event duration by using our operational experience, knowledge of standard processes,...


  • Seattle, United States Moderna Full time

    The Role Moderna is expanding our footprint to Seattle to further our mission of delivering the greatest possible impact to people through mRNA medicines! Our new technology hub in Seattle will focus on software product development for our Commercial, Data & Machine Learning, Cloud Infrastructure, Security, and Engineering Excellence (dev tools) products and...


  • Seattle, United States Capgemini Full time

    **Site Reliability Engineer** **FTE with benefits** Our team is looking to add experienced Site Reliability / DevOps Engineer to our team. + Experiencedwith **Python and Shell Scripting.** + **Shouldhave extensive experience with Azure or AWS (Azure preferred)** + **Experiencewith Monitoring and Observability - Datadog** + **Experiencewith Infrastructure as...


  • Seattle, United States Saxon Global Full time

    Starbucks Senior Site Reliability Engineer (Cloud) 8-month contract (Likely extension to 18 month with strong performance) Hybrid - (Must be local to the Seattle area, onsite at Starbucks headquarters 3 days a week with 2 days remote) Job Summary and Mission This position contributes to Starbucks on their Data Platform Services team. This team maintains...


  • Seattle, United States Perkins Coie Full time

    Job Description: Perkins Coie is seeking a highly skilled and experienced Site Reliability Engineer (SRE) specializing in automation and storage management to join our team. The ideal candidate will be responsible for designing, implementing, and maintaining our storage infrastructure to ensure high availability and performance. They will be part of the SRE...


  • Seattle, United States Capgemini Full time

    LeadSite Reliability Engineer Seattle,WA FTE/Direct hiring with benefits NoRemote - Onsite and Hybrid position fromWA location only Qualification& Skills 8+ years ofexperience in Site Reliability Engineering or related field Develop,maintain and configure cloud observability systems (e.g., Datadog, Splunk,OpenTelemetry, APM, etc.). Buildflexible...


  • Seattle, United States Perkins Coie Full time

    Job Description: Perkins Coie is seeking a highly skilled and experienced Site Reliability Engineer (SRE) specializing in automation and storage management to join our team. The ideal candidate will be responsible for designing, implementing, and maintaining our storage infrastructure to ensure high availability and performance. They will be part of the SRE...


  • Seattle, United States F5 Networks Full time

    At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers around...


  • Seattle, Washington, United States Flexe Full time

    Flexe solves the hardest omnichannel logistics problems for the world's largest retailers and brands. Integrating technology, open logistics networks, and elastic economic models allows Flexe customers to move fast, at scale, and with precision. Founded in 2013 and headquartered in Seattle, Flexe brings deep logistics expertise and enterprise-grade...


  • Seattle, Washington, United States Moderna, Inc. Full time

    The RoleModerna is expanding our footprint to Seattle to further our mission of delivering the greatest possible impact to people through mRNA medicines Our new technology hub in Seattle will focus on software product development for our Commercial, Data & Machine Learning, Cloud Infrastructure, Security, and Engineering Excellence (dev tools) products and...


  • Seattle, United States Moderna, Inc. Full time

    The RoleModerna is expanding our footprint to Seattle to further our mission of delivering the greatest possible impact to people through mRNA medicines! Our new technology hub in Seattle will focus on software product development for our Commercial, Data & Machine Learning, Cloud Infrastructure, Security, and Engineering Excellence (dev tools) products and...


  • Seattle, United States SingleStore Full time

    Position Overview MemSQL is seeking a Senior Site Reliability Engineer to help drive our Kubernetes product strategy surrounding our managed service. You will be at the forefront; crafting the design, building out the collaborated vision, and sustaining your envisioned product strategy. This role will be an integral part of building our managed service...


  • Seattle, United States Capgemini Full time

    **LeadSite Reliability Engineer** **Seattle,WA** **FTE/Direct hiring with benefits** **NoRemote - Onsite and Hybrid position fromWA location only** **Qualification& Skills** + 8+ years ofexperience in Site Reliability Engineering or related field + Develop,maintain and configure cloud observability systems (e.g., Datadog, Splunk,OpenTelemetry, APM, etc.). +...


  • Seattle, United States Sentry Full time

    About Sentry Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology. With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies...


  • Seattle, United States West500 Partners Full time

    Our client is a fast-growing downtown Seattle startup developing AI automation for professional services, including legal technology and medical records. They have a great product market fit and rapidly increasing revenues and are currently in need of a local Software Engineering Lead with CI/CD expertise, an AWS background, and a keen interest in innovative...


  • Seattle, United States West500 Partners Full time

    Our client is a fast-growing downtown Seattle startup developing AI automation for professional services, including legal technology and medical records. They have a great product market fit and rapidly increasing revenues and are currently in need of a local Software Engineering Lead with CI/CD expertise, an AWS background, and a keen interest in innovative...

  • Principal Engineer

    2 months ago


    Seattle, Washington, United States Uber Technologies, Inc. Full time

    Principal Engineer - Platform DatabaseBackend, EngineeringSeattle, Washington | San Francisco, California | Sunnyvale, CaliforniaAbout the TeamThe Platform Engineering team (Compute, Network, Storage, Reliability and Hardware engineering, Corporate IT and more) lies within the broader Uber Engineering group. The mission of the Platform Engineering team is to...


  • Seattle, United States Apple Full time

    Senior Site Reliability Engineer, Object Storage Seattle, Washington, United States Software and Services The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they...


  • Seattle, United States Censys Full time

    Censys knows the internet and cloud better than anyone else. Attack Surface Management provides customers with an attacker-centric view of all externally facing internet and cloud to extend visibility, prioritize, and remediate the most critical risk exposures that will actually lead to a breach. Our daily IPv4 scans and the world’s largest SSL/TLS...