Senior Site Reliability Engineer

1 month ago


Seattle, United States Saxon Global Full time

Starbucks Senior Site Reliability Engineer (Cloud)
8-month contract (likely extension to 18 months with strong performance)
Hybrid (must be local to the Seattle area; onsite at Starbucks headquarters 3 days a week, with 2 days remote)

Job Summary and Mission

This position supports Starbucks on its Data Platform Services team. This team maintains and improves the data platform that many Starbucks services depend on. When you order coffee with your rewards stars on the Starbucks mobile app, that application relies on this data platform to pull your information in real time as you order. The platform is also responsible for maintaining the data on items purchased in Starbucks stores, so the company can track and analyze which items are the most popular and adjust its offering accordingly. This is a critical data platform and will be a great learning and career growth opportunity with an enterprise-scale company that has one of the best company cultures in the Seattle area.

Summary of Key Responsibilities

Responsibilities and essential job functions include but are not limited to the following:

- Responsible for the health of production systems
- Develop monitoring dashboards
- Configure alerts and automate processes for system recovery
- Monitor alerts and take proactive steps to resolve system issues
- Troubleshoot production issues
- Lead production troubleshooting calls
- Responsible for patches and updates on production systems
- Design and build cutting-edge, multi-microservice solutions to support Starbucks's growth worldwide
- Work with cross-functional teams on ongoing design efforts and systems support
- Automate password and certificate rotations on application and DB servers
- Help the CI/CD team roll out applications and infrastructure globally
- Collaborates with the development team and other Information Technology (IT) teams' developer leads
- Initiates process improvements for new and existing systems
- Coaches and mentors other team members
- Performs cross-training and facilitates information sharing among team members
- Participates in a production support rotation that includes pager responsibilities

Most Important Requirements to be Considered:

• Senior Site Reliability Engineering Experience - 5+ years as an SRE and 7+ total in IT Industry

• Expert in Azure Cloud

• Expert in Kubernetes

• Strong Skills with SQL

• Strong skills with Kafka, Event Hub, or other Messaging Broker

• Must be able to commute to Starbucks headquarters in Seattle 3 times a week - local to Seattle area.

Summary of Experience

- Requires 7+ years of experience in the IT industry
- Requires 5+ years of software and DevOps development engineering
- Experience working in a cloud environment, Azure preferred
- Experience using Kafka, Event Hub, or any messaging broker
- Experience with Cassandra, PostgreSQL, Cosmos DB
- Experience with Jenkins / Python / Terraform / Ansible
- Experience with DataDog, Splunk, or other logging and APM tools
- Experience working in a Linux environment
- In-depth understanding of Computer Science fundamentals in object-oriented design, data structures, algorithms, and problem solving
- Experience building complex, scalable, high-performance software systems that have been successfully delivered to customers
- Demonstrated knowledge of best practices for the design and implementation of large-scale systems, as well as experience in taking such systems from design to production
- Experience building and operating mission-critical, highly available (24x7) systems
- Ability to work well with a team in a fast-paced agile development environment
- Bachelor's in Computer Science or equivalent work experience
- Excellent communication, analytical, and problem-solving skills
- Extensive understanding of SDLC and scrum methodologies

Required Knowledge, Skills and Abilities

- Strong interpersonal skills
- Ability to communicate clearly and concisely, both orally and in writing
- Strong analytical and problem-solving skills
- Proficiency in programming languages
- Ability to quickly learn new application systems and technologies
- Knowledge of basic project management framework and methodology
- Ability to accurately break down complex application designs into component deliverables and estimate design and development timelines
- Requires strong Systems Life Cycle methodology experience
- Requires excellent oral, written, and presentation skills

General IT Skills:

- Experience in application support: problem diagnosis and resolution
- Expert in interpretation of functional requirements
- Development of technical design specifications for complex projects
- Expert in industry-standard development methodologies
- Experience in middleware integration using tools like webMethods
- A good understanding of industry standards and best practices to be able to conduct code reviews
- Conduct code reviews with the team to improve compliance with established best practices and coding standards
- Provide mentorship and guidance to the team to improve overall quality of code and application development
- Work with team members to ensure application designs are in line with best practices, are scalable and reliable, and optimize performance and usability
- Requires strong problem-solving and analytic skills to translate business requirements into systems solutions
- Integrate application support efforts with concurrent, parallel application development efforts

Core Competencies

Customer Focus

- Delivers legendary service that meets and exceeds all customers' expectations

Ethics and Integrity

- Adheres to Starbucks values, beliefs and principles during good and bad times

Composure

- Remains calm, maintains perspective and responds in a professional manner when faced with tough situations

Personal Learning

- Takes personal responsibility for the continuous learning of new knowledge, skills and experiences

Dealing with Ambiguity

- Able to successfully function during times of uncertainty and changing priorities

Decision-Making

- Makes timely and quality decisions based on a mixture of analysis, wisdom, experience, and judgment

Interpersonal Savvy

- Builds effective relationships with all people; up, down and sideways, inside and outside of Starbucks

Results Oriented

- Gets results and achieves goals

Background Check: Yes
Candidate must be your W2 Employee: Yes
Exclusive to Apex: No
Face to face interview required: No
Candidate must be local: Yes
Candidate must be authorized to work without sponsorship: No
Interview times set: No
Type of project: Development/Engineering
Master Job Title: Architect: Apps (Other)
Branch Code: Seattle


We have other current jobs related to this field that you can find below


  • Seattle, United States SingleStore Full time

    Position Overview MemSQL is seeking a Senior Site Reliability Engineer to help drive our Kubernetes product strategy surrounding our managed service. You will be at the forefront; crafting the design, building out the collaborated vision, and sustaining your envisioned product strategy. This role will be an integral part of building our managed service...


  • Seattle, Washington, United States Flexe Full time

    Flexe solves the hardest omnichannel logistics problems for the world's largest retailers and brands. Integrating technology, open logistics networks, and elastic economic models allows Flexe customers to move fast, at scale, and with precision. Founded in 2013 and headquartered in Seattle, Flexe brings deep logistics expertise and enterprise-grade...


  • Seattle, United States Sentry Full time

    About Sentry Bad software is everywhere, and we’re tired of it. Sentry is on a mission to help developers write better software faster, so we can get back to enjoying technology. With more than $217 million in funding and 100,000+ organizations that believe we’re on to something, we're building performance and error monitoring tools that help companies...


  • Seattle, United States Apple Full time

    Senior Site Reliability Engineer, Object Storage Seattle, Washington, United States Software and Services The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they...


  • Seattle, United States West500 Partners Full time

    Our client is a fast-growing downtown Seattle startup developing AI automation for professional services, including legal technology and medical records. They have a great product market fit and rapidly increasing revenues and are currently in need of a local Software Engineering Lead with CI/CD expertise, an AWS background, and a keen interest in innovative...


  • Seattle, United States Censys Full time

    Censys knows the internet and cloud better than anyone else. Attack Surface Management provides customers with an attacker-centric view of all externally facing internet and cloud to extend visibility, prioritize, and remediate the most critical risk exposures that will actually lead to a breach. Our daily IPv4 scans and the world’s largest SSL/TLS...


  • Seattle, United States Capgemini Full time

    Site Reliability Engineer FTE with benefits Our team is looking to add an experienced Site Reliability / DevOps Engineer to our team. Experienced with Python and Shell Scripting. Should have extensive experience with Azure or AWS (Azure preferred). Experience with Monitoring and Observability - Datadog. Experience with Infrastructure as...


  • Seattle, United States Oracle Full time

    OCI Incident Response is the first line of defense for maintaining the high availability of Oracle’s cloud. We make customer-impacting events shorter, less frequent, and less impactful by providing large-scale incident management. We are front-and-center in driving down event duration by using our operational experience, knowledge of standard processes,...


  • Seattle, United States Perkins Coie Full time

    Job Description: Perkins Coie is seeking a highly skilled and experienced Site Reliability Engineer (SRE) specializing in automation and storage management to join our team. The ideal candidate will be responsible for designing, implementing, and maintaining our storage infrastructure to ensure high availability and performance. They will be part of the SRE...


  • Seattle, United States Capgemini Full time

    Lead Site Reliability Engineer Seattle, WA FTE/Direct hiring with benefits No Remote - Onsite and Hybrid position from WA location only Qualification & Skills 8+ years of experience in Site Reliability Engineering or related field Develop, maintain and configure cloud observability systems (e.g., Datadog, Splunk, OpenTelemetry, APM, etc.). Build flexible...


  • Seattle, United States Axon Enterprise Inc Full time

    As a contributor in the APX platform engineering organization, you are passionate about delivering solutions to the real-time problems our mission-critical cloud native services encounter. You are also obsessed about achieving the high quality and re...


  • Seattle, United States F5 Networks Full time

    At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers around...


  • Seattle, United States Capgemini Full time

    Lead Site Reliability Engineer Seattle, WA FTE/Direct hiring with benefits No Remote - Onsite and Hybrid position from WA location only Qualification & Skills 8+ years of experience in Site Reliability Engineering or related field Develop, maintain and configure cloud observability systems (e.g., Datadog, Splunk, OpenTelemetry, APM, etc.). ...


  • Seattle, United States Starbucks Full time

    Now Brewing – Site Reliability Engineer I, Digital Displays The IOT & Retail Hardware organization spends most of its time out at the edges of the map. Naturally, this means lots of exploration—prototyping and building and testing for internal efforts or in collaboration with R&D teams. But our work doesn’t stop when experiments do. POCs that test well...

  • Reliability Engineer

    2 weeks ago


    Seattle, United States JLL Full time

    OVERVIEW - Reliability Engineer JLL is seeking a Reliability Engineer to join our team! In JLL Work Dynamics our most significant assets are our "People" and our "Clients". We will act with Dignity and Respect, make Ethical Decisions, champion Corporate Responsibility and serve as a driving force for a Sustainable Asset Management. There are opportunities for...

  • Software Engineer

    2 months ago


    Seattle, United States Lacework Full time

    At Lacework, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, big sky thinking, and obsess over getting the details right. We love what we do and are proud of our work to secure clouds and container environments for thousands of users...

  • Site Reliability

    1 month ago


    Seattle, United States Canonical Full time

    This role is an opportunity for a hands-on, but literally hands-off, technologist with a passion for Linux to build a career with Canonical and drive the success with those leveraging Ubuntu and open source products. If you have experience of IT operations automation, Infrastructure as Code and a passion for technology, then you will enjoy working with some...