Senior Site Reliability Engineer

21 hours ago


Seattle WA United States Apple Full time

Senior Site Reliability Engineer - ASE

Seattle, Washington, United States

Software and Services

Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Join Apple’s Cloud Service Infrastructure team as a Site Reliability Engineer to help support and scale cloud services for thousands of development and operations engineers. This is a hands-on role to establish SRE practices for a private cloud service to accelerate our ability to reliably and consistently deliver thousands of applications.

Description

As a Site Reliability Engineer, you will be responsible for providing the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish. The successful candidate will be highly self-motivated with a passion for excellence, quality, and detail. The SRE will not only support operations but also work closely with the developers and architects within the team to aid in the design and assist with the implementation to improve stability, security, and scalability.

AS AN SRE AT APPLE, YOU WILL:
  1. Operate, monitor, and triage all aspects of our production and non-production environments.
  2. Pioneer and implement the next-generation telemetry system.
  3. Prepare alert-handling procedures and runbooks, and collaborate with the offshore SRE teams.
  4. Automate deployment and orchestration of services into the cloud environment as well as other routine processes.
  5. Actively participate in capacity planning, scale testing, and disaster recovery exercises.
  6. Interact with and support partner teams, including engineering, QA, and program management.
  7. Cultivate and maintain relationships with internal and external third-party vendors.

Minimum Qualifications

  1. Bachelor's Degree in Computer Science, an engineering-related field, or equivalent related experience. Advanced Degree preferred.
  2. 5+ years in a Site Reliability Engineering, DevOps, or Infrastructure focused role.
  3. Must be an expert and have in-depth professional experience working with Kubernetes.
  4. Proficient in Go (Golang).

Key Qualifications

  1. Ability to implement and coordinate telemetry using monitoring and observability tools such as Splunk, Grafana, and Prometheus.
  2. Experience operating large-scale, multi-tenant infrastructure as a managed service.
  3. Knowledge of the Linux operating system and its variations.
  4. Experience with GitOps, deployment strategies, and CI/CD tools such as Spinnaker and Argo.
  5. Able to troubleshoot issues across the entire infrastructure stack.
  6. Outstanding organizational and communication skills.

Pay & Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $166,600 and $296,300, and your base pay will depend on your skills, qualifications, experience, and location. Apple employees also have the opportunity to become Apple shareholders through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.

