Service Reliability Engineer

2 weeks ago


Seattle, Washington, United States Apple Full time

The Service Reliability Engineer role in Apple Services Engineering requires a mix of strategic engineering and design along with hands-on technical work. This SRE will configure, tune, and fix multi-tiered systems to achieve optimal application performance, stability, and availability. We manage jobs and applications on bare-metal and cloud computing platforms to deliver data processing for many of Apple's global products. Our teams work with exabytes of data, petabytes of memory, and tens of thousands of jobs to deliver predictable, performant data analytics that power features in Apple Music, TV+, App Store, and other world-class products. If you love designing and running systems and infrastructure that will impact millions of users, then this is the place for you.

Key Responsibilities:

  • Configure, tune, and fix multi-tiered systems to achieve optimal application performance, stability, and availability.
  • Manage jobs as well as applications on bare-metal and cloud computing platforms.
  • Deliver data processing for many of Apple's global products.
  • Work with exabytes of data, petabytes of memory, and tens of thousands of jobs.
  • Deliver predictable, performant data analytics that power features in Apple Music, TV+, App Store, and other world-class products.

Requirements:

  • BS degree in computer science or an equivalent field with 5+ years of experience, or MS degree with 3+ years of experience, or equivalent.
  • At least 5 years in a Service Reliability Engineering (SRE), DevOps, or infrastructure-focused role.
  • 5+ years of running services in a large-scale *nix environment.
  • Understanding of SRE principles and goals along with prior on-call experience.
  • The ability to design, author, and release code in any language (Go, Python, Ruby, or Java would be a plus).
  • Deep understanding of and experience with one or more of the following: Hadoop, Spark, Flink, Kubernetes, AWS.

Preferred Qualifications:

  • Fast learner with excellent analytical problem-solving and interpersonal skills.
  • Experience supporting Java applications.
  • Experience using monitoring and logging solutions like Splunk, Grafana, etc.
  • Familiarity with DNS, HTTP, message queues, queueing theory, RPC frameworks, and datastores.
  • Experience working with geographically distributed teams and implementing high-level projects and migrations.
  • Strong communication skills and ability to deliver results on time with high quality.


  • Seattle, Washington, United States PMI WW Brands LLC Full time

    Job Title: Service Reliability Engineer. Stanley, a HAVI company, is seeking a highly skilled Service Reliability Engineer to join our Foundational Technology team. As a key member of our team, you will play a crucial role in shaping and optimizing our software development and deployment processes. Key Responsibilities: Develop and implement broad best-practices...


  • Seattle, Washington, United States PMI Full time

    About the Role: Stanley, a HAVI company, is experiencing rapid growth and is seeking an experienced Engineering Manager for Service Reliability to join our team. As a key member of our software development team, you will play a crucial role in shaping and optimizing our software development and deployment processes. Key Responsibilities: Develop and implement...

  • Reliability Engineer

    4 weeks ago


    Seattle, Washington, United States Amentum Full time

    Job Title: Reliability Engineer. Amentum is seeking a highly skilled Reliability Engineer to join our team. As a Reliability Engineer, you will be responsible for ensuring the optimal performance and reliability of our equipment and facilities. Key Responsibilities: Develop and maintain a reliability program to minimize downtime and maximize asset uptime. Perform...

  • Reliability Engineer

    3 weeks ago


    Seattle, Washington, United States Meta Platforms, Inc. Full time

    Job Title: Reliability Engineer. Meta Platforms, Inc. is seeking a skilled Reliability Engineer to join our team. As a Reliability Engineer, you will play a critical role in ensuring the reliability and quality of our products. Job Summary: We are looking for a highly motivated and detail-oriented individual to collaborate with cross-functional teams to identify...


  • Seattle, Washington, United States Apple Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the scalability, availability, and performance of our services. Key Responsibilities: Design, implement, and maintain large-scale distributed systems to support our...


  • Seattle, Washington, United States Apple Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will be responsible for designing, building, and operating large-scale distributed systems that provide secure, scalable, and highly available services to our customers. Key Responsibilities: Design and...

  • Reliability Engineer

    3 weeks ago


    Seattle, Washington, United States Amentum Full time

    Job Title: Reliability Engineer. We are seeking a highly skilled Reliability Engineer to join our team at Amentum. As a key member of our maintenance team, you will be responsible for ensuring the reliability and efficiency of our equipment and processes. Key Responsibilities: Develop and maintain the reliability program, including predictive maintenance routes...


  • Seattle, Washington, United States Apple Full time

    Job Summary: The Service Reliability Engineer role in Apple Services Engineering requires a unique blend of strategic engineering and technical expertise. As a key member of our team, you will be responsible for configuring, tuning, and fixing multi-tiered systems to achieve optimal application performance, stability, and availability. Key...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. At Apple, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure. About the Role: We are seeking a talented and motivated individual to join our dynamic...


  • Seattle, Washington, United States Blue Origin Full time

    Reliability Engineer Opportunity at Blue Origin. We are seeking a highly skilled Reliability Engineer to join our team at Blue Origin. As a key member of our Engines business unit, you will be responsible for ensuring the reliability and safety of our engines and propulsion systems. Your primary focus will be on identifying factors that drive engine reliability...

  • Reliability Engineer

    3 weeks ago


    Seattle, Washington, United States Blue Origin Full time

    Reliability Engineer - Engines & Avionics. At Blue Origin, we're pushing the boundaries of space exploration and development. As a Reliability Engineer - Engines & Avionics, you'll play a critical role in ensuring the reliability and safety of our engines and avionics systems. Key Responsibilities: Identify and analyze reliability requirements for engine control...

  • Reliability Engineer

    3 weeks ago


    Seattle, Washington, United States Blue Origin Full time

    Reliability Engineer - Engines & Avionics. At Blue Origin, we're pushing the boundaries of space exploration and development. As a Reliability Engineer - Engines & Avionics, you'll play a critical role in ensuring the reliability and safety of our engines and avionics systems. Key Responsibilities: Identify and mitigate reliability risks in engine and avionics...


  • Seattle, Washington, United States Blue Origin Full time

    Job Title: Senior Reliability Engineer. At Blue Origin, we're pushing the boundaries of space exploration and development. As a Senior Reliability Engineer, you'll play a critical role in ensuring the reliability and safety of our engines and propulsion systems. Responsibilities: Develop and implement reliability requirements for engine control systems. Support...

  • Reliability Engineer

    3 weeks ago


    Seattle, Washington, United States Blue Origin Full time

    Reliability Engineer - Engines & Avionics. At Blue Origin, we're pushing the boundaries of space exploration and development. As a Reliability Engineer - Engines & Avionics, you'll play a critical role in ensuring the reliability and safety of our engines and avionics systems. Key Responsibilities: Identify and analyze reliability requirements for engine and...


  • Seattle, Washington, United States Oracle Full time

    About the Role: Oracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, develop, and deploy software to improve the availability, scalability, and efficiency of...


  • Seattle, Washington, United States Oracle Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Oracle. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. You will work closely with our development teams to design, implement, and operate large-scale distributed...


  • Seattle, Washington, United States Tik Tok Full time

    About the Role: This is a Site Reliability Engineer position, focusing on the data pipeline reliability for the Video Platform team in USDS. Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled and motivated Site Reliability Engineer to join our dynamic and growing team at Apple. About the Role: As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure. Key Responsibilities: Design, implement,...