Cloud Monitoring SRE

4 weeks ago


Seattle, Washington, United States Apple Full time
About the Role

We are seeking a highly skilled Cloud Monitoring SRE to join our team at Apple. As a Cloud Monitoring SRE, you will be responsible for designing, building, and operating the monitoring infrastructure that provides visibility into the services and infrastructure that run Apple.

Key Responsibilities
  • Design and build the next generation of cloud and systems monitoring infrastructure, focusing on automation, availability, performance, and efficiency at scale.
  • Work closely with engineering teams to produce and roll out fixes for systemic and latent reliability issues.
  • Automate and reduce toil by implementing configuration management and deployment tools, monitoring systems and services, and optimizing performance and resource utilization.
  • Implement runbooks for everyday maintenance actions, respond to incidents, and diagnose and follow up on system outages or alerts.
  • Collaborate with a global and asynchronously communicating team to deliver high-quality results.
Requirements
  • B.S. in computer science or similar field or equivalent experience.
  • Minimum of 2 years of industry experience.
  • Experience with Python, Bash scripting, or another programming language.
  • Strong sense of ownership and integrity demonstrated through clear communication and collaboration.
  • Experience and confidence around incident response and incident management.
  • Experience with or knowledge of managing and scaling distributed systems in a public, private, or hybrid cloud environment.
  • Experience with or knowledge of the Prometheus ecosystem.
  • Acute drive to automate manual operations and improve them through repeated iteration.
  • Comfortable with open-source configuration management and orchestration tools (such as Helm, Puppet, and Spinnaker).
  • Familiarity with microservices architecture and container orchestration with Kubernetes.
What We Offer

At Apple, we offer a comprehensive compensation package, including base pay, discretionary bonuses, and commission payments. We also provide a range of benefits, including comprehensive medical and dental coverage, retirement benefits, and reimbursement for certain educational expenses. Additionally, Apple employees are eligible for discretionary restricted stock unit awards and can purchase Apple stock at a discount through our Employee Stock Purchase Plan.

We are an equal opportunity employer committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.


  • Cloud Monitoring SRE

    4 weeks ago


    Seattle, Washington, United States Apple Full time

    Cloud Monitoring SRE: At Apple, we're looking for a skilled Cloud Monitoring SRE to join our team. As a Cloud Monitoring SRE, you will be responsible for designing and building the next generation of cloud and systems monitoring infrastructure, focusing on automation, availability, performance, and efficiency at scale. You will work closely with our engineering...

  • Cloud Monitoring SRE

    3 weeks ago


    Seattle, Washington, United States Apple Full time

    Job Description: At Apple, we're looking for a skilled Cloud Monitoring SRE to join our team. As a Cloud Monitoring SRE, you will be responsible for designing and building the next generation of cloud and systems monitoring infrastructure, focusing on automation, availability, performance, and efficiency at scale. Key Responsibilities: Design and build cloud...


  • Seattle, Washington, United States Apple Full time

    At Apple, we're looking for a passionate and dedicated Site Reliability Engineering Manager to lead a team focused on providing our customers with the highest quality Apple Services experience. Our services have to scale globally, stay highly available, and "just work." If you love designing, engineering, and running systems and infrastructure that will help...


  • Seattle, Washington, United States Apple Full time

    About the Role: We are seeking a highly skilled and experienced Site Reliability Engineering Manager to lead our Cloud Monitoring team at Apple. As a key member of our Apple Services Engineering organization, you will be responsible for designing, building, and operating the monitoring and observability platform that enables our customers to have a seamless...


  • Seattle, Washington, United States Apple Full time

    Role Summary: At Apple, we're committed to delivering exceptional services that revolutionize entire industries. As a Cloud Monitoring SRE Manager, you'll play a critical role in ensuring the reliability and performance of our cloud-based monitoring services. Key Responsibilities: Lead SRE teams responsible for the reliability and performance of cloud-based...

  • Cloud Monitoring SRE

    4 weeks ago


    Seattle, Washington, United States Apple Full time

    Cloud Monitoring SRE - Automation Expert: At Apple, we're looking for a talented Cloud Monitoring SRE - Automation Expert to join our team. As a key member of our Cloud Monitoring team, you'll be responsible for designing and building the next generation of cloud and systems monitoring infrastructure. You'll work closely with our engineering teams to automate...


  • Seattle, Washington, United States Apple Full time

    About the Role: We are seeking a highly skilled Cloud Monitoring SRE Manager to lead our team in providing exceptional observability capabilities for our customers. As a key member of our Cloud Services Engineering team, you will be responsible for designing, engineering, and running systems and infrastructure that will help millions of customers. Key...


  • Seattle, Washington, United States Apple Full time

    Role Overview: Apple is seeking a highly skilled Cloud Monitoring SRE Manager to lead a team responsible for ensuring the reliability and performance of our cloud-based monitoring services. As a key member of our Service Engineering team, you will be responsible for designing, implementing, and maintaining the systems and infrastructure that support our...

  • Cloud Monitoring SRE

    4 weeks ago


    Seattle, Washington, United States Apple Full time

    Job Description: Apple Services Engineering infrastructure is BIG. Operating at our scale, across multiple geographically dispersed data centers and servicing hundreds of millions of users presents unique challenges. As a Site Reliability Engineer on the Cloud Monitoring Team at Apple, you will be working to improve the reliability and performance of the...

  • SRE DevOps Engineer

    4 weeks ago


    Seattle, Washington, United States Adobe Full time

    About the Role: We are seeking an experienced SRE DevOps Engineer to join our Identity Resilience team at Adobe. As a key member of our team, you will be responsible for building and evolving the next generation of Identity Services for Adobe's cloud platform. Key Responsibilities: Design and implement performance and availability optimizations across all layers...


  • Seattle, Washington, United States CloudBC Labs Full time

    Job Title: Senior SRE - Data DevOps Specialist. CloudBC Labs is seeking a highly skilled Senior SRE - Data DevOps Specialist to join our team. As a key member of our Cloud Infrastructure team, you will be responsible for ensuring the health and reliability of our production systems. Key Responsibilities: Develop and maintain monitoring dashboards to ensure...

  • SRE/DevOps Engineer

    4 weeks ago


    Seattle, Washington, United States Capgemini Full time

    Job Title: SRE/DevOps Engineer. Capgemini is seeking an experienced SRE/DevOps Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and highly available systems in the cloud. Key Responsibilities: Design and implement scalable and highly available systems in the...

  • Cloud Advocate

    3 weeks ago


    Seattle, Washington, United States Datadog Full time

    About the Role: We are seeking a highly skilled Developer Advocate to join our team at Datadog. As a key member of our engineering team, you will play a critical role in shaping the future of cloud observability and monitoring. Key Responsibilities: Develop and deliver technical content, including blog posts, conference talks, and demos, to educate developers on...

  • Cloud Advocate

    4 weeks ago


    Seattle, Washington, United States Datadog Full time

    We're seeking a seasoned Cloud Advocate to join our team at Datadog. As a key member of our Cloud Alliances team, you'll play a crucial role in shaping the future of cloud observability and monitoring. Your primary responsibility will be to educate developers on the benefits of cloud computing, leveraging your expertise in GCP services to create compelling...


  • Seattle, Washington, United States Elit IT Inc. Full time

    Sr. SRE (Site Reliability Engineer) - Data DevOps/ DataOps/ No-SQL: Elit IT Inc. is seeking a highly skilled Senior Site Reliability Engineer to join our team in Seattle, WA. As a key member of our Data DevOps team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement...


  • Seattle, Washington, United States Apple Full time

    Senior Site Reliability Engineer: Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. This is a hands-on role to establish SRE practices for a private cloud service to accelerate our...

  • SRE DevOps Engineer

    4 weeks ago


    Seattle, Washington, United States Adobe Systems Inc Full time

    Job Summary: We are seeking a highly skilled SRE DevOps Engineer to join our Identity Resilience team at Adobe Systems Inc. The successful candidate will be responsible for building and evolving the next generation of Identity Services for our cloud platform. Key Responsibilities: Work in all layers of an n-tier application stack, starting from infrastructure...


  • Seattle, Washington, United States Capgemini Full time

    Job Summary: Capgemini is seeking a highly skilled SRE / Sr. DevOps Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable, highly available, and secure cloud-based systems. Key Responsibilities: Design and implement scalable, highly available, and secure cloud-based...


  • Seattle, Washington, United States Apple Full time

    Job Summary: Apple is seeking a highly skilled Senior Site Reliability Engineer to join our Object Storage team. As a key member of our team, you will be responsible for designing, implementing, and maintaining our cloud-based object storage infrastructure. Key Responsibilities: Design and implement scalable and highly available cloud-based object storage...

  • Software Engineer

    4 weeks ago


    Seattle, Washington, United States Xaira Therapeutics Full time

    We are seeking a skilled Cloud Engineer to join our team at Xaira Therapeutics. As a Cloud Engineer, you will be responsible for designing, building, and maintaining our internal platform to support engineers, AI scientists, and our cutting-edge AI-powered biotechnology company. Key Responsibilities: Design and implement cloud infrastructure solutions using...