Cloud Monitoring SRE Manager
4 weeks ago
At Apple, we're committed to delivering exceptional services that revolutionize entire industries. As a Cloud Monitoring SRE Manager, you'll play a critical role in ensuring the reliability and performance of our cloud-based monitoring services.
Key Responsibilities
- Lead SRE teams responsible for the reliability and performance of cloud-based monitoring services
- Collaborate with developers and architects to design and implement improvements to stability, security, and scalability
- Promote observability of systems for monitoring, alerting, and metrics reporting
- Advocate best practices of reliability engineering
Requirements
- 5+ years of experience handling services in a large-scale environment
- Desire to build, grow, and mentor a team to meet both their career goals and the organization's goals
- Experience with Cloud Computing technologies (particularly Kubernetes)
- Experience and confidence around incident response and incident management
- Experience with the Prometheus ecosystem
- Practical experience with Python and Bash scripting
- Theoretical knowledge of Go, Java, and/or Scala
Preferred Qualifications
- 2+ years professional experience in an engineering leadership position
- Comfortable with Open Source configuration management and orchestration tools (such as Helm, Puppet, and Spinnaker)
- Developing and delivering multi-mode communications that convey a clear understanding of the unique needs of different audiences
- Knowing the most effective and efficient processes to get things done, with a focus on continuous improvement
- Rebounding from setbacks and adversity when facing difficult challenges and balancing the needs of multiple stakeholders
What We Offer
- Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services
- Reimbursement for certain educational expenses, including tuition
- Discretionary bonuses or commission payments, as well as relocation
Note
Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
-
Cloud Monitoring SRE
4 weeks ago
Seattle, Washington, United States Apple Full time
Cloud Monitoring SRE: At Apple, we're looking for a skilled Cloud Monitoring SRE to join our team. As a Cloud Monitoring SRE, you will be responsible for designing and building the next generation of cloud and systems monitoring infrastructure, focusing on automation, availability, performance, and efficiency at scale. You will work closely with our engineering...
-
Cloud Monitoring SRE Manager
4 weeks ago
Seattle, Washington, United States Apple Full time
At Apple, we're looking for a passionate and dedicated Site Reliability Engineering Manager to lead a team focused on providing our customers with the highest quality Apple Services experience. Our services have to scale globally, stay highly available, and "just work." If you love designing, engineering, and running systems and infrastructure that will help...
-
Cloud Monitoring SRE Manager
1 month ago
Seattle, Washington, United States Apple Full time
About the Role: We are seeking a highly skilled and experienced Site Reliability Engineering Manager to lead our Cloud Monitoring team at Apple. As a key member of our Apple Services Engineering organization, you will be responsible for designing, building, and operating the monitoring and observability platform that enables our customers to have a seamless...
-
Cloud Monitoring SRE
4 weeks ago
Seattle, Washington, United States Apple Full time
Job Description: At Apple, we're looking for a skilled Cloud Monitoring SRE to join our team. As a Cloud Monitoring SRE, you will be responsible for designing and building the next generation of cloud and systems monitoring infrastructure, focusing on automation, availability, performance, and efficiency at scale. Key Responsibilities: Design and build cloud...
-
Cloud Monitoring SRE Manager
4 weeks ago
Seattle, Washington, United States Apple Full time
About the Role: We are seeking a highly skilled Cloud Monitoring SRE Manager to lead our team in providing exceptional observability capabilities for our customers. As a key member of our Cloud Services Engineering team, you will be responsible for designing, engineering, and running systems and infrastructure that will help millions of customers. Key...
-
Cloud Monitoring SRE
4 weeks ago
Seattle, Washington, United States Apple Full time
Job Description: Apple Services Engineering infrastructure is BIG. Operating at our scale, across multiple geographically dispersed data centers and servicing hundreds of millions of users presents unique challenges. As a Site Reliability Engineer on the Cloud Monitoring Team at Apple, you will be working to improve the reliability and performance of the...
-
SRE DevOps Engineer
4 weeks ago
Seattle, Washington, United States Adobe Full time
About the Role: We are seeking an experienced SRE DevOps Engineer to join our Identity Resilience team at Adobe. As a key member of our team, you will be responsible for building and evolving the next generation of Identity Services for Adobe's cloud platform. Key Responsibilities: Design and implement performance and availability optimizations across all layers...
-
Cloud Advocate
4 weeks ago
Seattle, Washington, United States Datadog Full time
About the Role: We are seeking a highly skilled Developer Advocate to join our team at Datadog. As a key member of our engineering team, you will play a critical role in shaping the future of cloud observability and monitoring. Key Responsibilities: Develop and deliver technical content, including blog posts, conference talks, and demos, to educate developers on...
-
Senior Cloud Reliability Engineer
1 month ago
Seattle, Washington, United States Apple Full time
Senior Site Reliability Engineer: Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. This is a hands-on role to establish SRE practices for a private cloud service to accelerate our...
-
SRE DevOps Engineer
4 weeks ago
Seattle, Washington, United States Adobe Systems Inc Full time
Job Summary: We are seeking a highly skilled SRE DevOps Engineer to join our Identity Resilience team at Adobe Systems Inc. The successful candidate will be responsible for building and evolving the next generation of Identity Services for our cloud platform. Key Responsibilities: Work in all layers of an n-tier application stack, starting from infrastructure...
-
Senior Cloud Solutions Architect
4 weeks ago
Seattle, Washington, United States HashiCorp Full time
Job Summary: The Resident Technology Services team at HashiCorp offers premium, long-term professional services to our customers. As a Senior Cloud Solutions Architect, you will be part of this team, working closely with our customers to align with their biggest technical transformation initiatives. You will be responsible for designing and executing plans for...
-
SRE / Sr. DevOps Engineer
4 weeks ago
Seattle, Washington, United States Capgemini Full time
Job Summary: Capgemini is seeking a highly skilled SRE / Sr. DevOps Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable, highly available, and secure cloud-based systems. Key Responsibilities: Design and implement scalable, highly available, and secure cloud-based...
-
Staff Cloud Security Engineer
4 weeks ago
Seattle, Washington, United States Zscaler Full time
About Zscaler: Zscaler (NASDAQ:ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world's largest security cloud, Zscaler accelerates digital transformation so enterprises can be more agile, efficient, resilient, and secure. The pioneering, AI-powered...
-
Senior Cloud Reliability Engineer
4 weeks ago
Seattle, Washington, United States Apple Full time
Senior Site Reliability Engineer: Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. This is a hands-on role to establish SRE practices for a private cloud service to accelerate our...
-
Software Engineer
4 weeks ago
Seattle, Washington, United States Xaira Therapeutics Full time
We are seeking a skilled Cloud Engineer to join our team at Xaira Therapeutics. As a Cloud Engineer, you will be responsible for designing, building, and maintaining our internal platform to support engineers, AI scientists, and our cutting-edge AI-powered biotechnology company. Key Responsibilities: Design and implement cloud infrastructure solutions using...
-
Seattle, Washington, United States Amazon Full time
About the Role: We are seeking a highly skilled Software Development Manager to lead our AWS Monitoring Systems team. As a key member of our Cloud Infrastructure organization, you will be responsible for architecting, building, and scaling distributed software systems that ensure the health of AWS hardware. Your primary focus will be on leading a team of...
-
Service Reliability Engineer
4 weeks ago
Seattle, Washington, United States Apple Full time
The Service Reliability Engineer role in Apple Services Engineering requires a mix of strategic engineering and design along with hands-on, technical work. This SRE will configure, tune, and fix multi-tiered systems to achieve optimal application performance, stability and availability. We manage jobs as well as applications on bare-metal and cloud computing...
-
Site Reliability Engineering Manager
4 weeks ago
Seattle, Washington, United States Apple Full time
Role Overview: As a Site Reliability Engineering Manager at Apple, you will be responsible for leading a team that provides the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish. Key Responsibilities: Establish SRE practices for a private cloud service to accelerate...
-
Site Reliability Engineering Lead
4 weeks ago
Seattle, Washington, United States DAT Solutions Full time
About DAT Solutions: We are a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on us for the most relevant data...
-
Cloud Support Specialist
1 month ago
Seattle, Washington, United States Jobs for Humanity Full time
Cloud Support Specialist: As a Cloud Support Specialist, you will play a critical role in ensuring the smooth operation of our Cloud Program. Your responsibilities will include providing frontline cloud operational support, incident management, and administration for Morgan Stanley (AWS, Snowflake, MongoDB Atlas, Confluent Cloud). You will also be responsible...