Site Reliability Engineer
3 weeks ago
Juniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the high availability and performance of our cloud infrastructure. This includes monitoring, troubleshooting, and resolving issues across microservices and distributed platforms.
Key Responsibilities
- Maintain system availability, health, and service levels (SLAs, SLOs) of large-scale cloud infrastructure running in AWS and GCP.
- Support infrastructure components, data streaming frameworks, and databases, such as Kubernetes, Flink, Storm, Spark, Kafka, Cassandra, Elasticsearch, Redis, Postgres, ArangoDB, and many others.
- Monitor, troubleshoot, analyze failures, and provide support for software engineers to debug production issues across microservices and distributed platforms.
- Join the on-call rotation and resolve issues in a 24x7 multi-cloud (AWS/GCP) environment.
- Monitor metrics and performance of applications and cloud infrastructure.
- Handle the entire incident management lifecycle: reporting, analyzing, and handling incidents through closure, and writing RCAs.
- Write and update runbooks for knowledge-driven automated processes and bots.
- Perform capacity planning based on performance, usage, and utilization stats.
- Follow SRE best practices and procedures.
Qualifications
- Bachelor's degree in computer science or computer engineering or equivalent.
- 5+ years of DevOps/SRE experience.
- 3+ years of hands-on experience with AWS or GCP, EC2 (GCE), IAM, S3 (GS), Docker, Kubernetes pods, Jenkins, Prometheus, CloudWatch (Stackdriver), Linux, Ansible.
- 3+ years of experience deploying code and infrastructure in AWS or GCP using continuous integration/continuous delivery (CI/CD) tools in production environments.
- 5+ years of administration experience with distributed computation and streaming frameworks, such as Kafka, Cassandra, Elasticsearch, Flink, Storm, and Spark, and cloud services such as EMR, Dataproc, ElastiCache, AWS RDS, GCP SQL, or similar.
- 5+ years of automation using Python, Golang, and/or Rust, plus shell scripting.
- 5+ years of prior experience developing metrics to monitor the health of infrastructure and applications.
- Good understanding of Terraform, CloudFormation, or another IaC tool is preferred.
- Any open-source development experience.
- AIOps/GenAI experience.
- Automation using workflow services such as GitHub Actions, Google Workflows, Jenkins, GitLab, Slack, and Confluence/Jira.
- Microservices release operations experience.
Juniper Networks offers a competitive salary range of $140,800.00 to $202,400.00 per year, as well as medical benefits, 401(k) eligibility, vacation, sick time, and parental leave.
-
Site Reliability Engineer
1 month ago
Cupertino, California, United States Apple Full time
Job Title: Site Reliability Engineer
At Apple, we're looking for a highly skilled Site Reliability Engineer to join our Cloud Service Infrastructure team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and security of our cloud services.
Key Responsibilities:
Operate, monitor, and prioritize our production and...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
Job Summary
We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in designing, building, and maintaining our core infrastructure, which enables thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
Job Summary
Apple is seeking a skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a vital role in designing, building, and maintaining our core infrastructure, ensuring the reliability and scalability of our services.
About the Role
This is an exciting opportunity to work with a...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our distributed systems and infrastructure.
Key Responsibilities
Design, develop, and maintain scalable and reliable systems and...
-
Site Reliability Engineer
1 month ago
Cupertino, California, United States Apple Full time
Job Description
Apple is seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a vital role in designing, building, and maintaining our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple...
-
Site Reliability Engineer
1 month ago
Cupertino, California, United States Apple Full time
Job Description
Apple is seeking an experienced Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining our core infrastructure, which enables thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Juniper Networks Full time
About the Role
Juniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. You will work closely with our development team to identify and resolve issues, and collaborate with other teams...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based services.
Key Responsibilities
Design and implement scalable search infrastructure using Solr, Kafka, and...
-
Site Reliability Engineer
2 weeks ago
Cupertino, California, United States Apple Full time
Job Summary
Apple is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for developing processes, tools, and automation for managing distributed systems in production environments. You will work closely with our software and systems engineering teams to build and run large-scale, massively...
-
Site Reliability Engineer
4 weeks ago
Cupertino, California, United States Apple Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering (ASE) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our global services.
Key Responsibilities
Lead data-driven roadmap and quarterly planning for a subset of core services from a...
-
Site Reliability Engineer
1 month ago
Cupertino, California, United States Juniper Networks Full time
Job Title: Site Reliability Engineer
Juniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.
Key Responsibilities:
Maintain system availability, health, and service levels (SLAs, SLOs) of large-scale...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team in Cupertino. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our distributed systems and services.
Key Responsibilities
Design, develop, and maintain scalable and reliable...
-
Site Reliability Engineer
2 weeks ago
Cupertino, California, United States Juniper Networks Full time
Job Description
Juniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. This includes monitoring, troubleshooting, and resolving issues in our cloud-based systems.
Key Responsibilities
Design,...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
Job Summary
Apple is seeking a highly skilled Site Reliability Engineer to join our Apple Service Engineering - SRE team. As a key member of our team, you will be responsible for developing processes, tools, and automation for managing distributed systems in production environments. Our SRE team combines software and systems engineering and system...
-
Site Reliability Engineer
4 weeks ago
Cupertino, California, United States Apple Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.
Key Responsibilities
Lead data-driven roadmap and quarterly planning for a subset of core services from a reliability...
-
Site Reliability Engineer
4 weeks ago
Cupertino, California, United States Apple Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team in Cupertino. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our distributed systems and services.
Key Responsibilities
Design, develop, and maintain scalable and reliable...
-
Senior Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
Job Title: Senior Site Reliability Engineer
At Apple, we're looking for a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Apple Services Engineering team, you'll play a vital role in designing, building, and maintaining our core infrastructure.
Key Responsibilities:
Collaborate with cross-functional teams to understand...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Apple Full time
Job Title: Site Reliability Engineer
At Apple, we're looking for a skilled Site Reliability Engineer to join our Edge & Messaging SRE team. As a Site Reliability Engineer, you'll play a critical role in building and running the services that hundreds of millions of customers use every day.
About the Role
This team provides systems that are foundational for many...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Juniper Networks Full time
About the Role
Juniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, reliability, and performance of our cloud infrastructure.
Key Responsibilities
Maintain system availability, health, and service levels (SLAs, SLOs) of large-scale cloud...
-
Site Reliability Engineer
3 weeks ago
Cupertino, California, United States Juniper Networks Full time
Job Description
Juniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. This includes monitoring, troubleshooting, and resolving issues across our microservices and distributed platforms.
Key...