Site Reliability Engineer

2 days ago


Sunnyvale, California, United States Apple Full time
About the Role

We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Apple. As a key member of our Manufacturing Systems & Infrastructure (MSI) team, you will play a critical role in maintaining and enhancing the reliability of our production systems.

Key Responsibilities
  • Design, develop, and maintain scalable, reliable, and efficient infrastructure.
  • Implement monitoring, alerting, and logging systems to ensure the health and performance of applications.
  • Automate repetitive tasks and improve system efficiency through scripting and tool development.
  • Collaborate with development teams to improve service reliability and promote best practices in software development and deployment.
  • Conduct root cause analysis of system failures and implement corrective actions to prevent recurrence.
  • Participate in on-call rotations and respond to incidents, minimizing downtime and impact on users.
  • Drive continuous improvement initiatives to enhance system performance, scalability, and reliability.
  • Mentor and provide guidance to junior team members, fostering a culture of learning and innovation.
Requirements
  • 7+ years of experience in site reliability engineering, DevOps, or a related field.
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • Strong experience with cloud platforms: AWS, Google Cloud Platform, or Microsoft Azure.
  • Proficiency in infrastructure as code tools: Terraform, Ansible, or CloudFormation.
  • Expertise in containerization and orchestration: Docker, Kubernetes, and HELM.
  • Strong scripting and programming skills: Python, Go, Shell, or Ruby.
  • In-depth knowledge of monitoring and observability tools: Prometheus, Grafana, Open Telemetry, Splunk.
  • Familiarity with version control systems: Git.
  • Solid understanding of Linux/Unix system administration and networking.
  • Excellent problem-solving skills and a proactive approach to incident management.
  • Experience with database management and optimization: MySQL, PostgreSQL, or NoSQL databases like MongoDB and Cassandra.
  • Knowledge of message brokers and streaming platforms: Kafka, RabbitMQ, or Amazon Kinesis.
What We Offer

At Apple, we offer a comprehensive compensation package, including base pay, discretionary bonuses, and commission payments. Our benefits include comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and reimbursement for certain educational expenses. We are an equal opportunity employer committed to inclusion and diversity.



  • Sunnyvale, California, United States Futran Tech Solutions Pvt. Ltd. Full time

    Job Title: Site Reliability EngineerAbout the Role:Futran Tech Solutions Pvt. Ltd. is seeking a skilled Site Reliability Engineer to join our team. As an SRE, you will be responsible for ensuring the reliability, scalability, and performance of our distributed systems on AWS.Key Responsibilities:• Design, implement, and maintain large-scale distributed...


  • Sunnyvale, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a key member of our Manufacturing Systems & Infrastructure (MSI) team, you will play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and efficient...


  • Sunnyvale, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Apple. As a key member of our Manufacturing Systems & Infrastructure (MSI) team, you will play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and...


  • Sunnyvale, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Apple. As a key member of our Manufacturing Systems & Infrastructure (MSI) team, you will play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and...


  • Sunnyvale, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Apple. As a key member of our Manufacturing Systems & Infrastructure (MSI) team, you will play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and...


  • Sunnyvale, California, United States Synopsys Full time

    About the RoleWe are seeking a highly skilled and experienced Staff Site Reliability Engineer to join our Data Center team at Synopsys. As a key member of our team, you will be responsible for ensuring the reliability and performance of our compute and storage infrastructure.Key ResponsibilitiesMonitor support queues and address tickets promptly, ensuring...


  • Sunnyvale, California, United States Motion Recruitment Full time

    Location: 100% RemoteEmployment Type: Full TimeSalary Range: $140k - $160kA prominent Managed Service Provider with a robust clientele at an enterprise level is seeking to add a full-time remote Site Reliability Engineer to their skilled team.This organization manages over 20,000 servers, developing innovative infrastructures and projects from the ground...


  • Sunnyvale, California, United States Motion Recruitment Partners LLC Full time

    Company OverviewA prominent Managed Service Provider with extensive partnerships and a diverse clientele at an enterprise level is seeking a dedicated full-time remote Site Reliability Engineer. This organization manages a vast array of servers, exceeding 20,000, to create innovative infrastructures and projects from the ground up.Role OverviewAs a Senior...


  • Sunnyvale, California, United States Motion Recruitment Partners LLC Full time

    About the CompanyA prominent Managed Service Provider, recognized for its extensive partnerships and enterprise-level clientele, is seeking a dedicated full-time remote Site Reliability Engineer. This organization manages a vast array of over 20,000 servers, developing innovative infrastructures and projects from the ground up.Role OverviewAs a Senior Site...


  • Sunnyvale, California, United States Motion Recruitment Full time

    Location: 100% RemoteEmployment Type: Full TimeSalary Range: $140k - $160kA prominent Managed Service Provider, recognized for its extensive partnerships and enterprise-level clientele, is seeking a dedicated full-time remote Site Reliability Engineer.This organization manages a vast array of over 20,000 servers, focusing on developing innovative...


  • Sunnyvale, California, United States Motion Recruitment Full time

    Location: 100% RemotePosition Type: Full TimeSalary: $140k - $160kA prominent Managed Service Provider with extensive partnerships and a diverse client base is seeking to add a full-time remote Site Reliability Engineer to their skilled team.This organization manages over 20,000 servers, developing innovative infrastructures and projects from the ground...


  • Sunnyvale, California, United States Red Oak Technologies Full time

    Company Overview:Red Oak Technologies is a premier provider of comprehensive staffing solutions across various sectors, including Information Technology, Marketing, Finance, Business Operations, Manufacturing, and Engineering. Our expertise lies in swiftly sourcing and effectively aligning top-tier professional talent with clients who require highly skilled...


  • Sunnyvale, California, United States Capgemini Engineering Full time

    Job Title: Site Reliability EngineerCapgemini Engineering is seeking a skilled Site Reliability Engineer to join our team in Sunnyvale, CA. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability and performance of our cloud-based applications using Azure Kubernetes Services (AKS).Key Responsibilities:Maintain and improve the...


  • Sunnyvale, California, United States Red Oak Technologies Full time

    Company Overview:Red Oak Technologies is a premier provider of extensive resourcing solutions across diverse industries, including Information Technology, Marketing, Finance, Business Operations, Manufacturing, and Engineering. Our expertise lies in swiftly sourcing and effectively aligning top-tier professional talent with clients who require highly skilled...


  • Sunnyvale, California, United States TekWissen LLC Full time

    Job OverviewTekWissen Group is a leading workforce management provider with a global presence. Our client is a digital technology and transformation company that requires a skilled Site Reliability Engineer to join their team.Job ResponsibilitiesDesign and implement scalable and reliable cloud infrastructure using Kubernetes and containerization.Develop and...


  • Sunnyvale, California, United States NetApp Full time

    About the RoleThe Site Reliability Engineering Manager will lead a dynamic team responsible for ensuring the reliability, performance, and efficiency of our critical systems.Key ResponsibilitiesLead and mentor a team of SREs, fostering a culture of continuous improvement and innovation.Collaborate with product and engineering teams to design and implement...


  • Sunnyvale, California, United States Info Way Solutions Full time

    Java Engineer with Site Reliability Expertise Location: Sunnyvale, CA (Day 1 Onsite) Job Overview: As a Java Engineer with a focus on Site Reliability, you will be responsible for developing robust, efficient, and testable code in Java. Your expertise will play a crucial role in the architecture and deployment of large-scale distributed systems,...


  • Sunnyvale, California, United States Red Oak Technologies Full time

    Company Overview:Red Oak Technologies stands at the forefront of delivering comprehensive staffing solutions across diverse sectors such as Information Technology, Marketing, Finance, Business Operations, Manufacturing, and Engineering. Our expertise lies in swiftly identifying and effectively aligning top-tier professional talent with organizations in...


  • Sunnyvale, California, United States Red Oak Technologies Full time

    Company Overview:Red Oak Technologies stands as a premier provider of extensive resourcing solutions across diverse industries, including Information Technology, Marketing, Finance, Business Operations, Manufacturing, and Engineering. Our expertise lies in swiftly sourcing and effectively aligning top-tier professional talent with clients in urgent need of...


  • Sunnyvale, California, United States Capgemini Engineering Full time

    Site Reliability EngineerCapgemini Engineering is seeking a skilled Site Reliability Engineer to join our team in Sunnyvale, CA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our cloud-based applications using Azure Kubernetes Services (AKS).Key Responsibilities:Maintain and improve the...