Site Reliability Engineer

1 week ago


Cupertino, California, United States TEKsystems Full time
About the Role

We are seeking a highly skilled Site Reliability Engineer to join our team at TEKsystems. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure, particularly in the areas of Pulsar and Kubernetes.

Key Responsibilities
  • Proactive capacity monitoring: Identify potential issues before they become major problems by analyzing growth and utilization metrics.
  • Reactive 24/7 1st layer support: Provide timely and effective support to resolve incidents and ensure minimal downtime.
  • Pulsar ramp-up: Assist in the initial work to ramp up Pulsar operation in production, including knowledge transfer and ownership of Pulumi recipes.
  • Setup alerts and write runbooks for Pulsar clusters.
  • Help load test, stress test, and fine-tune Pulsar properties.
  • Write K8s operators to autoscale Pulsar infrastructure.
Requirements
  • 6+ years of experience in cloud infrastructure engineering, with a focus on Pulsar and Kubernetes.
  • Experience with scripting languages such as Java, Python, Typescript, or Go.
  • Familiarity with automation tools like Terraform, Pulumi, Helm Charts, Spinnaker, Puppet, and Kustomize.
  • Knowledge of business continuity concepts, including multi-region design patterns, blast radius reduction, GSLB/GTM concepts, and alerting & incident management.
What We Offer
  • A competitive salary and benefits package.
  • The opportunity to work with a talented team of engineers and contribute to the development of cutting-edge cloud infrastructure.
  • A dynamic and supportive work environment that encourages growth and learning.
About TEKsystems

TEKsystems is a leading provider of technology services and solutions, with a strong focus on cloud infrastructure engineering. We are committed to helping our clients achieve their goals through innovative solutions and exceptional service.



  • Cupertino, California, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining our core infrastructure. This infrastructure enables thousands of Apple Developers to submit their Apps to the App Store that delight millions of...


  • Cupertino, California, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our Cloud Service Infrastructure team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and security of our cloud services.Key Responsibilities:Operate, monitor, and prioritize our production and...


  • Cupertino, California, United States Apple Full time

    Job DescriptionApple is seeking an innovative Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a vital role in designing, building, and maintaining our core infrastructure, enabling thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple...


  • Cupertino, California, United States Apple Full time

    Job DescriptionApple is seeking an experienced Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining our core infrastructure, which enables thousands of Apple Developers to submit their Apps to the App Store that delight millions of Apple...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering (ASE) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering (ASE) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our global services.Key ResponsibilitiesLead data-driven roadmap and quarterly planning for a subset of core services from a...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesLead data-driven roadmap and quarterly planning for a subset of core services from a...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Maps Infrastructure team. As a key member of our team, you will be responsible for designing, building, and maintaining scalable and reliable infrastructure services that support our business operations.Key ResponsibilitiesCollaborate with engineering, security, and SRE...


  • Cupertino, California, United States Juniper Networks Full time

    Job Title: Site Reliability EngineerJuniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.Key Responsibilities:Maintain system availability, health, and service levels (SLAs, SLOs) of large-scale...


  • Cupertino, California, United States Juniper Networks Full time

    Job Title: Site Reliability EngineerJuniper Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure.Key Responsibilities:Maintain system availability, health, and service levels (SLAs, SLOs) of large-scale...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Maps Infrastructure team. As a key member of our team, you will be responsible for designing, building, and maintaining scalable and reliable cloud infrastructure services.Key ResponsibilitiesDesign and implement scalable and reliable cloud infrastructure...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Services Engineering (ASE) team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our global services.Key ResponsibilitiesLead data-driven roadmap and quarterly planning for a subset of core services from a...


  • Cupertino, California, United States Bayside Solutions Full time

    Job Title: Site Reliability Engineer, Virtualization and PlanningWe are seeking a highly skilled Site Reliability Engineer to join our team at Bayside Solutions, Inc. This role will focus on supporting Infrastructure as a Service (IaaS) virtualization platforms, Linux compute environments, and capacity planning.Key Responsibilities:Design and implement...


  • Cupertino, California, United States Bayside Solutions Full time

    Job Title: Site Reliability Engineer, Virtualization and PlanningWe are seeking a highly skilled Site Reliability Engineer to join our team at Bayside Solutions, Inc. This role will focus on supporting Infrastructure as a Service (IaaS) virtualization platforms, Linux compute environments, and capacity planning.Key Responsibilities:Design and implement...


  • Cupertino, California, United States Apple Full time

    Role SummaryAs a Site Reliability Engineer at Apple, you will play a critical role in ensuring the reliability and scalability of our cloud services. You will be responsible for designing, building, and implementing innovative solutions to improve the stability, security, and scalability of our cloud systems.Key ResponsibilitiesOperate, monitor, and...


  • Cupertino, California, United States Apple Full time

    Job Title: Site Reliability Engineering ManagerAt Apple, we're looking for a talented Site Reliability Engineering Manager to join our team. As a Site Reliability Engineering Manager, you will be responsible for designing, developing, and operating Fleet Management Services, including core infrastructure to provide fast, secure, and reliable data center...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesLead data-driven roadmap and quarterly planning for a subset of core services from a reliability...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based services.Key ResponsibilitiesLead data-driven roadmap and quarterly planning for a subset of core services from a reliability...


  • Cupertino, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Apple Service Engineering - Solr SRE team. As a key member of our team, you will be responsible for developing processes, tools, and automation for managing distributed systems in production environments.Key ResponsibilitiesDesign and implement scalable search infrastructure...


  • Cupertino, California, United States Juniper Networks Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Juniper Networks. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure.Key ResponsibilitiesMaintain system availability, health, and service levels (SLAs, SLOs) of...