Site Reliability Engineer

1 week ago


Los Angeles, California, United States eTek IT Services, Inc. Full time
Job Overview

The Site Reliability Engineer will play a crucial role in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications, ultimately contributing to the seamless operations of our systems. This role is vital in maintaining a high level of uptime and system efficiency, enhancing the overall user experience, and enabling our organization to meet its objectives.

Key Responsibilities
  • Design and implement monitoring and alerting systems to ensure high availability and performance of cloud services
  • Develop automation tools for cloud provisioning, configuration management, and application deployment
  • Collaborate with cross-functional teams to ensure that new software and systems are production-ready
  • Perform capacity planning and manage cloud infrastructure capacity efficiently
  • Conduct root cause analysis of production issues and implement preventive measures
  • Participate in on-call rotations and respond to system emergencies
  • Ensure compliance with security and regulatory standards in all aspects of the cloud infrastructure
  • Contribute to the continuous improvement of the reliability and performance of cloud systems and applications
  • Implement best practices for cloud infrastructure and services
  • Lead initiatives to optimize system performance and stability
  • Conduct periodic testing of disaster recovery and failover systems
  • Document cloud system configurations, processes, and procedures
  • Assist in evaluating new technologies and methods to improve reliability and performance
Required Qualifications
  • Bachelor's degree in Computer Science, Information Technology, or a related field
  • 3+ years of experience in a site reliability engineering role
  • Proficiency in Linux system administration and troubleshooting
  • Strong programming skills in Python, Shell scripting, or other scripting languages
  • Experience with cloud platforms such as AWS, GCP, or Azure
  • Expertise in building and maintaining scalable, high-performance systems
  • Knowledge of containerization and orchestration technologies (Docker, Kubernetes)
  • Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK)
  • Ability to design and implement automated solutions for cloud infrastructure and application deployment
  • Excellent troubleshooting and problem-solving skills
  • Understanding of networking concepts and protocols
  • Strong communication and collaboration skills
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer) a plus


  • Los Angeles, California, United States Capgemini Full time

    Job Title: Site Reliability EngineerCapgemini is seeking a skilled Site Reliability Engineer to join our team in Sunnyvale, CA or Sylmar, CA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our cloud-based applications using Azure Kubernetes Services (AKS).Key Responsibilities:Maintain and improve...


  • Los Angeles, California, United States Tik Tok Full time

    {"title": "Site Reliability Engineer", "content": "About the RoleTikTok is seeking an experienced Site Reliability Engineer to join our USDS Video Platform team. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide.As a Site Reliability Engineer, you...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States City National Bank Full time

    {"title": "Site Reliability Engineer", "description": "Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States Tik Tok Full time

    About the RoleWe are seeking an experienced Site Reliability Engineer to join our USDS Video Platform team at TikTok. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide.ResponsibilitiesDesign and implement scalable and reliable systems to support our...


  • Los Angeles, California, United States StubHub Full time

    About the OpportunityStubHub is seeking a Senior Site Reliability Engineer to design and develop next-generation technologies and complex features. As a key member of our team, you will be responsible for ensuring the reliability, availability, and performance of our critical systems.Key ResponsibilitiesBuild and maintain an observability platform to monitor...


  • Los Angeles, California, United States City National Bank Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key ResponsibilitiesImplement solutions that improve stability, security,...


  • Los Angeles, California, United States City National Bank Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key ResponsibilitiesImplement solutions that improve stability, security,...


  • Los Angeles, California, United States StubHub Full time

    About the RoleStubHub is seeking a Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and developing next-generation technologies and complex features to ensure the reliability, availability, and performance of our critical systems.Key ResponsibilitiesBuild and maintain an...


  • Los Angeles, California, United States StubHub Full time

    About the RoleStubHub is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and developing next-generation technologies and complex features to ensure the reliability, availability, and performance of our critical systems.Key ResponsibilitiesBuild...


  • Los Angeles, California, United States StubHub Full time

    About the RoleStubHub is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and developing next-generation technologies and complex features to ensure the reliability, availability, and performance of our critical systems.Key ResponsibilitiesBuild...


  • Los Angeles, California, United States StubHub Full time

    About the RoleStubHub is seeking a Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and developing next-generation technologies and complex features to ensure the reliability, availability, and performance of our critical systems.Key ResponsibilitiesBuild and maintain an...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a Site Reliability Principal Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our cloud-based systems.Key Responsibilities:Design, build, and manage...


  • Los Angeles, California, United States Disqo Full time

    About DISQODISQO is the brand experience (BX) platform for understanding every customer experience. Businesses trust DISQO to power better decisions for every customer, touchpoint, and outcome. DISQO's insights, agile testing and advertising measurement products are powered by millions of consumers on the industry's largest opt-in consumer data platform.Our...