Site Reliability Engineer

2 weeks ago


Los Angeles, California, United States TikTok Full time
About the Role

TikTok is seeking an experienced Site Reliability Engineer to join our USDS Video Platform team. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide.

As a Site Reliability Engineer, you will work closely with our engineering teams to design, implement, and operate our video processing platform. You will be responsible for monitoring the system, responding to incidents, and performing capacity management to ensure system stability and cost-effectiveness.

We are looking for a passionate and self-motivated individual who is eager to take on exciting challenges and contribute to the growth of our platform.

Responsibilities
  • Ensure the overall reliability of TikTok's video system, including video publishing and distribution.
  • Perform lifecycle management of production systems, including change management, service deployment, operations, and emergency response.
  • Monitor the system and respond to incidents to maintain the system's service level agreement (SLA); review and follow up on all production incidents.
  • Perform capacity management of compute, storage, and network bandwidth resources to ensure system stability and reduce infrastructure costs.
  • Provide strong support during major events to ensure the system can handle a large volume of Internet traffic.
  • Build tools, automations, visualizations, and monitors to facilitate the operation and optimization of the global infrastructure.
Qualifications
  • Bachelor's degree in Computer Science or a related technical background involving software/system engineering, or equivalent working experience.
  • 2+ years of SRE or DevOps experience in large-scale online services.
  • Programming experience with at least one of the following languages: C, C++, Java, Python, C#, or Go.
What We Offer

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

We are passionate about this and hope you are too. If you are passionate about ensuring software reliability, love problem-solving, and are prepared for exciting challenges, we would like you to join our team.

  • Los Angeles, California, United States Capgemini Full time

    Job Title: Site Reliability Engineer. Capgemini is seeking a skilled Site Reliability Engineer to join our team in Sunnyvale, CA or Sylmar, CA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our cloud-based applications using Azure Kubernetes Services (AKS). Key Responsibilities: Maintain and improve...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Engineer. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform. Key...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Engineer. At City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform. Key Responsibilities: Implement solutions that...


  • Los Angeles, California, United States TikTok Full time

    About the Role: We are seeking an experienced Site Reliability Engineer to join our USDS Video Platform team at TikTok. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide. Responsibilities: Design and implement scalable and reliable systems to support our...


  • Los Angeles, California, United States StubHub Full time

    About the Opportunity: StubHub is seeking a Senior Site Reliability Engineer to design and develop next-generation technologies and complex features. As a key member of our team, you will be responsible for ensuring the reliability, availability, and performance of our critical systems. Key Responsibilities: Build and maintain an observability platform to monitor...


  • Los Angeles, California, United States City National Bank Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform. Key Responsibilities: Implement solutions that improve stability, security,...


  • Los Angeles, California, United States StubHub Full time

    About the Role: StubHub is seeking a Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and developing next-generation technologies and complex features to ensure the reliability, availability, and performance of our critical systems. Key Responsibilities: Build and maintain an...


  • Los Angeles, California, United States StubHub Full time

    About the Role: StubHub is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing and developing next-generation technologies and complex features to ensure the reliability, availability, and performance of our critical systems. Key Responsibilities: Build...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal Engineer. At City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a Site Reliability Principal Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our cloud-based systems. Key Responsibilities: Design, build, and manage...


  • Los Angeles, California, United States Disqo Full time

    About DISQO: DISQO is the brand experience (BX) platform for understanding every customer experience. Businesses trust DISQO to power better decisions for every customer, touchpoint, and outcome. DISQO's insights, agile testing and advertising measurement products are powered by millions of consumers on the industry's largest opt-in consumer data platform. Our...


  • Los Angeles, California, United States Capgemini Full time

    About the Role: Capgemini is seeking a skilled Site Reliability Engineer to join our team in Sunnyvale, CA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our cloud-based applications using Azure Kubernetes Services (AKS). Key Responsibilities: Maintain and improve the reliability and performance of...