Site Reliability Engineer

3 weeks ago


Los Angeles, California, United States Tik Tok Full time
About the Role

TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. As a Site Reliability Engineer on our Video Platform team, you will play a critical role in ensuring the reliability and stability of our video system, which provides excellent experiences for billions of users around the world.

Responsibilities

As a Site Reliability Engineer, you will be responsible for the overall reliability of TikTok's video system, including video publishing and distribution. This involves lifecycle management of production systems: change management, service deployment, operations, and emergency response. You will monitor the system and respond to incidents to maintain service level agreements (SLAs), and you will review and follow up on all production incidents. You will also manage the capacity of compute, storage, and network bandwidth resources to keep the system stable and reduce infrastructure costs, and provide strong support during big events so that the system can handle a large volume of internet traffic. Finally, you will build tools, automations, visualizations, and monitors to facilitate the operation and optimization of the global infrastructure.

Qualifications

To be successful in this role, you will need a Bachelor's degree in Computer Science or a related technical field involving software or systems engineering, or equivalent working experience. You will also need 2+ years of SRE or DevOps experience with large-scale online services, and programming experience in at least one of the following languages: C, C++, Java, Python, C#, or Go. Preferred qualifications include extensive knowledge of networking, operating systems, database systems, and container technology, a good understanding of every aspect of microservice architecture, and hands-on experience troubleshooting large-scale distributed systems. Experience building solutions with AWS, Google, Azure, and other cloud services is a plus. Passion, self-motivation, and good teamwork skills are also essential.

About TikTok

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. We are passionate about this and hope you are too. We are committed to celebrating our diverse voices and creating an environment that reflects the many communities we reach. We regularly review our hybrid work model, and the specific requirements may change at any time.

  • Los Angeles, California, United States ICON Consultants, Inc. Full time

    Job Title: Site Reliability Engineer. At ICON Consultants, Inc., we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our datacenter infrastructure. Responsibilities: Data Monitoring and Alerting: Design and implement data...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Engineer. At City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform. Key Responsibilities: Implement solutions that...


  • Los Angeles, California, United States eTek IT Services, Inc. Full time

    Job Overview: As a Site Reliability Engineer at eTek IT Services, Inc., you will play a vital role in ensuring the reliability, performance, and scalability of our infrastructure and applications. This is a unique opportunity to work with cutting-edge technology and make a significant impact on our organization's success. Key Responsibilities: Design and...


  • Los Angeles, California, United States City National Bank Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform. Key Responsibilities: Implement solutions that improve stability, security,...


  • Los Angeles, California, United States eTek IT Services, Inc. Full time

    Job Opportunity: We are seeking a highly skilled Site Reliability Engineer to join our team at eTek IT Services, Inc. Job Summary: As a Site Reliability Engineer, you will be responsible for ensuring the high availability and scalability of our cloud-based systems. You will work closely with our development team to design, implement, and maintain reliable systems...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal Engineer. At City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our engineering organization, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems. Key Responsibilities: Design, build, and manage...


  • Los Angeles, California, United States City National Bank Full time

    About the Role: We are seeking a highly skilled Site Reliability Principal Engineer to join our team at City National Bank. As a Site Reliability Principal Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform. Key Responsibilities: Architect solutions that improve the...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal Engineer. At City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our cloud operations team, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the data center or cloud platform. Key...


  • Los Angeles, California, United States Loft Orbital Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Loft Orbital. As a key member of our Site Reliability team, you will be responsible for ensuring the reliability, scalability, and maintainability of our ground segment infrastructure. Key Responsibilities: Collaborate with development, operations, and IT teams to...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal Engineer. At City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our engineering organization, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our software platforms. Key Responsibilities: Architect solutions to...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data Security: TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data Security: TikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to users worldwide. Our mission is to empower creators and communities to express themselves authentically. Our Team: The U.S. Data Security team is responsible for protecting sensitive data and information, ensuring the highest...


  • Los Angeles, California, United States Tik Tok Full time

    About the Role: This is a Site Reliability Engineer position, focusing on data pipeline reliability for the Video Platform team in USDS. Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...


  • Los Angeles, California, United States Abbott Laboratories company Full time

    About the Role: Abbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-changing technologies spans the spectrum of healthcare, with leading businesses and products in diagnostics, medical devices, nutritionals and branded generic medicines. As a Senior Site Reliability Engineer, you will play a...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data Security: TikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the reliability of our platform. Responsibilities: Gain a comprehensive understanding of the TikTok experience and its underlying components. Maintain services to meet service level agreements (SLAs) and service level...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data Security: TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S., responsible for protecting user data and ensuring the security of the TikTok platform. Responsibilities: Gain a deep understanding of the TikTok...


  • Los Angeles, California, United States ICON Consultants, Inc. Full time

    Job Title: Site Reliability Engineer - Datacenter Expert. Location: Remote. Pay Rate: $100/hour + benefits. Assignment Length: 3-month W2 Contract. Industry: Technology. The ideal candidate will have experience with system operations and running large-scale, massively distributed infrastructure. Responsibilities: Data monitoring and alerting, data...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data Security: TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok...


  • Los Angeles, California, United States Vale Full time

    About the Role: We are seeking a highly skilled Reliability Advisor/Engineer to join our team at the Voisey's Bay Mine Site in Labrador, Canada. As a key member of our Reliability department, you will play a critical role in delivering improved performance of our assets by implementing and optimizing our reliability program. Key Responsibilities: Identify and...