Site Reliability Engineer

4 weeks ago


Los Angeles, California, United States Tik Tok Full time
About TikTok U.S. Data Security

TikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to users worldwide. Our mission is to empower creators and communities to express themselves authentically.

Our Team

The U.S. Data Security team is responsible for protecting sensitive data and information, ensuring the highest level of availability and security for our users. We work closely with various teams within the Video Architecture platform to deliver reliable and scalable data pipelines.

The Role

We are seeking a Site Reliability Engineer to join our team, focusing on data pipeline reliability for the Video Platform. As a Data SRE, you will monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data possible.

Responsibilities:
  • Manage day-to-day operations of data services, real-time/batch data pipelines, including Service Level Agreement management, pipeline deployment, performance tuning, and troubleshooting.
  • Proactively monitor and troubleshoot data pipelines and systems for performance issues, errors, or anomalies.
  • Create tools, build alarms, and dashboards to drive internal process improvements and automation.
  • Improve systems reliability, efficiency, and velocity through scaling, optimization of resources and data processing workflows, potentially refactoring code or implementing new solutions.
  • Develop and deploy new reliable and scalable data pipelines and infrastructure components as required by business needs.
Requirements:
  • Bachelor's in Computer Science or a related technical background involving software/system engineering, or equivalent working experience.
  • Good programming experience with SQL and at least one of the following languages: Java, Python, Go, or Scala.
  • Experience in data engineering, with a focus on data systems reliability, scalability, and performance.
Preferred Qualifications:
  • Solid experience with big data technologies (e.g., Hadoop, Spark, Flink, YARN) and databases (SQL, NoSQL).
  • Knowledge of data pipeline and workflow management tools (e.g., Airflow, Luigi).
  • Demonstrated independent thinking capabilities and troubleshooting skills in large-scale distributed systems.
  • Good communication and coordination skills.
Why Join Us

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. We are passionate about this and hope you are too.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws.



  • Los Angeles, California, United States ICON Consultants, Inc. Full time

    Job Title: Site Reliability EngineerAt ICON Consultants, Inc., we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our datacenter infrastructure.Responsibilities:Data Monitoring and Alerting: Design and implement data...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key Responsibilities:Implement solutions that...


  • Los Angeles, California, United States eTek IT Services, Inc. Full time

    Job OverviewAs a Site Reliability Engineer at eTek IT Services, Inc., you will play a vital role in ensuring the reliability, performance, and scalability of our infrastructure and applications. This is a unique opportunity to work with cutting-edge technology and make a significant impact on our organization's success.Key ResponsibilitiesDesign and...


  • Los Angeles, California, United States City National Bank Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key ResponsibilitiesImplement solutions that improve stability, security,...


  • Los Angeles, California, United States eTek IT Services, Inc. Full time

    Job OpportunityWe are seeking a highly skilled Site Reliability Engineer to join our team at eTek IT Services, Inc.Job SummaryAs a Site Reliability Engineer, you will be responsible for ensuring the high availability and scalability of our cloud-based systems. You will work closely with our development team to design, implement, and maintain reliable systems...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our engineering organization, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems.Key Responsibilities:Design, build, and manage...


  • Los Angeles, California, United States City National Bank Full time

    About the RoleWe are seeking a highly skilled Site Reliability Principal Engineer to join our team at City National Bank. As a Site Reliability Principal Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key ResponsibilitiesArchitect solutions that improve the...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our cloud operations team, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the data center or cloud platform.Key...


  • Los Angeles, California, United States Loft Orbital Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Loft Orbital. As a key member of our Site Reliability team, you will be responsible for ensuring the reliability, scalability, and maintainability of our ground segment infrastructure.Key ResponsibilitiesCollaborate with development, operations, and IT teams to...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our engineering organization, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our software platforms.Key Responsibilities:Architect solutions to...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S.This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols...


  • Los Angeles, California, United States Tik Tok Full time

    About the Role:This is a Site Reliability Engineer position, focusing on the data pipeline reliability for the Video Platform team in USDS.Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...


  • Los Angeles, California, United States Abbott Laboratories company Full time

    About the RoleAbbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-changing technologies spans the spectrum of healthcare, with leading businesses and products in diagnostics, medical devices, nutritionals and branded generic medicines.As a Senior Site Reliability Engineer, you will play a...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the reliability of our platform.ResponsibilitiesGain a comprehensive understanding of the TikTok experience and its underlying componentsMaintain services to meet service level agreements (SLAs) and service level...


  • Los Angeles, California, United States Tik Tok Full time

    About the RoleTikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. As a Site Reliability Engineer on our Video Platform team, you will play a critical role in ensuring the reliability and stability of our video system, which provides excellent experiences for billions of users around the...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S., responsible for protecting user data and ensuring the security of the TikTok platform.ResponsibilitiesGain a deep understanding of the TikTok...


  • Los Angeles, California, United States ICON Consultants, Inc. Full time

    **Job Title:** Site Reliability Engineer - Datacenter Expert**Location:** Remote**Pay Rate:** $100/hour + benefits**Assignment Length:** 3-month W2 Contract**Industry:** TechnologyThe ideal candidate will have experience with system operations and running large-scale, massively distributed infrastructure.Responsibilities:Data monitoring and alerting, data...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S.Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S.This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols...


  • Los Angeles, California, United States Vale Full time

    About the RoleWe are seeking a highly skilled Reliability Advisor/Engineer to join our team at the Voisey's Bay Mine Site in Labrador, Canada. As a key member of our Reliability department, you will play a critical role in delivering improved performance of our assets by implementing and optimizing our reliability program.Key ResponsibilitiesIdentify and...