Current jobs related to Reliability Engineering Specialist - Mountain View, California - TikTok


  • Mountain View, California, United States Aeva, Inc Full time

    Aeva, Inc. is pushing the boundaries of sensing and perception for autonomous vehicles and beyond. Our 4D LiDAR technology offers unparalleled instantaneous velocity measurement, long-range performance at high resolutions, and immunity to LiDAR or sunlight interference. We are seeking a skilled Reliability Testing Specialist to join our dynamic Quality and...


  • Mountain View, California, United States Yoh Full time

    Job SummaryWe are seeking a highly skilled Reliability Test Engineer to join our team at Yoh, a Day & Zimmermann company. As a Reliability Test Engineer, you will be responsible for ensuring the reliability and quality of our products through various testing and validation processes.Key ResponsibilitiesDevelop and execute reliability test plans and...


  • Mountain View, California, United States Yoh, A Day & Zimmermann Company Full time

    Job DescriptionReliability Test EngineerJob Summary:We are seeking a highly skilled Reliability Test Engineer to join our team. As a Reliability Test Engineer, you will be responsible for designing and executing reliability tests to ensure the quality and performance of our products.Key Responsibilities:Design and execute reliability tests to evaluate the...


  • Mountain View, California, United States Yoh Full time

    Job Title: Reliability Test EngineerWe are seeking a skilled Reliability Test Engineer to join our team at Yoh, a Day & Zimmermann company. As a Reliability Test Engineer, you will be responsible for executing established reliability test procedures and performing various environmental, mechanical, and certification testing.Key Responsibilities:Execute...


  • Mountain View, California, United States Yoh, A Day & Zimmermann Company Full time

    Job DescriptionReliability Test EngineerJob Summary:We are seeking a highly skilled Reliability Test Engineer to join our team. As a Reliability Test Engineer, you will be responsible for designing and executing reliability tests to ensure the quality and performance of our products.Key Responsibilities:Design and execute reliability tests to evaluate the...


  • Mountain View, California, United States Atlassian Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Atlassian. As a Site Reliability Engineer, you will play a critical role in ensuring the performance, reliability, and scalability of our cloud-based services.ResponsibilitiesDesign, implement, and maintain scalable and reliable cloud infrastructureCollaborate with...


  • Mountain View, California, United States Aeva, Inc Full time

    About Aeva, Inc.Aeva is revolutionizing the sensing and perception landscape for autonomous vehicles and beyond. Our cutting-edge 4D LiDAR technology offers unparalleled instantaneous velocity measurement, long-range performance, and resistance to LiDAR or sunlight interference. This innovative solution is built from the ground up at silicon photonics scale...


  • Mountain View, California, United States Yoh, A Day & Zimmermann Company Full time

    Job DescriptionYoh, a Day & Zimmermann company, is seeking a highly skilled Reliability Test Engineer to join our team. As a Reliability Test Engineer, you will be responsible for ensuring the reliability and quality of our products through various testing procedures.Key Responsibilities:Develop and execute reliability test plans and procedures to ensure the...


  • Mountain View, California, United States Optomi Full time

    Job Title: Site Reliability EngineerOptomi, in partnership with a large consulting firm, is seeking an experienced Site Reliability Engineer for their Remote team. This position requires a versatile, highly motivated individual capable of supplying frontline technical and operational support to our Site Reliability teams.As a vital part of the Reliability...


  • Mountain View, California, United States Moveworks Full time

    About MoveworksMoveworks is a leading AI startup that provides a universal AI copilot for search and automation across all business applications. Our mission is to empower employees to work faster and more efficiently by eliminating repetitive support issues and delivering instant knowledge.Job DescriptionWe are seeking a highly skilled Site Reliability...


  • Mountain View, California, United States Moveworks Full time

    About MoveworksMoveworks is a leading AI-powered automation platform that helps businesses streamline their operations and improve employee productivity. Our innovative technology enables employees to find information and get support in one place, reducing costs and increasing efficiency.Job DescriptionWe are seeking a highly skilled Site Reliability...


  • Mountain View, California, United States Aeva, Inc Full time

    About Aeva, IncAeva is a pioneering company in the field of sensing and perception for autonomous vehicles and beyond. Our innovative 4D LiDAR technology offers unparalleled performance, range, and reliability.Role OverviewWe are seeking a highly skilled Reliability Test Engineer to join our dynamic Quality and Reliability Team. In this critical role, you...


  • Mountain View, California, United States Aeva, Inc Full time

    About Aeva, Inc.Aeva is a pioneering company in the field of sensing and perception for autonomous vehicles and beyond. Our innovative 4D LiDAR technology is built from the ground up at silicon photonics scale for mass-market applications. We are seeking a highly skilled Reliability Test Engineer to join our dynamic Quality and Reliability Team.Key...


  • Mountain View, California, United States Moveworks Full time

    About MoveworksMoveworks is a leading AI startup that provides a universal AI copilot for search and automation across all business applications. Our mission is to empower employees to work faster and more efficiently by eliminating repetitive support issues and delivering instant knowledge.Job DescriptionWe are seeking a highly skilled Site Reliability...


  • Mountain View, California, United States Atlassian Full time

    About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the performance and reliability of our services. You will work closely with our teams to identify and resolve issues, and develop solutions to improve our systems.Key Responsibilities:Investigate...


  • Mountain View, California, United States Tik Tok Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer to join our Applied Machine Learning (AML) team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining highly available, scalable, and fault-tolerant systems.ResponsibilitiesDesign and develop large-scale systems that meet the needs of our AML...


  • Mountain View, California, United States Synopsys Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Platform Team at Synopsys. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our engineering environment. You will work closely with our development teams to design, implement, and operate scalable and efficient...


  • Mountain View, California, United States Aurora Innovation Full time

    Job Title: Staff Hardware Reliability EngineerAurora Innovation is seeking a highly skilled Staff Hardware Reliability Engineer to join our team. As a key member of our Hardware Reliability team, you will be responsible for ensuring the robustness and dependability of hardware systems in the Aurora hardware stack.Job Summary:We are looking for a talented...


  • Mountain View, California, United States Tik Tok Full time

    About the Role:This is a Site Reliability Engineer position focusing on data pipeline reliability for the Video Platform team in USDS.Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...


  • Mountain View, California, United States Tik Tok Full time

    About the RoleThis is a Site Reliability Engineer position, focusing on the data pipeline reliability for the Video Platform team in USDS.Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...

Reliability Engineering Specialist

2 months ago


Mountain View, California, United States TikTok Full time

TikTok stands as a premier platform for short-form mobile video, dedicated to fostering creativity and delivering joy. Our global presence spans numerous cities, enhancing our mission to protect users and content creators worldwide.

The Trust and Safety Engineering Team is rapidly expanding, tasked with developing advanced machine learning models and systems aimed at combating internet abuse and fraud on our platform. Our commitment is to safeguard billions of users and publishers daily.

Utilizing cutting-edge machine learning technologies, we analyze vast amounts of data generated on our platform to enhance our trust and safety systems. Through our ongoing efforts, TikTok strives to provide an exceptional user experience, bringing joy to individuals globally.

In this role, you will tackle complex challenges associated with scalability, leveraging your expertise in coding, algorithms, complexity analysis, and large-scale system design. We cultivate a culture characterized by diversity, intellectual curiosity, openness, and effective problem-solving, encouraging collaboration while promoting autonomy.

Key Responsibilities
  • Oversee daily operations of data services and real-time/batch data pipelines, including SLA management, system deployment, performance optimization, and troubleshooting.
  • Develop tools and automation to enhance system administration and operational efficiency.
  • Participate in regular on-call duties to ensure system reliability.
  • Engage in and refine the entire service lifecycle, from inception and design through development, capacity planning, launch reviews, deployment, operation, and continuous improvement.
  • Design and implement software platforms and monitoring frameworks for effective, automated, and intelligent service-oriented architecture (SOA) governance.
  • Ensure sustainable system scalability through automation; drive improvements in system reliability, efficiency, and speed.
  • Practice responsible user support, incident response, and conduct blameless postmortems.
Qualifications
  • Bachelor's degree in Computer Science or a related field, with a minimum of 3 years of relevant experience.
  • Proven independent thinking and troubleshooting capabilities.
  • Proficiency in programming languages such as Python, Go, C, C++, Java, or Rust.
  • Familiarity with backend systems including MySQL, Redis, Nginx, Kafka, Kubernetes, Docker, and big data technologies like Hadoop, Spark, Flink, Hive, OLAP, and ClickHouse.
  • Understanding of Unix/Linux system internals, networking, and distributed systems.
  • Strong communication and coordination skills.
  • Experience in Trust & Safety is advantageous.

TikTok is devoted to fostering an inclusive environment where employees are recognized for their skills, experiences, and unique perspectives. Our platform connects individuals globally, and we aim to reflect the diverse communities we serve. We believe that every individual should be evaluated based on their strengths and experiences, regardless of their background or identity.

We are committed to providing reasonable accommodations throughout our recruitment process.