Site Reliability Engineer, Edge

1 week ago


Mountain View, California, United States Tik Tok Full time
Job Title: Site Reliability Engineer, Edge

At TikTok, we're committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace.

About the Role

We're seeking a highly skilled Site Reliability Engineer to join our Edge team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, fault-tolerance, and scalability of our edge services. You will work closely with our engineering teams to design, implement, and operate large-scale, massively distributed infrastructure.

Responsibilities
  • Architect and implement solutions that enable internal and external customers to harness the power of TikTok's content delivery network.
  • Contribute to data pipelines, tools, automations, visualizations, and monitors to facilitate the operation and optimization of edge services.
  • Perform data monitoring and alerting, data quality assurance, and anomaly detection.
  • Document team processes and policies, including methods of engagement and SLOs.
  • Analyze, design, and implement solutions at the system level to remove bottlenecks and improve edge service performance.
  • Implement monitoring and alerting to improve issue detection and response.
Requirements
  • Bachelor's degree in Computer Engineering, Computer Science, or a related field with 2+ years of experience, or equivalent experience.
  • 2+ years of working experience in CDN performance and traffic engineering, network solution architecture, or network-focused site reliability engineering roles.
  • Experience with networking technologies such as TCP/IP, BGP, and DNS in a carrier-grade environment, plus past experience with CDN technologies.
  • 2+ years of experience in one or more programming languages such as Java, C++, or Go, or scripting experience in Shell and Python.
Preferred Qualifications
  • Experience in operating in a multi-CDN environment.
  • Understanding of IPv6 and IPv4-IPv6 coexistence technologies.
  • Self-driven and capable of working with ambiguity and moving projects from concept to delivery.
  • Experience in designing, analyzing, and building automation and tools for large-scale systems.
About TikTok

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at [insert contact information].

This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.



  • Mountain View, California, United States Tik Tok Full time

    Job Summary: TikTok is seeking a highly skilled Site Reliability Engineer - Edge Services to join our team. As a key member of our Edge SRE team, you will be responsible for ensuring the reliability, fault-tolerance, and scalability of our edge services. Responsibilities: Design and implement solutions to optimize edge service performance and remove...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: TikTok is revolutionizing the way people create and consume short-form mobile video. As a Site Reliability Engineer, Edge Services - USDS, you will play a critical role in ensuring the reliability and performance of our edge services. Key Responsibilities: Design and implement solutions to optimize edge service performance and...


  • Mountain View, California, United States Samsung Electronics America North America Full time

    Job Title: Platform Site Reliability Engineer. Samsung Ads is seeking a highly skilled Platform Site Reliability Engineer to join our Global Ads Product & Engineering team. As a key member of our team, you will play a crucial role in ensuring the reliability, scalability, and performance of our advertising technology platform. Key Responsibilities: Design,...


  • Mountain View, California, United States Samsung Electronics America North America Full time

    Transforming Advertising Technology with Samsung Ads. Samsung Ads is revolutionizing the advertising landscape with cutting-edge technology and innovative services. As a key player in this evolution, we're seeking a talented Embedded Site Reliability Engineer to join our Global Ads Product & Engineering team. Key Responsibilities: Design and implement scalable...


  • Mountain View, California, United States Optomi Full time

    Job Title: Site Reliability Engineer. Optomi, in partnership with a large consulting firm, is seeking an experienced Site Reliability Engineer for their Remote team. This position requires a versatile, highly motivated individual capable of supplying frontline technical and operational support to our Site Reliability teams. As a vital part of the Reliability...


  • Mountain View, California, United States Moveworks Full time

    About Moveworks: Moveworks is a leading AI startup that provides a universal AI copilot for search and automation across all business applications. Our mission is to empower employees to work faster and more efficiently by eliminating repetitive support issues and delivering instant knowledge. Job Description: We are seeking a highly skilled Site Reliability...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: We are seeking an experienced Site Reliability Engineer to join our global infrastructure team. As a Site Reliability Engineer, you will be responsible for building and operating large-scale, massively distributed infrastructures to ensure the reliability, fault-tolerance, and efficiency of our edge services. Responsibilities: Design, build, and...


  • Mountain View, California, United States Samsung Electronics America North America Full time

    Site Reliability Engineer - DevOps Infrastructure. At Samsung Ads, we're transforming the advertising landscape with cutting-edge technology. As a Site Reliability Engineer - DevOps Infrastructure, you'll play a crucial role in ensuring the reliability, scalability, and performance of our advertising technology platform. Key Responsibilities: Design and...


  • Mountain View, California, United States Moveworks Full time

    About Moveworks: Moveworks is a leading AI-powered automation platform that helps businesses streamline their operations and improve employee productivity. Our innovative technology enables employees to find information and get support in one place, reducing costs and increasing efficiency. Job Description: We are seeking a highly skilled Site Reliability...


  • Mountain View, California, United States Atlassian Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Atlassian. As a Site Reliability Engineer, you will play a critical role in ensuring the performance, reliability, and scalability of our cloud-based services. Responsibilities: Design, implement, and maintain scalable and reliable cloud infrastructure. Collaborate with...


  • Mountain View, California, United States Optomi Full time

    Optomi's Site Reliability Engineer Opportunity. We are seeking a skilled Site Reliability Engineer to join our team at Optomi, in partnership with a large consulting firm. This role requires a versatile and highly motivated individual who can provide frontline technical and operational support to our Site Reliability teams. Key Responsibilities: Collaborate with...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: We are seeking a skilled Site Reliability Engineer to join our Applied Machine Learning (AML) team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining highly available, scalable, and fault-tolerant systems. Responsibilities: Design and develop large-scale systems that meet the needs of our AML...


  • Mountain View, California, United States Synopsys Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Platform Team at Synopsys. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our engineering environment. You will work closely with our development teams to design, implement, and operate scalable and efficient...


  • Mountain View, California, United States Groq Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Groq. As a Principal Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our tools and services for provisioning and managing the full lifecycle of Groq hardware and...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our AML team, where you will play a critical role in designing, building, and maintaining highly available, scalable, and fault-tolerant systems. Responsibilities: Design and develop large-scale systems that meet the needs of our users. Monitor and analyze system performance,...


  • Mountain View, California, United States Tik Tok Full time

    About TikTok U.S. Data Security: TikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to millions of users worldwide. Our mission is to create a platform that connects people from diverse backgrounds and cultures. Our Team: U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S., dedicated to providing...