Site Reliability Engineer, Edge

3 weeks ago


Seattle, Washington, United States Tik Tok Full time
{"title": "Edge Site Reliability Engineer", "subtitle": "Build and Run Large-Scale Infrastructure", "content": "

At TikTok, we're committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace.

As an Edge Site Reliability Engineer, you'll have the opportunity to manage complex systems at scale, including hyperscale datacenters, public cloud, global content distribution networks, and load balancers that handle Tbps of traffic.

You'll collaborate with various teams to translate business needs into concrete action items and improvements in system design or procedures.

We follow a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department.

Responsibilities:

  • Build data pipelines, tools, automations, visualizations, and monitors to facilitate the operation and optimization of edge services.
  • Data monitoring and alerting, data quality assurance, and anomaly detection.
  • Document team processes and policies, including methods of engagement and SLOs.
  • Analyze, design, and implement solutions at the system level to remove bottlenecks and improve edge service performance.
  • Implement monitoring and alerting to improve issue detection and response.

Qualifications:

  • Master's degree (or Bachelor's degree with 2+) years of experience in Computer Engineering, Electrical Engineering, Computer Science, or related major.
  • 3+ years experience working with Unix Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols.
  • 2+ years experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.
  • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.

TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://www.tiktok.com/. This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.

", "requirements": "

Minimum Qualifications:

  • Master's degree (or Bachelor's degree with 2+) years of experience in Computer Engineering, Electrical Engineering, Computer Science, or related major.
  • 3+ years experience working with Unix Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols.
  • 2+ years experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.
  • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.

Preferred qualifications:

  • Experience with the Hadoop ecosystem - HDFS, Yarn, Spark, etc.
  • Self-driven and capable of working with ambiguity and moving projects from concept to delivery.
  • Experience in building solutions with AWS, Google, Azure, and other cloud services.
  • Experience in networking technologies such TCP/IP, BGP, DNS, etc. in a carrier-grade environment.
  • Experience in developing and operating one or more of the following systems: OpenStack, Kubernetes, Nginx, ipvs, ELK stack, Hadoop, etc.
", "company": "TikTok"}

  • Seattle, Washington, United States Tik Tok Full time

    About the RoleTikTok is seeking a highly skilled Site Reliability Engineer to join our Edge team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our edge services.ResponsibilitiesDesign, build, and maintain data pipelines, tools, and automations to facilitate the operation and...


  • Seattle, Washington, United States Tik Tok Full time

    About the RoleTikTok is seeking a highly skilled Site Reliability Engineer to join our Edge team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, fault-tolerance, and scalability of our edge services.ResponsibilitiesDesign, build, and maintain data pipelines, tools, and automations to facilitate the operation and...


  • Seattle, Washington, United States Tik Tok Full time

    {"title": "Edge Site Reliability Engineer", "subtitle": "Build and Run Large-Scale Infrastructure", "content": "At TikTok, we're committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace.We're seeking an Edge Site...


  • Seattle, Washington, United States Phaidra Full time

    About PhaidraPhaidra is a cutting-edge technology company that's revolutionizing the industrial automation sector. Our mission is to empower facilities to adapt and improve over time, leveraging AI-powered control systems that learn and evolve continuously.We're a team of innovators, engineers, and problem-solvers who share a passion for creating...


  • Seattle, Washington, United States Phaidra Full time

    About PhaidraPhaidra is a pioneering company in the industrial automation sector, leveraging AI-powered control systems to enable facilities to adapt and improve over time.Our mission is to revolutionize the way industrial facilities operate, making them more efficient, sustainable, and responsive to their environment.Job DescriptionWe are seeking a highly...


  • Seattle, Washington, United States Saxon Global Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our Data Platform Services team at Starbucks. As a key member of our team, you will be responsible for maintaining and improving the data platform that supports various Starbucks services.Key ResponsibilitiesEnsure the health and stability of our production systemDevelop and...


  • Seattle, Washington, United States Apple Full time

    Job SummaryApple is seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Services team. As a key member of our team, you will be responsible for designing, implementing, and operating large-scale cloud infrastructure to support Apple's internet services.About the RoleWe are looking for a strong, enthusiastic developer with a passion...


  • Seattle, Washington, United States Apple Full time

    About the RoleAt Apple, we're looking for talented Site Reliability Engineers to join our Apple Services Engineering team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our services, including iCloud, iTunes, Siri, and Maps.Key ResponsibilitiesDesign, build, and operate large-scale...


  • Seattle, Washington, United States Gusto Full time

    About GustoGusto is a leading provider of modern, online people platforms that empower small businesses to manage their teams effectively. Our comprehensive suite of tools includes full-service payroll, health insurance, 401(k)s, expert HR, and team management solutions. With offices in Denver, San Francisco, and New York, we serve over 300,000 businesses...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key Responsibilities:Design, implement, and maintain security measures,...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key Responsibilities:Design, implement, and maintain security measures,...


  • Seattle, Washington, United States Moloco Full time

    About MolocoMoloco is a cutting-edge machine learning company that empowers organizations to grow and unlock the full value of their unique first-party data. We're a leader in the advertising technology industry, recognized for our innovative approach to performance marketing and visionary product infrastructure.Our MissionWe're on a mission to deliver...


  • Seattle, Washington, United States Sogeti Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Sogeti. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our software systems and infrastructure.Key Responsibilities:Develop, maintain, and configure cloud observability...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key Responsibilities:Design, implement, and maintain security measures,...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled and motivated Site Reliability Engineer to join our dynamic and growing team at Apple.About the RoleAs a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key ResponsibilitiesDesign, implement,...


  • Seattle, Washington, United States Tik Tok Full time

    {"title": "Site Reliability Engineer", "content": "About the RoleTikTok is seeking a highly skilled Site Reliability Engineer to join our US Data Security team. As a key member of our Video Platform team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide.ResponsibilitiesDesign...


  • Seattle, Washington, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the security of our platform.ResponsibilitiesWe are seeking a highly motivated and experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the...


  • Seattle, Washington, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the security of our platform.ResponsibilitiesWe are seeking a highly motivated and experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the...


  • Seattle, Washington, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, develop, and deploy automation tools to improve the efficiency and reliability of our...


  • Seattle, Washington, United States Tik Tok Full time

    {"title": "Site Reliability Engineer", "content": "About the RoleTikTok is seeking an experienced Site Reliability Engineer to join our USDS Video Platform team. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide.ResponsibilitiesDesign and implement...