Site Reliability Engineer

4 weeks ago


Seattle, Washington, United States TikTok Full time
About the Role

TikTok is seeking a skilled Site Reliability Engineer to join our AML team, where you will combine system engineering and machine learning expertise to develop and run a massively distributed AI/ML recommendation system.

Responsibilities
  • Design, build, and maintain highly available, scalable, and fault-tolerant systems.
  • Monitor and analyze system performance, identifying and resolving issues before they cause user impact.
  • Develop and maintain automated monitoring, alerting, and incident response systems.
  • Collaborate closely with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind.
  • Implement and maintain security best practices and ensure compliance with regulatory requirements.
  • Participate in on-call rotations and respond to issues and incidents within and outside of normal business hours.
Requirements
  • Expertise in analyzing and troubleshooting Linux-based distributed systems.
  • Bachelor's/Master's degree in Computer Science, Computer Engineering, or equivalent years of experience in an SRE or software engineering role.
  • Experience programming with at least one commonly used language (C, C++, Python, Go).
  • Strong understanding of data structures and algorithms.
  • Competent knowledge of relational database systems.
Preferred Qualifications
  • Ability to design and maintain large-scale systems.
  • Strong understanding of code optimization and routine task automation.
  • Proficiency in at least one machine learning framework: TensorFlow, PyTorch, MXNet, or PaddlePaddle.
About TikTok

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach.



  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. At Apple, we're looking for a highly skilled Site Reliability Engineer to join our dynamic team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure. Key Responsibilities: Design, implement, and maintain security measures,...


  • Seattle, Washington, United States HireIO Inc Full time

    Job Title: Site Reliability Engineer. HireIO Inc is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our distributed systems. Key Responsibilities: Design and implement scalable and reliable systems. Collaborate with cross-functional...


  • Seattle, Washington, United States Phaidra Full time

    About Phaidra: Phaidra is a cutting-edge technology company that's revolutionizing the industrial automation sector. Our mission is to empower facilities to adapt and improve over time, leveraging AI-powered control systems that learn and evolve continuously. We're a team of innovators, engineers, and problem-solvers who share a passion for creating...


  • Seattle, Washington, United States Sogeti Full time

    Site Reliability Engineer. Job Summary: We are seeking an experienced Site Reliability Engineer to join our team. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design, implement, and maintain scalable and reliable cloud...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. At Apple, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure. About the Role: We are seeking a talented and motivated individual to join our dynamic...


  • Seattle, Washington, United States Sogeti Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Sogeti. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our software systems and infrastructure. Key Responsibilities: Develop, maintain, and configure cloud observability...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled and motivated Site Reliability Engineer to join our dynamic and growing team at Apple. About the Role: As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure. Key Responsibilities: Design, implement,...


  • Seattle, Washington, United States TikTok Full time

    Site Reliability Engineer. About the Role: TikTok is seeking a highly skilled Site Reliability Engineer to join our US Data Security team. As a key member of our Video Platform team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide. Responsibilities: Design...


  • Seattle, Washington, United States Phaidra Full time

    About Phaidra: Phaidra is a pioneering company in the industrial automation sector, leveraging AI-powered control systems to enable facilities to adapt and improve over time. Our mission is to revolutionize the way industrial facilities operate, making them more efficient, sustainable, and responsive to their environment. Job Description: We are seeking a highly...


  • Seattle, Washington, United States TikTok Full time

    About TikTok U.S. Data Security: TikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the security of our platform. Responsibilities: We are seeking a highly motivated and experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. At Apple, we're looking for a skilled Site Reliability Engineer to join our Object Storage SRE team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our cloud storage systems. About the Role: We're seeking a seasoned software and systems engineer with a...


  • Seattle, Washington, United States Sogeti Full time

    Site Reliability Engineer. We are seeking an experienced Site Reliability Engineer to join our team at Sogeti. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement scalable and reliable cloud infrastructure using Azure or...


  • Seattle, Washington, United States Oracle Full time

    Job Description: Oracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, develop, and deploy automation tools to improve the efficiency and reliability of our...


  • Seattle, Washington, United States Nerdshub E Pvt Ltd Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Nerdshub E Pvt Ltd. As a Site Reliability Engineer, you will be responsible for ensuring the health and stability of our production systems, developing monitoring dashboards, and configuring alerts to automate system recovery. Key...


  • Seattle, Washington, United States TikTok Full time

    Site Reliability Engineer. About the Role: TikTok is seeking an experienced Site Reliability Engineer to join our USDS Video Platform team. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide. Responsibilities: Design and implement...


  • Seattle, Washington, United States Capgemini Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our software systems and infrastructure. Key Responsibilities: Develop, maintain, and configure cloud observability systems (e.g.,...