Site Reliability Engineer

4 weeks ago


New York, New York, United States TikTok Full time
About TikTok U.S. Data Security

TikTok is a leading platform for short-form mobile video, and our mission is to inspire creativity and bring joy. Our U.S. Data Security division is a subsidiary of TikTok, dedicated to protecting user data and ensuring the security of our platform.

Job Summary

We are seeking a highly skilled Site Reliability Engineer to join our AML team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining highly available, scalable, and fault-tolerant systems. You will work closely with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind.

Responsibilities
  • Design, build, and maintain highly available, scalable, and fault-tolerant systems.
  • Monitor and analyze system performance, identifying and resolving issues before they cause user impact.
  • Develop and maintain automated monitoring, alerting, and incident response systems.
  • Collaborate closely with software engineering teams to ensure that applications are designed with reliability, scalability, and performance in mind.
  • Implement and maintain security best practices and ensure compliance with regulatory requirements.
  • Participate in on-call rotations and respond to issues and incidents within and outside of normal business hours.
Requirements
  • Expertise in analyzing and troubleshooting Linux-based distributed systems.
  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or equivalent years of experience in an SRE or software engineering role.
  • Experience programming with at least one commonly used language (C, C++, Python, Go).
  • Strong understanding of data structures and algorithms.
  • Competent knowledge of relational database systems.
Preferred Qualifications
  • Ability to design and maintain large-scale systems.
  • Strong understanding of code optimization and routine task automation.
  • Proficiency in at least one machine learning framework (TensorFlow, PyTorch, MXNet, or PaddlePaddle).
About Us

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. We are passionate about celebrating our diverse voices and creating an environment that reflects the many communities we reach.

We are committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us.



  • New York, New York, United States CapB InfoteK Full time

    Job Title: Site Reliability Engineer
    About the Role: At CapB InfoteK, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
    Key Responsibilities: • Develop and build low-level component...


  • New York, New York, United States Lorven Technologies Full time

    Job Title: Site Reliability Engineer
    Lorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
    Key Responsibilities: Design, implement, and maintain scalable and highly available...


  • New York, New York, United States Lorven Technologies Full time

    Job Title: Site Reliability Engineer
    Lorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
    Key Responsibilities: Design, implement, and maintain infrastructure automation...


  • New York, New York, United States Insight Global Full time

    Job Summary
    We are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will be responsible for ensuring the uptime and reliability of our production and non-production environments. You will work closely with our development teams to build and maintain the infrastructure and applications...


  • New York, New York, United States Insight Global Full time

    Job Title: Site Reliability Engineer
    We are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will be responsible for ensuring the uptime and reliability of our production and non-production environments.
    Key Responsibilities: Monitor availability and system health to ensure optimal...


  • New York, New York, United States Cynet Systems Full time

    Job Title: Site Reliability Engineer
    Job Summary: Cynet Systems is seeking a highly skilled Site Reliability Engineer to lead the development and implementation of geospatial application performance monitoring strategies. The ideal candidate will have a strong background in Site Reliability Engineering (SRE) and proven experience in using Dynatrace for...


  • New York, New York, United States Phaxis Full time

    Site Reliability Engineer
    We are seeking an experienced Site Reliability Engineer to join our team at Phaxis. As a Site Reliability Engineer, you will be responsible for designing and building scalable and resilient systems, collaborating with engineering teams to advocate for optimal system use, and managing our centralized development infrastructure.
    Key...


  • New York, New York, United States Apollo Solutions Full time

    Site Reliability Engineer
    Apollo Solutions is partnering with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security. The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team. The Site Reliability Engineer...


  • New York, New York, United States Grafbase, Inc. Full time

    About the Role
    We are seeking a highly skilled Site Reliability Engineer to join our Engineering team at Grafbase, Inc. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, availability, and performance of our systems and services. You will collaborate with cross-functional teams to design, implement, and maintain...


  • New York, New York, United States Grafbase, Inc. Full time

    About the Role
    We are seeking a highly skilled Site Reliability Engineer to join our Engineering team at Grafbase, Inc. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems and services.
    Key Responsibilities
    Collaborate with cross-functional teams to develop and deploy software...


  • New York, New York, United States Alchemy Full time

    About the Role
    Alchemy is seeking a highly skilled Site Reliability Engineer to join our Infrastructure team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our globally used developer platform. Our mission is to empower builders with the tools they need to create exceptional on-chain products....


  • New York, New York, United States City National Bank Full time

    Job Title: Site Reliability Engineer
    We are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.
    Key Responsibilities: Design and implement solutions...


  • New York, New York, United States Grafbase, Inc. Full time

    About the Role
    We are seeking a highly skilled Site Reliability Engineer to join our Engineering team at Grafbase, Inc. As an SRE, you will play a critical role in ensuring the reliability, availability, and performance of our systems and services.
    Key Responsibilities
    Collaborate with cross-functional teams to ensure software is developed and deployed for...


  • New York, New York, United States Valstro Full time

    About Valstro
    Valstro is a FinTech company that is revolutionizing the trading industry with its cloud-first, next-gen trading solutions. As a people-first company, we prioritize collaboration, motivation, and support to deliver exceptional value to our clients.
    Job Description
    We are seeking a highly skilled Site Reliability Engineer to join our team. As a key...


  • New York, New York, United States Peloton Full time

    About the Role
    Peloton is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our platform.
    Your Daily Impact
    Design and implement automated infrastructure provisioning and deployment processes using Terraform and...


  • New York, New York, United States Lorven Technologies Full time

    Job Title: Site Reliability Engineer
    Lorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.
    Key Responsibilities: Design and implement infrastructure automation using Ansible...


  • New York, New York, United States Insight Global Full time

    Job Summary
    We are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our production and non-production environments.
    Key Responsibilities
    Monitor availability and system health to ensure optimal performance
    Design...


  • New York, New York, United States TikTok Full time

    About the Role
    TikTok is seeking a highly skilled Site Reliability Engineer to join our AML team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining highly available, scalable, and fault-tolerant systems.
    Responsibilities
    Design and implement large-scale systems to ensure high availability and scalability.
    Monitor...


  • New York, New York, United States TikTok Full time

    About TikTok U.S. Data Security
    TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S., responsible for providing oversight and protection of the TikTok platform and U.S. user data. Our focus is on delivering a secure and reliable...