Site Reliability Engineer for Recommendation Systems

3 weeks ago


San Jose, California, United States Tik Tok Full time
{"title": "Recommendation Infrastructure Team", "description": "Our Mission

At TikTok, we inspire creativity and bring joy. Our platform is built to help imaginations thrive.

Our Team

We are a team of passionate individuals who believe in the power of creativity and innovation. We work together to build and optimize the architecture for our recommendation system, ensuring the best experience for our users.

What You'll Do
  • Engage in the whole lifecycle of Recommendation systems, from design consulting to launch reviews, deployment, operation, and refinement.
  • Deliver tools and software to improve the reliability and scalability of services, automate operations, and enhance R&D efficiency.
  • Build availability of large-scale services deployed across global data centers.
  • Plan, manage, and optimize cloud resources utilization, ensuring SLA of large-scale clusters.
  • Measure and monitor availability, latency, and overall service health.
  • Practice sustainable incident response and postmortems.
Qualifications
  • Bachelor's degree or above in Computer Science or related fields.
  • At least 2 years of work experience in SRE of large-scale systems deployment with high reliability and scalability.
  • Familiar with system operation skills in Linux and network.
  • Experience programming in at least one of the following languages: Python, Perl, Go, or C/C++.
  • Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Familiar with popular CI/CD procedures and environments.
  • Effective communication skills and a sense of ownership and drive.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy.

We are passionate about this and hope you are too. TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs, or other reasons protected by applicable laws.

", "lang_code": "en"}

  • San Jose, California, United States Adobe Full time

    About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our team at Adobe. As a key member of our Cloud Engineering team, you will play a critical role in designing, deploying, and optimizing our cloud services.Key ResponsibilitiesDevelop software and tools to improve the reliability and performance of our cloud servicesCollaborate...


  • San Jose, California, United States Adobe Systems Inc Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Enterprise Tools team at Adobe Systems Inc. As a key member of our team, you will be responsible for ensuring the performance, availability, and scalability of our tools and systems.Key ResponsibilitiesRun tools hosted on-premises and perform application-level...


  • San Jose, California, United States Adobe Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Adobe. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud services.Key ResponsibilitiesDevelop software and tools to design, deploy, and optimize cloud servicesProvide hands-on technical...


  • San Jose, California, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement automation scripts using shell,...


  • San Jose, California, United States Altius Technologies, Inc. Full time

    Job Title: Site Reliability EngineerAltius Technologies, Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the infrastructure and systems that support our business applications.Key Responsibilities:Design and implement automation...


  • San Jose, California, United States Adobe Systems Inc Full time

    {"title": "Site Reliability Engineer", "description": "Transforming Digital ExperiencesAt Adobe, we're passionate about empowering people to create beautiful and powerful digital experiences. We're on a mission to hire the very best and create exceptional employee experiences where everyone is respected and has access to equal opportunity.The OpportunityWe...


  • San Jose, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • San Jose, California, United States TikTok Full time

    About UsTikTok is a global leader in short-form mobile video, inspiring creativity and bringing joy to our users. Our mission is to create a platform that helps imaginations thrive, and we're committed to celebrating our diverse voices and creating an inclusive space for our employees.Job DescriptionWe're seeking a talented Senior Software Engineer -...


  • San Jose, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement automation scripts using...


  • San Jose, California, United States Tik Tok Full time

    {"title": "Site Reliability Engineer", "content": "Role OverviewTikTok is a leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. Our platform is built to help imaginations thrive, and we're seeking Site Reliability Engineers (SREs) to join our monetization technology team.The monetization technology team...


  • San Jose, California, United States Adobe Systems Inc Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at Adobe. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our cloud-based services.Key ResponsibilitiesDesign and implement scalable and highly available systems to support our cloud-based...


  • San Jose, California, United States ApTask Full time

    About ApTask:ApTask is a leading global provider of workforce solutions and talent acquisition services, dedicated to shaping the future of work.As an African American-owned and Veteran-certified company, ApTask offers a comprehensive suite of services, including staffing and recruitment solutions, managed services, IT consulting, and project management.With...


  • San Jose, California, United States Splunk Full time

    About SplunkSplunk is a leading provider of cloud-based data analytics and monitoring solutions. Our mission is to make machine data accessible, usable, and valuable to everyone.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our Cloud TechOps team. As a Site Reliability Engineer, you will be responsible for ensuring the...


  • San Jose, California, United States Tik Tok Full time

    {"title": "Site Reliability Engineer", "description": "\u003Cp\u003EAt TikTok, we're seeking Site Reliability Engineers (SREs) to join our monetization technology team.\u003C/p\u003E\u003Cp\u003EOur team works on building and running large-scale, globally distributed, fault-tolerant ads systems.\u003C/p\u003E\u003Cp\u003ESREs keep the systems up and running...


  • San Jose, California, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security company that provides a comprehensive security platform to protect enterprises from cyber threats. With a mission to make the cloud a safe place to do business, Zscaler has built a reputation for delivering innovative and effective security solutions.Job SummaryWe are seeking an experienced Site Reliability...


  • San Diego, California, United States BAE SYSTEMS Full time

    Job DescriptionAt BAE Systems, we're pushing the boundaries of innovation in the field of Site Reliability Engineering. We're seeking a highly skilled and motivated individual to join our team as a Site Reliability Engineer, where you'll play a critical role in ensuring the seamless delivery of our cloud-based solutions.Key Responsibilities:Deliver...


  • San Jose, California, United States NetApp Full time

    Job SummaryAs a Site Reliability Engineer at NetApp, you will be responsible for managing, supporting, and maintaining a reliable environment for our site. This involves ensuring the stability and security of multiple open-source systems and platforms that are run or operated in that environment.Key ResponsibilitiesBuilding and supporting a reliable site for...


  • San Jose, California, United States ByteDance Full time

    About the RoleByteDance is seeking a highly skilled Site Reliability Engineer to join our Applied Machine Learning team. As a Site Reliability Engineer, you will play a critical role in ensuring the availability and performance of our machine learning services, which are used by hundreds of millions of people around the world.ResponsibilitiesDesign and...


  • San Jose, California, United States Cisco Full time

    About the RoleCisco is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You will work closely with our development teams to identify and resolve issues, and collaborate with other teams to...


  • San Jose, California, United States Adobe Full time

    About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our team at Adobe. As a key member of our Cloud Engineering team, you will play a critical role in designing, deploying, and optimizing our cloud services.Key ResponsibilitiesDevelop software and tools to improve the reliability and performance of our cloud servicesProvide...