Lead Site Reliability Engineer

7 days ago


San Jose California, United States Hireio, Inc. Full time
About Hireio, Inc.

Hireio, Inc. stands at the forefront of the mobile video landscape, recognized as a premier platform for short-form video content. As a leading Unicorn startup, we have achieved remarkable milestones, including over 1.3 billion mobile downloads in the United States and 2 billion globally. With a robust user base of 1.5 billion monthly active users, we are proud to be one of the most popular social entertainment applications worldwide.

About the Team

Our Site Reliability Engineering (SRE) team is a trailblazer in the realm of data infrastructure. We expertly blend software engineering with operational excellence to architect, construct, and oversee expansive, distributed systems. Our commitment to managing one of the industry's most extensive cloud infrastructures is unwavering. As the landscape of software development evolves, the integration of diverse components has become essential, placing SRE at the heart of this transformation.

This position requires proficiency in designing, developing, and maintaining these components, ensuring they are transformed into scalable, cloud-managed, and dependable systems.

Our experts serve as vital connectors, facilitating the integration of various components to deliver high-performance systems.

The dynamic field of SRE is about actively influencing the future of technology, not merely keeping pace with it. We play a significant role in shaping the next chapter of data infrastructure and are in the process of establishing global teams. Join us in this transformative journey.

Key Responsibilities:

  • Engage in and enhance the entire service lifecycle, from conception and design to development, capacity planning, launch evaluations, deployment, operation, and continuous improvement.
  • Architect and implement software platforms and monitoring frameworks to efficiently govern service-oriented architecture (SOA).
  • Develop and oversee components of cloud-managed data infrastructure, utilizing technologies such as Kubernetes, Redis, MySQL, Flink, and others.
  • Create sustainable mechanisms for system scalability, including automation, to enhance reliability, efficiency, and speed.
  • Provide ongoing user support, manage incident responses, and conduct blameless postmortems to foster continuous system improvement.
Qualifications:

  • Bachelor's degree in Computer Science or a related technical discipline with a minimum of 5 years of relevant experience.
  • Proficiency in programming languages such as C, C++, Java, Python, Go, or Rust.
  • Familiarity with Unix/Linux system internals, networking, and distributed systems.
  • [Preferred] Experience with MySQL, Redis, Nginx, Kubernetes, Docker, OpenStack, Hadoop, Spark, Flink, etc.
  • [Preferred] Experience in designing and analyzing large-scale distributed systems.
  • [Preferred] Strong problem-solving and communication skills.
  • [Preferred] Bilingual proficiency in Mandarin and English.
Benefits:

Our benefits are thoughtfully designed to reflect our company culture and values, fostering an efficient and inspiring work environment while supporting our employees in both their professional and personal lives.

We offer the following benefits to eligible employees:

Comprehensive medical insurance with 100% premium coverage for employees and approximately 75% for dependents, along with a Health Savings Account (HSA) featuring a company match. Additional benefits include Dental, Vision, Short/Long term Disability, Basic Life, Voluntary Life, and AD&D insurance plans. Flexible Spending Account (FSA) options are also available for Health Care, Limited Purpose, and Dependent Care.

Our time-off policies include 10 paid holidays annually, 17 days of Paid Personal Time Off (PPTO) (prorated upon hire and increasing with tenure), and 10 paid sick days per year, in addition to 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.


We also provide generous mental and emotional health benefits through our Employee Assistance Program (EAP) and Lyra, along with a 401K company match, gym, and cellphone service reimbursements. The company reserves the right to modify or change these benefits programs at any time, with or without notice.



  • San Jose, United States VDart Inc Full time

    Job DescriptionJob DescriptionJob Title: Lead Site Reliability EngineerLocation: San Jose, CA (2 Days Hybrid)Duration: / Term: 6+ monthsJob Description:Experience Desired: 14+ Years. Responsibilities:Please look for 14 years hands on Coding/scripting (Ansible) , Python , Cloud Computing About the Role We seek a highly skilled and dynamic Site Reliability...


  • San Jose, California, United States Zscaler Full time

    About ZscalerAt Zscaler, our Engineering team has developed the largest cloud security platform globally, and we continue to innovate. With over 100 patents and ambitious plans for service enhancement and global expansion, our team has established us as a leader in cloud security, serving more than 15 million users across 185 countries. We invite you to...


  • San Jose, California, United States Zscaler Full time

    About ZscalerAt Zscaler, our Engineering team has developed the largest cloud security platform globally, and we continue to innovate. With over 100 patents and ambitious plans for service enhancement and global expansion, our team has established us as the leader in cloud security, serving more than 15 million users across 185 countries. We invite you to...


  • San Jose, California, United States Zscaler Full time

    About UsZscaler has developed the world's largest cloud security platform, continually innovating and expanding our services. With a robust portfolio of over 100 patents and ambitious plans for global growth, our team has established itself as a leader in cloud security, serving more than 15 million users across 185 countries. We are looking for talented...


  • San Jose, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe’s Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Jose, California, United States Adobe Full time

    Site Reliability Engineer page is loadedAdobe's Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest, and easiest way to get contracts signed and filed.You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...


  • San Jose, United States Trianz Full time

    Job Description Role: Site Reliability Engineer Employment Type: Contract – Only VISA FREE Work location: Sanjose, CA Work mode: Onsite- 2 days in a week / 3 days Remote About the Role We seek a highly skilled and dynamic Site Reliability Engineer – Consultant. In this role you will: Maintain and improve the reliability, performance, and availability of...


  • San Jose, California, United States Hireio, Inc. Full time

    Exciting Opportunity: Data Infrastructure Site Reliability Engineering (SRE) TeamJoin Hireio, Inc., a premier platform for short-form mobile video hosting services. As a trailblazer in technology, our SRE team integrates software development with infrastructure management to architect, construct, and oversee extensive, highly distributed systems. We operate...


  • San Jose, United States F5 Full time

    F 5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F 5 Distributed Cloud Product. Due to the nature of work this role requires US Citizenship. Primary Responsibil Reliability Engineer, Liability, Engineer, Reliability, Reliability, Technology, Support


  • San Jose, California, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security platform provider, offering a comprehensive suite of solutions to protect businesses from cyber threats. Our team of experts has built a robust platform that enables organizations to harness the power of the cloud while ensuring the security and integrity of their data.Job SummaryWe are seeking an experienced...


  • San Jose, United States Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • San Jose, California, United States VDart Inc Full time

    Job OverviewPosition: Lead Site Reliability EngineerLocation: San Jose, CA (Hybrid Work Model)Contract Duration: 6+ monthsExperience Required: 14+ YearsRole Summary:We are in search of a highly experienced and proactive Site Reliability Engineer Consultant. In this pivotal role, you will be responsible for:Key Responsibilities:Enhancing the reliability,...


  • San Jose, California, United States VDart Inc Full time

    Job OverviewPosition: Lead Site Reliability EngineerLocation: San Jose, CA (Hybrid Work Model)Contract Duration: 6+ monthsExperience Required: 14+ YearsRole Summary:We are in search of a highly experienced and proactive Site Reliability Engineer Consultant. In this capacity, you will be responsible for:Key Responsibilities:Enhancing the reliability,...


  • San Jose, United States Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • San Jose, California, United States Western Digital Full time

    Job OverviewCompany Overview:At Western Digital, we are dedicated to enhancing the way you store and manage data, whether it’s in your pocket, home, car, or the cloud. Our Advanced Reliability Engineering (ARE) team is committed to pioneering reliability assurance methodologies that set industry standards and encompass the entire product lifecycle for our...


  • San Diego, California, United States Dexcom Full time

    About Dexcom:Founded in 1999, Dexcom, Inc. (NASDAQ: DXCM) is a pioneer in the development and marketing of Continuous Glucose Monitoring (CGM) systems designed for use by individuals with diabetes and healthcare professionals. As a leader in the transformation of diabetes management, Dexcom is committed to providing innovative CGM technology that empowers...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job OverviewAbout AutoRABIT:AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our platform empowers developers to streamline their workflows, enhancing productivity and accelerating release cycles while adhering to...


  • San Francisco, United States Saxon Global Full time

    Lead DevOps/Site Reliability Enginee Looking for a resource more senior in the DevOps space, with a leaning toward site reliability engineering. Docker containers, Kubernetes automation Mostly focused on the automation, current pain points around deployments reliability around their data engineering processes. SRE who can go beyond the memory, what...


  • San Jose, United States TCWGlobal Full time

    Site Reliability Engineer (Kubernetes)*US citizenship or Greencard holder- W2 ContractSan Jose, CA 95134 ( LOCAL CANDIDATES ONLY- MUST BE LIVING IN SAN JOSE, CA)$80-110hr (Weekly pay + benefits)6 month contract (Excellent potential for extension)Full-time: M-F 8am-5pm (Onsite 2 days a week)***Please note: This role is only accepting candidates that currently...


  • San Jose, United States TCWGlobal Full time

    Job DescriptionJob DescriptionSite Reliability Engineer (Kubernetes)*US citizenship or Greencard holder- W2 ContractSan Jose, CA 95134 ( LOCAL CANDIDATES ONLY- MUST BE LIVING IN SAN JOSE, CA)$80-110hr (Weekly pay + benefits)6 month contract (Excellent potential for extension)Full-time: M-F 8am-5pm (Onsite 2 days a week)***Please note:This role is only...