Senior Cloud Reliability Engineer

1 week ago


San Jose California, United States Hireio, Inc. Full time
About the Company

Hireio, Inc. is a leading technology company that specializes in short-form mobile video hosting services. With over 1.3 billion mobile downloads in the United States and 2 billion worldwide, we have established ourselves as a leader in the industry.

About the Team

Our Data Infrastructure team is a pioneer in innovation, seamlessly merging software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems. We take pride in overseeing one of the industry's most extensive cloud infrastructures.

Job Summary

We are seeking a Senior Cloud Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing, developing, and operating cloud-managed, scalable, and reliable systems.

Responsibilities:
  • Participate in and enhance the complete service lifecycle, from inception and design, through development, capacity planning, launch reviews, deployment, operation, and refinement.
  • Design and implement software platforms and monitoring frameworks to govern service-oriented architecture (SOA) efficiently, automatically, and intelligently.
  • Develop and manage components of cloud-managed data infrastructure, encompassing technologies such as Kubernetes, Redis, MySQL, Flink, and more.
  • Establish sustainable mechanisms for scaling systems, such as automation, to drive enhancements in reliability, efficiency, and velocity.
  • Provide sustainable user support, manage incident responses, and conduct blameless postmortems as part of our ongoing efforts to improve our systems.
Requirements:
  • Bachelor's degree in Computer Science or a related technical field with 5+ years of experience
  • Experience programming in one of the following languages: C, C++, Java, Python, Go, and Rust
  • Familiar with Unix/Linux system internals, networking, and distributed systems
  • [Preferred] Experience in MySQL, Redis, Nginx, Kubernetes, Docker, OpenStack, Hadoop, Spark, Flink, etc.
  • [Preferred] Experience in designing and analyzing large-scale distributed systems
  • [Preferred] Strong skills in problem-solving and communication
  • [Preferred] Bilingual in Mandarin and English
Benefits

We offer a comprehensive benefits package, including:

  • 100% premium coverage for employee medical insurance
  • 75% premium coverage for dependents
  • Health Savings Account (HSA) with a company match
  • Dental, Vision, Short/Long-term Disability, Basic Life, Voluntary Life, and AD&D insurance plans
  • Flexible Spending Account (FSA) options
  • 10 paid holidays per year
  • 17 days of Paid Personal Time Off (PPTO)
  • 10 paid sick days per year
  • 12 weeks of paid Parental leave
  • 8 weeks of paid Supplemental Disability
  • Mental and emotional health benefits through our EAP and Lyra
  • 401K company match
  • Gym and cellphone service reimbursements


  • San Jose, California, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security company that serves thousands of enterprise customers worldwide, including 40% of Fortune 500 companies. Founded in 2007, our mission is to make the cloud a safe and secure place for businesses to operate. As the operator of the world's largest security cloud, we accelerate digital transformation for...


  • San Jose, California, United States Zscaler Full time

    About Zscaler Zscaler, a leader in cloud security, has been dedicated to creating a secure digital environment for enterprises since its inception in 2007. With a mission to enhance the safety of cloud operations and improve user experiences, Zscaler serves a vast array of clients, including a significant portion of Fortune 500 companies. As the architect of...


  • San Jose, California, United States Zscaler Full time

    We are seeking an experienced Cloud Reliability Engineer to join our CRE team at Zscaler. As a key member of our team, you will be responsible for:Key Responsibilities:Troubleshooting and identifying the root cause of cloud reliability issues.Developing solutions and observability tools focusing on early detection and prevention for cloud and customer...


  • San Francisco, California, United States AutoRABIT Holding, Inc. Full time

    About AutoRABIT Holding, Inc.AutoRABIT Holding, Inc. is a leading provider of Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare. Our solutions enable developers to automate their daily tasks to be more productive and increase the release velocity for their development team, while meeting...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    About AutoRABIT Holding Inc.AutoRABIT Holding Inc. is a leading provider of Salesforce DevSecOps platform for regulated industries such as financial institutions, insurance, and healthcare. Our solutions enable developers to automate their daily tasks, increasing productivity and release velocity while meeting stringent security, compliance, and privacy...


  • San Jose, California, United States Hireio, Inc. Full time

    About UsHireio, Inc. is a leading video editing solution provider that aims to make content creation easier and more engaging. With a strong focus on innovation and customer satisfaction, we have established ourselves as a top player in the industry.Job DescriptionWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key...


  • San Jose, California, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security company that provides a comprehensive security platform to protect enterprises from cyber threats. With a mission to make the cloud a safe place to do business, Zscaler has built a reputation as a trusted partner for organizations around the world.Job SummaryWe are seeking an experienced Principal Software...


  • San Jose, United States Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • San Jose, United States Zscaler Full time

    We ask that you have U.S. Citizenship given this role requires work on the federal cloud platform. Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant...


  • San Francisco, California, United States SoCode US Full time

    Cloud Reliability Engineer – Graduates welcome - Up to $90KWe are thrilled to present this Cloud Reliability Engineer position with a leading software firm recognized for its groundbreaking machine learning innovations. Established 8 years ago, this company boasts a talented team of 150 engineers and is celebrated for its transformative technology within...


  • San Francisco, California, United States Abnormal Security Full time

    About the RoleAbnormal Security is a leading provider of cloud-based cybersecurity solutions, trusted by enterprises of all sizes to stop cybercrime. As a Cloud Reliability Engineer II, you will play a critical role in ensuring the reliability and availability of our products, which must scale with the growth of our customers.Our goal is to establish our...


  • San Francisco, California, United States Autodesk, Inc. Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to lead our cloud infrastructure efforts and ensure the reliability and performance of our software solutions. As a key member of our team, you will be responsible for designing, implementing, and maintaining scalable and secure cloud infrastructure to support our growing user...


  • San Jose, United States F5 Full time

    F 5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F 5 Distributed Cloud Product. Due to the nature of work this role requires US Citizenship. Primary Responsibil Reliability Engineer, Liability, Engineer, Reliability, Reliability, Technology, Support


  • San Jose, California, United States Zscaler Full time

    About ZscalerZscaler is a leading cloud security platform provider, offering a comprehensive suite of solutions to protect businesses from cyber threats. Our team of experts has built a robust platform that enables organizations to harness the power of the cloud while ensuring the security and integrity of their data.Job SummaryWe are seeking an experienced...


  • San Diego, California, United States Platform Science Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Platform Science. As a key member of our cloud operations team, you will be responsible for ensuring the reliability and performance of our cloud-based platform.Key ResponsibilitiesDevelop and enhance Continuous Integration/Continuous Deployment (CI/CD)...


  • San Francisco, California, United States Crusoe Full time

    Job Description**Mission-Driven Opportunity**Crusoe Energy is a pioneering company that aims to unlock value in stranded energy resources through the power of computation. Our mission is to align the long-term interests of the climate with the future of global computing infrastructure.**Our Approach**We co-locate mobile data centers with stranded energy...


  • California, Missouri, United States Bitwarden Inc. Full time

    About BitwardenBitwarden empowers organizations, developers, and individuals to securely manage and share sensitive information. With a transparent, open-source approach to password management, secrets management, and innovations in passwordless and passkey solutions, Bitwarden facilitates the extension of robust security practices across all online...


  • San Jose, California, United States Spectro Cloud Full time

    Company OverviewSpectro Cloud is dedicated to transforming enterprise infrastructure, enabling seamless operations from data centers to edge computing across various platforms. Our innovative solutions empower organizations to manage applications on Kubernetes in a manner that suits their unique needs.Founded by a team of seasoned experts in multi-cloud...


  • San Jose, California, United States Microsoft Corporation Full time

    At Microsoft Corporation, the Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) team is pivotal in driving the evolution of our expansive Cloud Infrastructure, which is integral to our "Intelligent Cloud" vision. SCHIE is responsible for delivering the essential infrastructure and foundational technologies that support over 200 online services,...


  • San Jose, California, United States Hireio, Inc. Full time

    About the RoleHireio, Inc. is seeking a highly skilled Senior Software Development Engineer to join our team and contribute to the development of our cloud storage solutions. As a key member of our team, you will be responsible for designing, developing, and maintaining our cloud storage systems, ensuring they meet the highest standards of performance,...