Lead Site Reliability Engineer
7 days ago
Hireio, Inc. stands at the forefront of the mobile video landscape, recognized as a premier platform for short-form video content. As a leading unicorn startup, we have achieved remarkable milestones, including over 1.3 billion mobile downloads in the United States and 2 billion globally. With 1.5 billion monthly active users, we are one of the most popular social entertainment applications worldwide.
About the Team
Our Site Reliability Engineering (SRE) team is a trailblazer in the realm of data infrastructure. We expertly blend software engineering with operational excellence to architect, construct, and oversee expansive, distributed systems. Our commitment to managing one of the industry's most extensive cloud infrastructures is unwavering. As the landscape of software development evolves, the integration of diverse components has become essential, placing SRE at the heart of this transformation.
This position requires proficiency in designing, developing, and maintaining these components, ensuring they are transformed into scalable, cloud-managed, and dependable systems.
Our experts serve as vital connectors, facilitating the integration of various components to deliver high-performance systems.
The dynamic field of SRE is about actively influencing the future of technology, not merely keeping pace with it. We play a significant role in shaping the next chapter of data infrastructure and are in the process of establishing global teams. Join us in this transformative journey.
Key Responsibilities:
- Engage in and enhance the entire service lifecycle, from conception and design to development, capacity planning, launch evaluations, deployment, operation, and continuous improvement.
- Architect and implement software platforms and monitoring frameworks to efficiently govern service-oriented architecture (SOA).
- Develop and oversee components of cloud-managed data infrastructure, utilizing technologies such as Kubernetes, Redis, MySQL, Flink, and others.
- Create sustainable mechanisms for system scalability, including automation, to enhance reliability, efficiency, and speed.
- Provide ongoing user support, manage incident responses, and conduct blameless postmortems to foster continuous system improvement.
Qualifications:
- Bachelor's degree in Computer Science or a related technical discipline with a minimum of 5 years of relevant experience.
- Proficiency in programming languages such as C, C++, Java, Python, Go, or Rust.
- Familiarity with Unix/Linux system internals, networking, and distributed systems.
- [Preferred] Experience with MySQL, Redis, Nginx, Kubernetes, Docker, OpenStack, Hadoop, Spark, Flink, etc.
- [Preferred] Experience in designing and analyzing large-scale distributed systems.
- [Preferred] Strong problem-solving and communication skills.
- [Preferred] Bilingual proficiency in Mandarin and English.
Our benefits are thoughtfully designed to reflect our company culture and values, fostering an efficient and inspiring work environment while supporting our employees in both their professional and personal lives.
We offer the following benefits to eligible employees:
Comprehensive medical insurance with 100% premium coverage for employees and approximately 75% for dependents, along with a Health Savings Account (HSA) featuring a company match. Additional benefits include Dental, Vision, Short- and Long-Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans. Flexible Spending Account (FSA) options are also available for Health Care, Limited Purpose, and Dependent Care.
Our time-off policies include 10 paid holidays annually, 17 days of Paid Personal Time Off (PPTO) (prorated upon hire and increasing with tenure), and 10 paid sick days per year, in addition to 12 weeks of paid Parental leave and 8 weeks of paid Supplemental Disability.
We also provide generous mental and emotional health benefits through our Employee Assistance Program (EAP) and Lyra, along with a 401(k) company match and reimbursements for gym and cellphone service. The company reserves the right to modify or change these benefit programs at any time, with or without notice.
-
Lead Site Reliability Engineer
1 month ago
San Jose, United States VDart Inc Full time. Job Title: Lead Site Reliability Engineer. Location: San Jose, CA (2 Days Hybrid). Duration/Term: 6+ months. Job Description: Experience Desired: 14+ Years. Responsibilities: Please look for 14 years of hands-on coding/scripting (Ansible), Python, and cloud computing. About the Role: We seek a highly skilled and dynamic Site Reliability...
-
Lead Site Reliability Engineer
2 weeks ago
San Jose, California, United States Zscaler Full time. About Zscaler: At Zscaler, our Engineering team has developed the largest cloud security platform globally, and we continue to innovate. With over 100 patents and ambitious plans for service enhancement and global expansion, our team has established us as a leader in cloud security, serving more than 15 million users across 185 countries. We invite you to...
-
Lead Site Reliability Engineer
2 weeks ago
San Jose, California, United States Zscaler Full time. About Zscaler: At Zscaler, our Engineering team has developed the largest cloud security platform globally, and we continue to innovate. With over 100 patents and ambitious plans for service enhancement and global expansion, our team has established us as the leader in cloud security, serving more than 15 million users across 185 countries. We invite you to...
-
Lead Site Reliability Engineer
2 weeks ago
San Jose, California, United States Zscaler Full time. About Us: Zscaler has developed the world's largest cloud security platform, continually innovating and expanding our services. With a robust portfolio of over 100 patents and ambitious plans for global growth, our team has established itself as a leader in cloud security, serving more than 15 million users across 185 countries. We are looking for talented...
-
Site Reliability Engineer
2 months ago
San Jose, United States Adobe Full time. Adobe’s Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest and easiest way to get contracts signed and filed. You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...
-
Site Reliability Engineer
2 months ago
San Jose, California, United States Adobe Full time. Adobe's Reliability Engineering team is looking for a Site Reliability Engineer (SRE) to help build and operate services like Adobe Sign. Adobe Sign is the fastest and easiest way to get contracts signed and filed. You have a track record as a site reliability engineer in large-scale SaaS businesses, and a strong...
-
Site Reliability Engineer
1 month ago
San Jose, United States Trianz Full time. Job Description. Role: Site Reliability Engineer. Employment Type: Contract – Only VISA FREE. Work location: San Jose, CA. Work mode: Onsite 2 days a week / 3 days remote. About the Role: We seek a highly skilled and dynamic Site Reliability Engineer – Consultant. In this role you will: Maintain and improve the reliability, performance, and availability of...
-
Lead Site Reliability Engineer
2 weeks ago
San Jose, California, United States Hireio, Inc. Full time. Exciting Opportunity: Data Infrastructure Site Reliability Engineering (SRE) Team. Join Hireio, Inc., a premier platform for short-form mobile video hosting services. As a trailblazer in technology, our SRE team integrates software development with infrastructure management to architect, construct, and oversee extensive, highly distributed systems. We operate...
-
Sr Site Reliability Engineer
3 weeks ago
San Jose, United States F5 Full time. F5 Inc. is actively seeking an exceptional Senior Site Reliability Engineer to play a pivotal role in our SRE team for the groundbreaking F5 Distributed Cloud Product. Due to the nature of the work, this role requires US citizenship. Primary Responsibil...
-
Reliability Engineering Lead
6 days ago
San Jose, California, United States Zscaler Full time. About Zscaler: Zscaler is a leading cloud security platform provider, offering a comprehensive suite of solutions to protect businesses from cyber threats. Our team of experts has built a robust platform that enables organizations to harness the power of the cloud while ensuring the security and integrity of their data. Job Summary: We are seeking an experienced...
-
Senior Site Reliability Engineer
3 weeks ago
San Jose, United States Zscaler Full time. Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...
-
Senior Site Reliability Engineer
2 weeks ago
San Jose, California, United States VDart Inc Full time. Job Overview. Position: Lead Site Reliability Engineer. Location: San Jose, CA (Hybrid Work Model). Contract Duration: 6+ months. Experience Required: 14+ Years. Role Summary: We are in search of a highly experienced and proactive Site Reliability Engineer Consultant. In this pivotal role, you will be responsible for: Key Responsibilities: Enhancing the reliability,...
-
Senior Site Reliability Engineer
2 weeks ago
San Jose, California, United States VDart Inc Full time. Job Overview. Position: Lead Site Reliability Engineer. Location: San Jose, CA (Hybrid Work Model). Contract Duration: 6+ months. Experience Required: 14+ Years. Role Summary: We are in search of a highly experienced and proactive Site Reliability Engineer Consultant. In this capacity, you will be responsible for: Key Responsibilities: Enhancing the reliability,...
-
Site Reliability Engineer-Federal
1 week ago
San Jose, United States Zscaler Full time. Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...
-
Lead Technologist in Reliability Engineering
2 weeks ago
San Jose, California, United States Western Digital Full time. Job Overview. Company Overview: At Western Digital, we are dedicated to enhancing the way you store and manage data, whether it’s in your pocket, home, car, or the cloud. Our Advanced Reliability Engineering (ARE) team is committed to pioneering reliability assurance methodologies that set industry standards and encompass the entire product lifecycle for our...
-
Senior Site Reliability Engineering Lead
2 weeks ago
San Diego, California, United States Dexcom Full time. About Dexcom: Founded in 1999, Dexcom, Inc. (NASDAQ: DXCM) is a pioneer in the development and marketing of Continuous Glucose Monitoring (CGM) systems designed for use by individuals with diabetes and healthcare professionals. As a leader in the transformation of diabetes management, Dexcom is committed to providing innovative CGM technology that empowers...
-
Lead Site Reliability Engineer
2 weeks ago
San Francisco, California, United States AutoRABIT Holding Inc. Full time. Job Overview. About AutoRABIT: AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our platform empowers developers to streamline their workflows, enhancing productivity and accelerating release cycles while adhering to...
-
Lead DevOps/Site Reliability Engineer
2 months ago
San Francisco, United States Saxon Global Full time. Lead DevOps/Site Reliability Engineer. Looking for a more senior resource in the DevOps space, with a leaning toward site reliability engineering. Docker containers, Kubernetes automation. Mostly focused on automation and on current pain points around deployment reliability in their data engineering processes. SRE who can go beyond the memory, what...
-
Site Reliability Engineer
2 months ago
San Jose, United States TCWGlobal Full time. Site Reliability Engineer (Kubernetes). *US citizenship or Green Card holder - W2 Contract. San Jose, CA 95134 (LOCAL CANDIDATES ONLY - MUST BE LIVING IN SAN JOSE, CA). $80-110/hr (Weekly pay + benefits). 6 month contract (Excellent potential for extension). Full-time: M-F 8am-5pm (Onsite 2 days a week). ***Please note: This role is only accepting candidates that currently...
-
Site Reliability Engineer
2 months ago
San Jose, United States TCWGlobal Full time. Job Description: Site Reliability Engineer (Kubernetes). *US citizenship or Green Card holder - W2 Contract. San Jose, CA 95134 (LOCAL CANDIDATES ONLY - MUST BE LIVING IN SAN JOSE, CA). $80-110/hr (Weekly pay + benefits). 6 month contract (Excellent potential for extension). Full-time: M-F 8am-5pm (Onsite 2 days a week). ***Please note: This role is only...