Senior Site Reliability Manager

2 days ago


San Jose, California, United States Triune Infomatics Inc Full time
Role:

Senior Site Reliability Manager


Triune Infomatics Inc is seeking an experienced Senior Site Reliability Manager to join our team and contribute to the design and upkeep of our cloud-based IoT edge orchestration solution.


Job Summary:

The Senior Site Reliability Manager will be responsible for ensuring the availability of our SaaS platform and meeting the uptime and performance requirements of our Fortune 500 customers.


Key Responsibilities:
  • Lead the SRE Operations team to implement processes and procedures that ensure quality and predictability of disaster recovery, performance monitoring, alerting, and reporting.
  • Ensure compliance with ISO27001 and SOC2 standards in incident handling.
  • Play a key role in team performance, growth, and on-call strategy for 24x7x365 availability.
  • Serve as the initial escalation point for incidents, ensuring timely resolution by involving other teams as needed.
  • Collaborate with the SRE Technical Lead and other engineering groups to suggest and implement platform improvements.
  • Regularly report on platform performance to upper management.
  • Interface with the Customer Experience Organization and meet with customers as required.
  • Perform hands-on duties as part of the SRE Operations Team.

Qualifications:
  • Bachelor's degree in Computer Science, Engineering, or related field.
  • Minimum of 5 years of experience in a Site Reliability Engineer role or similar.
  • Proven experience in managing and leading SRE or operations teams.
  • Strong understanding of cloud-based architectures and distributed systems.
  • Experience with disaster recovery, performance monitoring, and alerting systems.
  • Familiarity with ISO27001 and SOC2 standards.
  • Excellent problem-solving skills and ability to handle high-pressure situations.
  • Strong communication and interpersonal skills.
  • Energetic self-starter with a customer-centric mindset.


  • San Jose, California, United States F5 Full time

    Job Summary: F5 is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will play a pivotal role in ensuring the reliability and scalability of our distributed cloud product. Key Responsibilities: Design and implement automation solutions to reduce toil and improve operational efficiency. Participate in...


  • San Jose, California, United States F5 Full time

    About F5: F5 is a leading provider of cloud and security solutions, empowering organizations to create, secure, and run applications that enhance the digital experience. Job Summary: We are seeking an exceptional Senior Site Reliability Engineer to join our SRE team for the F5 Distributed Cloud Product. As a key member of our team, you will play a pivotal role in...


  • San Jose, California, United States Hireio, Inc. Full time

    About the Role: Hireio, Inc. is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our data infrastructure team, you will be responsible for designing, building, and managing large-scale, highly distributed systems. Our team is a pioneer in innovation, seamlessly merging software development and infrastructure...


  • San Jose, California, United States HireIO Inc Full time

    Job Description: At HireIO Inc, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you will be responsible for designing, implementing, and operating large-scale, distributed systems. Responsibilities: Design and implement software platforms and monitor frameworks for...


  • San Jose, California, United States F5 Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at F5. As a key member of our SRE team, you will play a pivotal role in ensuring the reliability and scalability of our Distributed Cloud Product. Key Responsibilities: Design and implement automation solutions to reduce toil and improve operational...


  • San Jose, California, United States F5 Full time

    About F5: F5 is a leading provider of cloud and security solutions, empowering organizations to create, secure, and run applications that enhance the digital experience. Job Summary: We are seeking an exceptional Senior Site Reliability Engineer to join our SRE team for the F5 Distributed Cloud Product. As a key member of our team, you will play a pivotal role in...


  • San Jose, California, United States F5 Full time

    About the Role: We are seeking an exceptional Senior Site Reliability Engineer to join our SRE team for the groundbreaking F5 Distributed Cloud Product. As a key member of our team, you will play a pivotal role in ensuring the reliability, scalability, and security of our cloud-based infrastructure. Key Responsibilities: Design and implement automation solutions...


  • San Jose, California, United States Hireio, Inc. Full time

    Job Overview: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Hireio, Inc. The ideal candidate will have a strong background in software development, systems engineering, and cloud infrastructure. They will be responsible for designing, implementing, and maintaining large-scale, distributed systems that are highly available,...


  • San Jose, California, United States TikTok Full time

    Senior Site Reliability Engineer, Global E-Commerce: We're seeking a highly skilled Senior Site Reliability Engineer to join our Global E-Commerce team. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our e-commerce platform. Responsibilities: Participate in global on-call rotations and lead incident response...


  • San Jose, California, United States F5 Full time

    About F5: F5 is a leading provider of cloud and security solutions, empowering organizations to create, secure, and run applications that enhance the digital experience. Job Summary: We are seeking an exceptional Senior Site Reliability Engineer to join our SRE team for the F5 Distributed Cloud Product. As a key member of our team, you will play a pivotal role in...


  • San Francisco, California, United States Infused Solutions Full time

    Senior Site Reliability Engineer: Infused Solutions is seeking a highly skilled Senior Site Reliability Engineer to join their IT infrastructure team. Our client is a market leader in the San Francisco area, and we are looking for a talented individual with expertise in Microsoft Azure and a strong background in software engineering. Key Responsibilities: Design...


  • San Francisco, California, United States smartrecruiters - JobBoard Full time

    Job Title: Senior Site Reliability Engineer. At Twitter, we're looking for a seasoned Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you'll be responsible for leading a team of site reliability engineers who work tirelessly to keep Twitter reliable and scalable. Key Responsibilities: Lead a team of site...


  • San Francisco, California, United States smartrecruiters - JobBoard Full time

    Job Title: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for leading a team of site reliability engineers who work to keep Twitter reliable and scalable. Responsibilities: Lead a team of site reliability engineers to...


  • San Jose, California, United States TikTok Full time

    Job Title: Senior Site Reliability Engineer. At TikTok, we're committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. As a Senior Site Reliability Engineer, you'll play a critical role in shaping the future of...


  • San Francisco, California, United States SingleStore Full time

    Job Title: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to join our team at SingleStore. As a key member of our engineering team, you will be responsible for designing, building, and running elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments. Key Responsibilities: Help drive...


  • San Jose, California, United States TikTok Full time

    Job Title: Senior Site Reliability Engineer, Global E-Commerce. We are seeking a highly skilled Senior Site Reliability Engineer to join our Global E-Commerce team. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our e-commerce platform. Responsibilities: Be part of our global on-call rotation and be...


  • San Francisco, California, United States Infused Solutions Full time

    Job Title: Senior Site Reliability Engineer. We are seeking an experienced Senior Site Reliability Engineer to join our team at Infused Solutions. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining scalable, high-availability infrastructure for our platform. Key Responsibilities: Architect and manage...


  • San Francisco, California, United States Astranis Full time

    Astranis Mission: Astranis is revolutionizing global connectivity by developing the next generation of smaller, more cost-effective spacecraft. Our mission is to bridge the digital divide and connect the four billion people worldwide who lack internet access. Job Summary: We are seeking a highly motivated and experienced Senior Site Reliability Engineer to join...


  • San Francisco, California, United States Twitter Full time

    Job Description: At Twitter, we're committed to delivering a seamless and reliable experience for our users. As a Senior Site Reliability Engineer, you'll play a critical role in ensuring the stability and scalability of our services. Responsibilities: Lead a team of site reliability engineers to design, implement, and maintain scalable and reliable...


  • San Jose, California, United States X (formerly Twitter) Full time

    About the Role: We're seeking a highly skilled Site Reliability Engineer to join our Command Center Team at X (formerly Twitter). As a Site Reliability Engineer, you will play a critical role in ensuring the high availability and reliability of our services, working closely with cross-functional teams to drive significant impact across all areas of the...