Senior Engineering Manager, Site Reliability Operations

23 hours ago


Redwood City, California, United States Box Full time
Transform the Future of Content Management

At Box, we're revolutionizing the way organizations work with content. As a Senior Engineering Manager, Site Reliability Operations, you'll play a critical role in ensuring the seamless operation of our cloud infrastructure. Join our team and be part of shaping the future of content management.

Key Responsibilities:
  • Lead and oversee a global Network Operations Center (NOC) team
  • Ensure continuous monitoring of network systems through the NOC team
  • Optimize Mean Time To X (MTTx) process to minimize service disruptions
  • Act as primary point of contact for critical incidents
  • Maintain clear communication channels with stakeholders
Requirements:
  • Minimum 6 years of experience in technical operations or a related field
  • Profound knowledge of Cloud infrastructure, monitoring tools, and incident management procedures
  • Exceptional written and verbal communication skills
  • Strong problem-solving and decision-making capabilities
What We Offer:
  • Competitive salary and benefits package
  • Opportunity to work with a leading cloud infrastructure company
  • Collaborative and dynamic work environment


  • Redwood City, California, United States Box Full time

    About BoxBox is the market leader for Cloud Content Management, empowering businesses to accelerate their digital transformation. Our mission is to power how the world works together, and we're seeking a talented Senior Software Engineer to join our Site Reliability Engineering team.Job SummaryWe're looking for a highly skilled Senior Software Engineer to...


  • Redwood City, California, United States Box Full time

    Transforming the Way the World Works TogetherAt Box, we're revolutionizing Cloud Content Management, and we need a talented Senior Software Engineer, Site Reliability Engineering to join our team. As a key member of our SRE organization, you'll play a crucial role in bringing AI to our content cloud, ensuring the reliability and scalability of our...


  • Redwood City, California, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Senior Principal Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you will be responsible for designing and delivering mission-critical cloud infrastructure solutions that meet the needs of our customers.ResponsibilitiesCollaborate with...


  • Redwood City, California, United States Moloco Full time

    About MolocoMoloco is a pioneering machine learning company that empowers organizations to unlock the full value of their unique first-party data, revolutionizing the traditional path to performance advertising. By harnessing the power of cutting-edge machine learning technologies, we play a unique and visible role in shaping the digital economy, allowing...


  • Redwood City, California, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Senior Principal Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you will be responsible for designing and delivering mission-critical cloud infrastructure solutions that meet the needs of our customers.ResponsibilitiesCollaborate with...


  • Redwood City, California, United States 1872 Consulting Full time

    Site Reliability EngineerAt 1872 Consulting, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, working closely with developer teams to identify and resolve issues.Key Responsibilities:Be on-call rotation to respond to...


  • Redwood City, California, United States 1872 Consulting Full time

    Site Reliability EngineerAt 1872 Consulting, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, working closely with developer teams to identify and resolve issues.Key Responsibilities:Be on-call rotation to respond to...


  • Redwood City, California, United States Zilliz Full time

    About ZillizZilliz is a fast-growing startup that specializes in developing cutting-edge vector database technologies for enterprise-grade AI applications. Our mission is to democratize AI by simplifying data management and making vector databases accessible to every organization.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join...


  • Redwood City, California, United States Zilliz Full time

    About ZillizZilliz is a pioneering startup that specializes in developing cutting-edge vector database technologies for enterprise-grade AI applications.As the company behind the world's most popular open-source vector database, Milvus, Zilliz is committed to simplifying data management for AI applications and making vector databases accessible to every...


  • Redwood City, California, United States Zilliz Full time

    About ZillizZilliz is a pioneering startup that specializes in developing cutting-edge vector database technologies for enterprise-grade AI applications.As the company behind the world's most popular open-source vector database, Milvus, Zilliz is committed to simplifying data management for AI applications and making vector databases accessible to every...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our operations team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our servicesCollaborate with cross-functional teams...


  • Foster City, California, United States Bayone Full time

    As a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. Your key responsibilities will include: **Service Maintenance** * Perform regular host OS upgrades to ensure the latest security patches and features are applied. * Upgrade Docker images to ensure the latest software versions...


  • Culver City, California, United States ICON Consultants, LP Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at ICON Consultants, LP. As a key member of our technical operations team, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement solutions to improve system performance and remove...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our autonomous vehicle fleetCollaborate with...


  • Redwood City, California, United States C3, Inc. Full time

    Job Title: Senior Product Manager - Reliability and SustainabilityC3 AI is seeking a highly skilled Senior Product Manager to lead our product efforts in AI-based asset performance management and energy efficiency use cases. As a key member of our team, you will be responsible for developing and executing a comprehensive product strategy that drives business...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at BayoneWe are seeking a highly skilled Site Reliability Engineer to join our team. The successful candidate will be responsible for ensuring the smooth operation of our large production service.Key Responsibilities:Perform OS upgrades, Docker image upgrades, and SSL certificate upgrades to maintain service...


  • Foster City, California, United States Omega Solutions Inc Full time

    Job Description and ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Omega Solutions Inc.The ideal candidate will have a strong background in Unix/Linux administration, Bash scripting, and experience with configuration management automation tools like Chef and Ansible.Key Responsibilities:Design and implement...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our autonomous vehicle servicesCollaborate with...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining the systems that support our autonomous vehicle fleet.Key ResponsibilitiesDesign and implement scalable, fault-tolerant systems to support our autonomous...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. This includes:Key ResponsibilitiesService Maintenance: Perform regular host OS upgrades, Docker image upgrades, and SSL certificate upgrades to ensure the service remains up-to-date and secure.Metrics and...