Site Reliability Engineer

2 days ago


Foster City, California, United States Zoox Full time
About the Role

Zoox is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical systems.

Key Responsibilities
  • Design and implement scalable and fault-tolerant systems for our autonomous vehicle fleet
  • Collaborate with cross-functional teams to identify and resolve infrastructure-related issues
  • Develop and maintain monitoring and alerting systems to ensure timely issue detection and resolution
  • Work with our DevOps team to implement continuous integration and continuous deployment (CI/CD) pipelines
  • Stay up-to-date with industry trends and emerging technologies to ensure our infrastructure remains competitive
Requirements
  • 3+ years of experience in a Site Reliability Engineer or similar role
  • Strong understanding of cloud infrastructure and automation tools (e.g. Ansible, Terraform, Salt)
  • Experience with microservice architecture and containerization (e.g. Kubernetes)
  • Proficiency in programming languages such as Python, Java, or C++
  • Bachelor's degree in Computer Science, Engineering, or related field
Preferred Qualifications
  • Experience with AWS or other cloud platforms
  • Knowledge of data processing and analytics tools (e.g. ELK, Prometheus, Grafana)
  • Experience with CI/CD tools and pipelines
  • Master's degree in Computer Science, Engineering, or related field
About Zoox

Zoox is a robotics company developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. We're looking for talented individuals who share our passion for innovation and excellence.



  • Foster City, California, United States Bayone Full time

    As a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. Your key responsibilities will include: **Service Maintenance** * Perform regular host OS upgrades to ensure the latest security patches and features are applied. * Upgrade Docker images to ensure the latest software versions...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at BayoneWe are seeking a highly skilled Site Reliability Engineer to join our team. The successful candidate will be responsible for ensuring the smooth operation of our large production service.Key Responsibilities:Perform OS upgrades, Docker image upgrades, and SSL certificate upgrades to maintain service...


  • Foster City, California, United States Omega Solutions Inc Full time

    Job Description and ResponsibilitiesWe are seeking a highly skilled Site Reliability Engineer to join our team at Omega Solutions Inc.The ideal candidate will have a strong background in Unix/Linux administration, Bash scripting, and experience with configuration management automation tools like Chef and Ansible.Key Responsibilities:Design and implement...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our autonomous vehicle fleetCollaborate with...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our autonomous vehicle servicesCollaborate with...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our production services. This includes:Key ResponsibilitiesUpgrading and maintaining the host OS, Docker images, and SSL certificates to ensure optimal performance and security.Defining and refining metrics to track service health and...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. This includes:Key ResponsibilitiesPerforming host OS upgrades, Docker image upgrades, and SSL certificate upgradesDefining and refining metrics to track service health and performanceAutomating software releases...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at Bayone, you will:Ensure the smooth operation of our large-scale production service, encompassing:• Host OS upgrades• Docker image upgrades• SSL certificate upgradesKey Responsibilities:• Define and refine metrics to track service health and performance• Automate software releases and service...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at Bayone, you will:Ensure the smooth operation of our large-scale production service by:Performing regular host OS upgradesUpdating Docker images and SSL certificatesYou will also be responsible for:Defining and refining metrics to track service health and performanceAutomating software releases and service...


  • Foster City, California, United States Bayone Full time

    Job DescriptionAs a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. This includes:Key ResponsibilitiesService Maintenance: Perform regular host OS upgrades, Docker image upgrades, and SSL certificate upgrades to ensure the service remains up-to-date and secure.Metrics and...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining the systems that support our autonomous vehicle fleet.Key ResponsibilitiesDesign and implement scalable, fault-tolerant systems to support our autonomous...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our operations team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our servicesCollaborate with cross-functional teams...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our autonomous vehicle servicesCollaborate with cross-functional...


  • Redwood City, California, United States 1872 Consulting Full time

    Site Reliability EngineerAt 1872 Consulting, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, working closely with developer teams to identify and resolve issues.Key Responsibilities:Be on-call rotation to respond to...


  • Redwood City, California, United States 1872 Consulting Full time

    Site Reliability EngineerAt 1872 Consulting, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, working closely with developer teams to identify and resolve issues.Key Responsibilities:Be on-call rotation to respond to...


  • Culver City, California, United States ICON Consultants, LP Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at ICON Consultants, LP. As a key member of our technical operations team, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement solutions to improve system performance and remove...


  • Redwood City, California, United States Zilliz Full time

    About ZillizZilliz is a fast-growing startup that specializes in developing cutting-edge vector database technologies for enterprise-grade AI applications. Our mission is to democratize AI by simplifying data management and making vector databases accessible to every organization.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join...


  • Redwood City, California, United States Box Full time

    About BoxBox is the market leader for Cloud Content Management, empowering businesses to accelerate their digital transformation. Our mission is to power how the world works together, and we're seeking a talented Senior Software Engineer to join our Site Reliability Engineering team.Job SummaryWe're looking for a highly skilled Senior Software Engineer to...


  • Redwood City, California, United States Zilliz Full time

    About ZillizZilliz is a pioneering startup that specializes in developing cutting-edge vector database technologies for enterprise-grade AI applications.As the company behind the world's most popular open-source vector database, Milvus, Zilliz is committed to simplifying data management for AI applications and making vector databases accessible to every...


  • Redwood City, California, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Senior Principal Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you will be responsible for designing and delivering mission-critical cloud infrastructure solutions that meet the needs of our customers.ResponsibilitiesCollaborate with...