Senior Site Reliability Engineer

3 days ago


Foster City, California, United States Zoox Full time
About the Role

Zoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our operations team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services.

Key Responsibilities
  • Design and implement fault-tolerant systems for our services
  • Collaborate with cross-functional teams to deploy and operate services
  • Develop and maintain monitoring and reporting tools to track service performance
  • Work with large data sets and data-processing pipelines
  • Stay up-to-date with industry trends and best practices
Requirements
  • Experience with configuration management tools like Ansible, Terraform, or Salt
  • Proficiency with microservice architecture and Kubernetes
  • Ability to extract and report useful performance metrics
  • Linux expertise
  • Programming skills in Python or C/C++
  • Bachelor's degree in engineering, mathematics, or a related field, and 2+ years of relevant experience
Bonus Requirements
  • AWS architecture and operational experience
  • Experience with Kafka and CI/CD best practices
  • Master's degree in engineering, mathematics, or a related field
Compensation

Zoox offers a competitive compensation package, including salary, Amazon RSUs, and Zoox Stock Appreciation Rights. The salary range for this position is $160,000 to $256,000, with a sign-on bonus possible. Compensation will vary based on location and level.

About Zoox

Zoox is a robotics company developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem. We're looking for top talent to join our fast-moving and highly execution-oriented team.



  • Foster City, California, United States Zoox Full time

    About the Role: Zoox is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining the systems that support our autonomous vehicle fleet. Key Responsibilities: Design and implement scalable, fault-tolerant systems to support our autonomous...


  • Foster City, California, United States Bayone Full time

    As a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. Your key responsibilities will include: Service Maintenance: Perform regular host OS upgrades to ensure the latest security patches and features are applied. Upgrade Docker images to ensure the latest software versions...


  • Foster City, California, United States Bayone Full time

    Job Description: Site Reliability Engineer at Bayone. We are seeking a highly skilled Site Reliability Engineer to join our team. The successful candidate will be responsible for ensuring the smooth operation of our large production service. Key Responsibilities: Perform OS upgrades, Docker image upgrades, and SSL certificate upgrades to maintain service...


  • Foster City, California, United States Omega Solutions Inc Full time

    Job Description and Responsibilities: We are seeking a highly skilled Site Reliability Engineer to join our team at Omega Solutions Inc. The ideal candidate will have a strong background in Unix/Linux administration, Bash scripting, and experience with configuration management automation tools like Chef and Ansible. Key Responsibilities: Design and implement...


  • Foster City, California, United States Zoox Full time

    About the Role: Zoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services. Key Responsibilities: Design and implement fault-tolerant systems for our autonomous vehicle fleet; collaborate with...


  • Foster City, California, United States Zoox Full time

    About the Role: Zoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet services. Key Responsibilities: Design and implement fault-tolerant systems for our autonomous vehicle services; collaborate with...


  • Foster City, California, United States Bayone Full time

    Job Description: As a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. This includes the following key responsibilities: performing host OS upgrades, Docker image upgrades, and SSL certificate upgrades; defining and refining metrics to track service health and performance; automating software releases...


  • Foster City, California, United States Bayone Full time

    Job Description: As a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our production services. This includes the following key responsibilities: upgrading and maintaining the host OS, Docker images, and SSL certificates to ensure optimal performance and security; defining and refining metrics to track service health and...


  • Foster City, California, United States Bayone Full time

    Job Description: As a Site Reliability Engineer at Bayone, you will ensure the smooth operation of our large-scale production service, encompassing host OS upgrades, Docker image upgrades, and SSL certificate upgrades. Key Responsibilities: Define and refine metrics to track service health and performance; automate software releases and service...


  • Foster City, California, United States Bayone Full time

    Job Description: As a Site Reliability Engineer at Bayone, you will ensure the smooth operation of our large-scale production service by performing regular host OS upgrades and updating Docker images and SSL certificates. You will also be responsible for defining and refining metrics to track service health and performance, and automating software releases and service...


  • Foster City, California, United States Bayone Full time

    Job Description: As a Site Reliability Engineer at Bayone, you will be responsible for ensuring the smooth operation of our large production service. Key Responsibilities: Service Maintenance: Perform regular host OS upgrades, Docker image upgrades, and SSL certificate upgrades to ensure the service remains up-to-date and secure. Metrics and...


  • Redwood City, California, United States Box Full time

    About Box: Box is the market leader for Cloud Content Management, empowering businesses to accelerate their digital transformation. Our mission is to power how the world works together, and we're seeking a talented Senior Software Engineer to join our Site Reliability Engineering team. Job Summary: We're looking for a highly skilled Senior Software Engineer to...


  • Foster City, California, United States Zoox Full time

    About the Role: Zoox is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical systems. Key Responsibilities: Design and implement scalable and fault-tolerant systems for our autonomous vehicle...


  • Redwood City, California, United States Box Full time

    Transforming the Way the World Works Together. At Box, we're revolutionizing Cloud Content Management, and we need a talented Senior Software Engineer, Site Reliability Engineering to join our team. As a key member of our SRE organization, you'll play a crucial role in bringing AI to our content cloud, ensuring the reliability and scalability of our...


  • Redwood City, California, United States Oracle Full time

    Job Description: Oracle is seeking a highly skilled Senior Principal Site Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you will be responsible for designing and delivering mission-critical cloud infrastructure solutions that meet the needs of our customers. Responsibilities: Collaborate with...


  • Foster City, California, United States Zoox Full time

    About the Role: Zoox is seeking a skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle services. Key Responsibilities: Design and implement fault-tolerant systems for our autonomous vehicle services; collaborate with cross-functional...


  • Redwood City, California, United States 1872 Consulting Full time

    Site Reliability Engineer: At 1872 Consulting, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, working closely with developer teams to identify and resolve issues. Key Responsibilities: Be part of the on-call rotation to respond to...


  • Redwood City, California, United States 1872 Consulting Full time

    Site Reliability Engineer: At 1872 Consulting, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, working closely with developer teams to identify and resolve issues. Key Responsibilities: Be part of the on-call rotation to respond to...


  • Redwood City, California, United States Moloco Full time

    About Moloco: Moloco is a pioneering machine learning company that empowers organizations to unlock the full value of their unique first-party data, revolutionizing the traditional path to performance advertising. By harnessing the power of cutting-edge machine learning technologies, we play a unique and visible role in shaping the digital economy, allowing...