Infrastructure Reliability Engineer

2 weeks ago


Foster City, California, United States Zoox Full time

Zoox is Seeking: Infrastructure Reliability Engineer

At Zoox, we are in search of an Infrastructure Reliability Engineer dedicated to maintaining the seamless functionality of vital services that drive the evolution of autonomous vehicles. This position is integral to every phase of service implementation, focusing on the creation of resilient and easily manageable systems, as well as ongoing improvements.

Key Responsibilities:

  • Architecting systems that prioritize maintainability and fault tolerance
  • Overseeing the deployment, management, and continuous enhancement of services

Required Qualifications:

  • Experience in supporting production-level service infrastructure
  • Strong understanding of microservices and Kubernetes
  • Ability to extract performance metrics utilizing ELK, Prometheus, and Grafana
  • Proficient in Linux environments
  • Familiarity with programming in Python or C/C++
  • Bachelor's degree in engineering, mathematics, or a related discipline with a minimum of 2 years of experience

Preferred Qualifications:

  • Experience with AWS architecture and operations
  • Skills in deploying and managing Kafka / MSK
  • Knowledge of CI / CD best practices
  • Experience in handling large datasets
  • Master's degree in engineering, mathematics, or a related field

Compensation Overview:

The compensation package for this role includes a competitive salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary range is between $160,000 and $256,000, with potential for a sign-on bonus. Additional benefits are available based on location and position level.

About Zoox:

Zoox is pioneering the development of fully autonomous vehicle fleets and the supporting ecosystem for market introduction. Positioned at the convergence of robotics, machine learning, and design, Zoox aims to transform mobility-as-a-service in urban environments.



  • Foster City, California, United States Zoox Full time

    Zoox is Seeking: Infrastructure Reliability EngineerAt Zoox, we are searching for an Infrastructure Reliability Engineer dedicated to maintaining the seamless functionality of essential services that drive the development of autonomous vehicles. This position involves significant contributions at every phase of service implementation, from crafting resilient...


  • Foster City, California, United States Zoox Full time

    Zoox is Seeking: Site Reliability EngineerAt Zoox, we are in search of a Site Reliability Engineer dedicated to maintaining the seamless functionality of vital services that are pivotal for the evolution of autonomous vehicles. This position involves a significant contribution at every phase of service implementation, encompassing the design of resilient and...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our autonomous vehicle...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a skilled Site Reliability Engineer to join our team and contribute to the development of our autonomous vehicle fleet. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our services, which are critical to the development process for autonomous vehicles.Key...


  • Foster City, California, United States Zoox Full time

    About the RoleZoox is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet's critical services.Key ResponsibilitiesDesign and implement fault-tolerant systems for our autonomous vehicle fleet's...


  • Redwood City, California, United States Dexterity Full time

    Location: Redwood City, CA Travel Required: No Job Classification: Exempt FT Position: Infrastructure Software Engineer Company Overview: Dexterity is an innovative startup specializing in robotic manipulation, dedicated to developing intelligent automation solutions for warehouse operations. Our state-of-the-art technology is designed to transform the...


  • Redwood City, California, United States C3 AI Full time

    C3 AI, Inc (NYSE: AI) stands at the forefront of Enterprise AI software, dedicated to expediting digital transformation. The C3 AI Platform is a proven solution that offers extensive services for the efficient and cost-effective development of enterprise-scale AI applications. This platform enhances the value chain across various industries with prebuilt,...


  • Universal City, California, United States NBC Universal Media, LLC Full time

    Role OverviewThe Infrastructure Solutions Engineer will play a pivotal role in collaborating with various teams to ensure seamless delivery and support of infrastructure solutions.Key ResponsibilitiesCollaboration: Engage with Infrastructure Engineering, Managed Service Provider (MSP) resources, facility support, vendors, and application teams.Operational...


  • Universal City, California, United States NBCUniversal Full time

    Job OverviewCompany Overview:NBCUniversal is a leader in the entertainment industry, producing and distributing high-quality content across various platforms, including film, television, and streaming services. Our commitment to diversity, equity, and inclusion shapes our culture and the content we create, ensuring it resonates with audiences...


  • Foster City, California, United States Conviva Full time

    As Conviva continues to grow, we are actively looking for highly motivated and talented distributed systems engineers at all levels to join our dynamic backend development teams. You will work with some of the best engineers in building our distributed real-time streaming platform for processing, indexing and querying internet scale data. At more senior...


  • Redwood City, California, United States Insight Global Full time

    Job Description**Job Summary:**We are seeking a highly skilled Senior Systems Engineer to join our team at Insight Global. As a key member of our IT department, you will be responsible for ensuring the smooth operation of our IT infrastructure, including network, cloud, and security systems.Key Responsibilities:IT Infrastructure Management: Ensure the...


  • Redwood City, California, United States Moloco Full time

    About Moloco: Moloco is a pioneering machine learning organization dedicated to empowering businesses of all sizes to harness the full potential of their unique first-party data, transforming the conventional approach to performance advertising. While the largest tech firms have demonstrated the effectiveness of data-driven ad-targeting, the same level of...


  • Redwood City, California, United States SmartSource Technical Solutions Full time

    Important Notice:SmartSource Technical Solutions is in search of a dedicated Systems Engineer for a full-time, onsite role.Position Overview:The selected candidate will be responsible for overseeing and optimizing our organization's on-premises and Microsoft 365 environments, ensuring seamless operation, security, governance, and effective utilization of...


  • Culver City, California, United States V-Soft Consulting Group, Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at V-Soft Consulting Group, Inc. as a Data Center Expert. In this role, you will be responsible for ensuring the reliability and performance of our data center infrastructure.Key ResponsibilitiesData Monitoring and Alerting: Design and implement data monitoring and...


  • Culver City, California, United States Apple Full time

    SummaryWe are part of Apple's Hardware Reliability Engineering team, dedicated to collaborating with various iOS hardware engineering groups. Our mission is to enhance and ensure the durability and dependability of Apple's innovative products.In this role, you will engage with multiple engineering teams throughout the entire product lifecycle, from initial...


  • Foster City, California, United States Swtest Full time

    Swtest is in search of an innovative leader to manage the IT Platform Engineering teams focused on Storage and Cloud solutions.Our initiatives in robotics and artificial intelligence are fundamentally supported by extensive, high-performance storage infrastructures and the strategic application of cloud computing and SaaS offerings.Key Responsibilities:...


  • Foster City, California, United States Swtest Full time

    Swtest is in search of an innovative leader to manage the IT Platform Engineering teams focused on Storage and Cloud technologies. Our advancements in robotics and artificial intelligence depend significantly on large-scale, high-performance storage infrastructures and the strategic application of cloud computing and SaaS offerings. Key Responsibilities: ...


  • Redwood City, California, United States SnorkelAI Full time

    At Snorkel AI, we are dedicated to making AI accessible to all by creating a premier platform for AI data development. The evolution of AI has been remarkable, and we are at the forefront of this transformation. Our focus remains on the critical role that data plays in developing high-performance, production-ready AI systems. We collaborate with leading...


  • Foster City, California, United States Zoox Full time

    OverviewThe Software Hardware-in-the-Loop (HIL) team is dedicated to enhancing the testing environment for both system and subsystem components, focusing on improving uptime, reliability, and feature enhancements. To facilitate the integration of our testing framework across various teams, the SW HIL team actively engages in test development and manages...


  • Redwood City, California, United States Promote Project Full time

    About the RoleWe are seeking a highly skilled and experienced Manager of AI System Infrastructure and MLOps Engineering to join our team at Promote Project. As a key member of our AI/ML and Data Engineering team, you will be responsible for the stability and scalable operations of our leading-edge GPU Cloud Compute Cluster.This role will involve guiding our...