Current jobs related to Site Reliability Engineer - San Mateo - The Ladders


  • San Mateo, California, United States Verkada Full time

    About the Role: We are seeking a talented Site Reliability Engineer to join our Infrastructure team at Verkada. As a member of this team, you will be responsible for managing our infrastructure and ensuring it is scalable, secure, and efficient. Your primary focus will be on optimizing our cluster cost efficiency, enforcing security requirements, improving...


  • San Mateo, California, United States Verkada Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Infrastructure team at Verkada. As a key member of our team, you will be responsible for managing our infrastructure and ensuring it is scalable, secure, and efficient. Key Responsibilities: Manage and optimize our cloud infrastructure to ensure cost efficiency and...


  • San Mateo, California, United States Roblox Full time

    Drive Reliability and Scalability at Roblox: We're seeking an exceptional Site Reliability Engineering Manager to join our Compute infrastructure team at Roblox. As a key member of our engineering leadership team, you'll partner with other leaders to establish a group that evolves Compute's best practices and systems to ensure they meet the highest standards...


  • San Mateo, California, United States Zoox Full time

    About the Role: Zoox is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the uptime and reliability of our autonomous vehicle fleet. This is a critical role that requires a strong understanding of software and hardware systems, as well as excellent problem-solving...

  • Senior Director

    4 weeks ago


    San Mateo, United States Visa Full time

    Company Description: Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure...


  • San Jose, California, United States Adobe Full time

    Job Title: Site Reliability Engineer. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Adobe. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud services. You will work closely with our development team to design, deploy, and optimize our cloud services,...


  • San Francisco, California, United States Unreal Gigs Full time

    Job Title: Site Reliability Engineer. At Unreal Gigs, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the high availability, scalability, and performance of our complex distributed systems. Key Responsibilities: Design and implement monitoring, logging, and alerting...


  • San Francisco, California, United States Wasmer Full time

    About the Role: We are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions for our Edge computing platform. Key Responsibilities: Design and implement scalable and reliable infrastructure...


  • San Leandro, California, United States United Software Group Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at United Software Group. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our digital platforms. Key Responsibilities: Design, implement, and maintain scalable and efficient systems...


  • San Francisco, California, United States Instabase Full time

    About Instabase: At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index...


  • San Antonio, Texas, United States Dunhill Professional Search Full time

    Job Title: Site Reliability Engineer. We are seeking a highly motivated Site Reliability Engineer to join our team at Dunhill Professional Search. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our cloud-based applications and infrastructure. Key Responsibilities: Provide integration and operational support for...


  • San Francisco, California, United States Apollo Solutions Full time

    Site Reliability Engineer. Apollo Solutions has partnered with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security. The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team. The Site Reliability Engineer...


  • San Francisco, California, United States DaVita Full time

    About the Role: The WEX Site Reliability Engineering team is seeking a skilled Site Reliability Engineer to join our Platform Reliability organization. As a key member of our team, you will be responsible for developing software and solutions focused on observability, incident response, reliability, and performance. You will collaborate with our engineering...


  • San Diego, California, United States Qualcomm Full time

    Job Title: Site Reliability Engineer. Join Qualcomm as a Site Reliability Engineer and be part of a highly collaborative team focused on provisioning and maintaining infrastructure and services with stability, sustainability, and security always on your mind. About the Role: We are seeking a skilled Site Reliability Engineer to join our team. As a Site...


  • San Leandro, California, United States Omni Inclusive Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Omni Inclusive. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and availability of our Digital Sales & Marketing platforms. Key Responsibilities: Collaborate with Engineering teams to maintain the...


  • San Leandro, California, United States Omni Inclusive Full time

    Job Title: Site Reliability Engineer. Omni Inclusive is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and availability of our Digital Sales & Marketing platforms. Key Responsibilities: Collaborate with Engineering teams to maintain the SLAs &...


  • San Leandro, California, United States Omni Inclusive Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Omni Inclusive. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and availability of our Digital Sales & Marketing platforms. Key Responsibilities: Design, implement, and maintain scalable and efficient systems to...


  • San Francisco, California, United States Roman Health Pharmacy LLC Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Xero. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based platform. Key Responsibilities: Investigate operational surprises and support teams in post-incident activities. Conduct in-depth incident...


  • San Jose, California, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design and implement automation scripts using shell,...

Site Reliability Engineer

2 months ago


San Mateo, United States The Ladders Full time
Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service, from designing systems that are fault-tolerant and easy to maintain, through deployment and operation, to continual improvement. Zoox is a robotics company, and our ethos of automation extends throughout the infrastructure components we build. Be prepared to work with systems handling large volumes of data and data-processing pipelines performing compute-intensive tasks on CPUs and GPUs.

Qualifications
    • Experience in supporting production service infrastructure and utilizing configuration management tools like Ansible, Terraform, or Salt
    • Proficiency with microservice architecture and tooling around Kubernetes
    • Ability to extract and report useful performance or service metrics using ELK, Prometheus, and Grafana
    • Linux, no matter the flavor
    • Familiarity with Python or C/C++
    • Bachelor's degree in engineering, mathematics, or a related field and 2+ years of relevant experience
Bonus Qualifications
    • AWS architecture and operational experience with a range of technologies such as OS, RDS, ECS, and EKS
    • Deploying and managing Kafka / MSK as a service
    • Establishing and supporting CI / CD best practices
    • Experience handling large data sets
    • Master's degree in engineering, mathematics, or a related field


Compensation

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary range for this position is $160,000 to $256,000. A sign-on bonus may be offered as part of the compensation package. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

Zoox also offers a comprehensive package of benefits including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

About Zoox

Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We're looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

A Final Note:

You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.