Senior Site Reliability Engineer/DevOps
2 weeks ago
Zoox is seeking a Site Reliability Engineer to help ensure the availability, performance, and resilience of the services that power the development and operation of our autonomous vehicles. As a robotics company, Zoox embraces automation at every layer of our infrastructure, and you’ll help drive that ethos forward. You’ll work hands‑on with systems that process massive volumes of data and support compute‑intensive pipelines running on both CPUs and GPUs.
Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
Optimize system performance, reliability, and scalability.
Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
Collaborate with software engineering teams to improve software architecture, deployment processes, and automation.
Conduct root cause analysis of production issues and implement corrective actions.
Implement disaster recovery and business continuity plans.
5+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
Proven experience with cloud platforms such as AWS, GCP, or Azure.
Deep understanding of networking, storage, and database technologies.
Strong programming skills in languages such as Python, Go, C/C++, or Java.
Bonus Qualifications
Experience in the automotive or autonomous vehicle industry.
There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign‑on bonus may be offered as part of the compensation package. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long‑term care insurance, long‑term and short‑term disability insurance, and life insurance.
Zoox is developing the first ground‑up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility‑as‑a‑service in urban environments. Follow us on LinkedIn
If you need an accommodation to participate in the application or interview process please reach out to or your assigned recruiter.
xrczosw Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
-
Mid-Level Site Reliability/ DevOps Engineer
4 weeks ago
San Francisco, CA, United States Jobright.ai Full timeMid-Level Site Reliability/ DevOps Engineer Join to apply for the Mid-Level Site Reliability/ DevOps Engineer role at Mid-Level Site Reliability/ DevOps Engineer 2 days ago Be among the first 25 applicants Join to apply for the Mid-Level Site Reliability/ DevOps Engineer role at Jobright is an AI-powered career platform that helps job seekers discover...
-
Senior Site Reliability Engineer/DevOps
4 weeks ago
San Francisco, CA, United States Fractal Full timeSite Reliability Engineer Fractal Analytics is a strategic AI partner to Fortune 500 companies with a vision to power every human decision in the enterprise. Fractal is building a world where individual choices, freedom, and diversity are the greatest assets. We believe that a true Fractalite empowers imagination with intelligence. You will need to work...
-
Senior Site Reliability Engineer/DevOps
4 weeks ago
San Francisco, CA, United States Primer Full timePrimer helps B2B products break out of the B2C-centric marketing box. Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market teams. We ingest billions of rows from first- and third-party sources, map them to rich company context, and surface hyper-targeted audiences and real-time...
-
Senior Site Reliability Engineer/DevOps
2 weeks ago
Palo Alto, CA, United States black.ai Full timeQuantum computing holds the promise of humanity’s mastery over the natural world, but only if we can build a real quantum computer. PsiQuantum is on a mission to build the first real, useful quantum computers, capable of delivering the world‑changing applications that the technology has long promised. We know that means we will need to build a system...
-
Senior DevOps Engineer
2 weeks ago
Palo Alto, CA, United States Menlo Ventures Full timeFounded in 2017, Obsidian Security was created to close a critical gap: securing the SaaS applications where modern business happens—platforms like Microsoft 365, Salesforce, and hundreds more. Backed by top investors including Greylock, Norwest Venture Partners, and IVP, we’ve built a complete SaaS security platform to reduce risk, detect and respond to...
-
Senior Site Reliability Engineer
3 weeks ago
Cupertino, CA, United States Apple Inc. Full timeA leading technology company is seeking a Site Reliability Engineer to join the Apple Maps Application Services team in Cupertino. The experience expected from applicants, as well as additional skills and qualifications needed for this job are listed below. In this role, you will be responsible for ensuring the reliability and scalability of the backend...
-
Site Reliability Engineer
2 weeks ago
Foster City, United States Replit, Inc. Full timeReplit is the agentic software creation platform that enables anyone to build applications using natural language. With millions of users worldwide and over 500,000 business users, Replit is democratizing software development by removing traditional barriers to application creation.About the role:Join our Site Reliability Engineering team and help ensure the...
-
Site Reliability Engineer
5 days ago
Foster City, United States Repl.it Full timeReplit is the fastest way to turn ideas into software. With our powerful AI-powered Agent and Assistant, anyone can create and launch apps from natural language in just one click. Build and deploy full-stack applications directly from your browser—no setup required. Never written a line of code in your life? No problem. Replit makes software creation...
-
Staff Site Reliability Engineer
2 weeks ago
Foster City, United States Replit, Inc. Full timeReplit is the agentic software creation platform that enables anyone to build applications using natural language. With millions of users worldwide and over 500,000 business users, Replit is democratizing software development by removing traditional barriers to application creation. About the role: Join our Site Reliability Engineering (SRE) team and help...
-
Staff Site Reliability Engineer
6 days ago
Foster City, United States Repl.it Full timeLocationFoster City, CA (Hybrid) In office M,W,FEmployment TypeFull timeDepartmentEngineeringCompensationCompensation is determined based on career level, with the base salary for this role ranging from $220K – $325K • Offers Equity • Offers Bonus • Performance Based BonusReplit is the agentic software creation platform that enables anyone to build...