Site Reliability Engineer

4 weeks ago


Foster City, United States Zoox Full time
Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through deployment, operation, and continual improvement. Zoox is a robotics company and our ethos of automation extends throughout the infrastructure components we build. Be prepared to work with systems handling large volumes of data and data-processing pipelines performing compute-intensive tasks on CPUs and GPUs.

Qualifications

Experience in supporting production service infrastructure and utilizing configuration management tools like Ansible, Terraform, or Salt Proficiency with microservice architecture and tooling around Kubernetes Ability to extract and report useful performance or service metrics using ELK, prometheus, grafana Linux, no matter the flavor Familiarity with Python or C/C++ Bachelor's degree in an engineering, mathematics, or related field and 2+ years of relevant experience

Bonus Qualifications

AWS Architecture and operational experience with a range of tech like OS, RDS, ECS, EKS Deploying and managing Kafka / MSK as a service Establishing and supporting CI / CD best practices Experience handling large data sets Master's degree in an engineering, mathematics, or related field  CompensationThere are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary range for this position is $160,000 to $256,000. A sign-on bonus may be offered as part of the compensation package. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.Zoox also offers a comprehensive package of benefits including paid time off ( sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance. About Zoox Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team. Accommodations If you need an accommodation to participate in the application or interview process please reach out to or your assigned recruiter.

  • Foster City, United States Bayone Full time

    As a Site Reliability Engineer, you will: Keep a large production service up and running including: Host OS upgrades Docker image upgrades SSL certificate upgrades Define and refine metrics to track service health and performance. Automate software releases and service failovers. Requirements Bachelor's degree in Engineering, Mathematics or...


  • Foster City, United States Zoox Full time

    Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through...


  • Foster City, United States Zoox Full time

    Zoox is looking for an experienced leader to lead our Site Reliability Engineering team. Infrastructure is key in building, validating, and running our autonomous driving software, and the team you’ll be running supports it all. In this highly impactful role, you will closely work with partners in many teams including the driving AI teams, safety...


  • Foster City, United States Zoox Full time

    Foster City, CA • Full-time Staff/Senior Staff Site Reliability Engineer Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from...


  • Jersey City, United States SelektIT Full time

    Job Description Position: Site Reliability Engineer Company Overview: Purelogics is a fast-growing technology company that provides innovative solutions to businesses of all sizes. Our team consists of highly skilled and dedicated professionals who are passionate about delivering top-notch services to our clients. We are currently looking for a Site...

  • Reliability Engineer

    2 weeks ago


    Foster City, United States Zoox Full time

    At Zoox we have set the goal to provide our customers with the highest level of safety and a best-in-class experience while using our fully autonomous vehicles. You will work with a team of world-class engineers with diverse backgrounds such as robotics, control, and vehicle engineering, to deliver the vehicle performance using virtual tools and...


  • Foster City, United States Knewin Full time

    Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through...


  • Arizona City, United States Openlane Full time

    Job Description: Site Reliability Engineer (f.k.a. Platform Engineer) for CarsArrive Network, Inc. located in Mesa, AZ. Provide daily, hands-on assistance to maintain and advance the build process to ensure reliability and optimum integration with Continuous Integration/Continuous Delivery (CI/CD) and Release Management. Work with the development,...


  • Oklahoma City, United States BJ's Wholesale Club Full time

    Lead Site Reliability Engineer page is loaded Lead Site Reliability Engineer Apply locations BJ's Club Support Center Marlborough, MA #5997 time type Full time posted on Posted 2 Days Ago job requisition id R147855 Join our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight...


  • Jersey City, United States Pinnacle Group, Inc. Full time

    W2 only - Preferred Citizen or Green Card Holder Contract to Hire Must Have: AWS Certification7-8 years of experience and 2 years of AWS expTools: Grafana, DataDogDatabase: MySQL or Oracle-Unix, Linux, Shell Scripting, LAN, NFS-Python, Go Lang, Terraform, Jenkins -Docker, Kubernetes Site Reliability Engineer (AWS) (SRE)Roles and Responsibilities:• Design,...


  • Jersey City, United States Pinnacle Group, Inc. Full time

    W2 only - Preferred Citizen or Green Card Holder Contract to Hire Must Have: AWS Certification7-8 years of experience and 2 years of AWS expTools: Grafana, DataDogDatabase: MySQL or Oracle-Unix, Linux, Shell Scripting, LAN, NFS-Python, Go Lang, Terraform, Jenkins -Docker, Kubernetes Site Reliability Engineer (AWS) (SRE)Roles and Responsibilities:• Design,...


  • Nevada City, United States Talent Space Full time

    Talent Space is looking for a consulting Site Reliability Engineer/SRE for our SaaS client to support a large Production Environment. Role will address Corporate level requirements rather than be focused on a specific Engineering Group/Team. Job Description: As a Cloud Infrastructure Engineer, you will play a pivotal role in maintaining, enhancing, and...


  • Jersey City, New Jersey, United States tapwage Full time

    There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Digital Private Markets /Aumni (A JP Morgan Chase Company), you will solve complex...


  • Kansas City, United States Gorilla Logic Full time

    Gorilla Logic: Mid-Level Site Reliability Engineer (SRE) Gorilla Logic provides nearshore Agile teams to Fortune 500 and SMB companies, bringing unparalleled expertise in the delivery of full-stack web, mobile, and enterprise applications. Our highly collaborative Agile Gorillas are uniquely qualified to implement complex software initiatives. With offices...


  • Salt Lake City, United States Sorenson Communications Full time

    Come be a part of our mission and make a meaningful and positive impact with the industry leading provider of language services for the Deaf and heard-of-hearing! Benefits Paid Vacation Time and Paid Sick Time and Paid Holidays k % match with immediate vesting Nationwide Medical Insurance plans and coverage (Medical, Dental/Orthodontia, Vision) ...


  • Jersey City, New Jersey, United States Devexperts Full time

    Company DescriptionDevexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.By becoming a part of Devexperts, you'll become a part of a company that fosters self-improvement and actively seeks...


  • Jersey City, United States Veterans Sourcing Group LLC Full time

    Site Reliability Engineer (AWS) (SRE) Jersey City, NJ- onsite 3 days/ week 12 month minimum contract w/ possible full time conversion Roles And Responsibilities Design, code, test, and deliver software to automate manual operational work Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure of incidents Engage with...


  • Kansas City, United States Gorilla Logic Full time

    Gorilla Logic Overview Gorilla Logic provides nearshore Agile teams to Fortune 500 and SMB companies, bringing unparalleled expertise in the delivery of full-stack web, mobile, and enterprise applications. Our highly collaborative Agile Gorillas are uniquely qualified to implement complex software initiatives. With offices in the United States, Costa Rica,...


  • Jersey City, United States Hispanic Technology Executive Council Full time

    At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for our teammates...


  • Jersey City, United States DevExperts Full time

    Devexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide. By becoming a part of Devexperts, you’ll become a part of a company that fosters self-improvement and actively seeks out-of-the-box ideas. Our...