Reliability Engineer for Distributed Systems

2 days ago


Palo Alto, California, United States Tesla Full time
Company Overview

Tesla is a leading electric vehicle manufacturer accelerating the world's transition to sustainable energy. Our mission-critical systems enable our engineers to design and develop innovative solutions.

Job Summary

We are seeking a highly skilled Site Reliability Engineer to join our Design Technology Operations team. This position will be responsible for building release processes, managing Kubernetes infrastructure, measuring site performance, and participating in on-call rotations.



  • Palo Alto, California, United States Rubrik Full time

    About the Job: We are seeking an early career software engineer with a strong interest in distributed database technologies and cloud computing platforms to work on high-performance and scalable systems. As a software engineer at Rubrik, you will design, develop, test, deploy, maintain, and improve software systems that enable our customers to protect and...


  • Palo Alto, California, United States SambaNova Systems Full time

    About the Role: We are seeking an experienced Embedded Software Engineer to join our Runtime team at SambaNova Systems. The successful candidate will be responsible for designing and implementing new features for our runtime/embedded OS stack, working on system software support for the next generation RDU system, and providing tools and performance profilers...


  • Palo Alto, California, United States Clockwork Inc Full time

    Job Description: We're seeking bright, versatile software engineers who will help develop and deploy a wide range of next-generation time-sensitive applications. As a key member of our team, you'll use your coding and distributed system knowledge to contribute directly to the design and build of high-performance, reliable, and scalable systems.


  • Palo Alto, California, United States SambaNova Systems Full time

    We are seeking an exceptional Senior Software Engineer to join our Runtime team at SambaNova Systems. As a pioneer in the field of AI, we strive to push the boundaries of what is possible in high-performance computing. In this role, you will be responsible for designing and implementing novel system software solutions that enable efficient execution of AI...


  • Palo Alto, California, United States Glean Full time

    Unlock the Power of Enterprise Knowledge: We're Glean, a pioneering company that's changing the game by making knowledge work faster and more humane. Our platform is the engine that drives this transformation, connecting AI and knowledge to provide unparalleled search relevance and access to enterprise information. About the Position: We're seeking skilled...


  • Palo Alto, California, United States Tesla Full time

    Job Description: We're seeking a highly skilled Distributed Systems Engineer to join our team and contribute to the development of our cloud-based IoT platforms for renewable energy and sustainability solutions. As a key member of our engineering team, you will design, develop, and operate scalable and reliable distributed software systems that process...


  • Palo Alto, California, United States Criteo Full time

    Criteo is seeking a talented Principal Software Architect to lead the design and development of our distributed systems infrastructure. As a key member of our engineering organization, you will be responsible for defining architecture standards, guiding technical decisions, and ensuring the scalability, reliability, and performance of our systems. The ideal...


  • Palo Alto, California, United States xAI Full time

    Job Description: We are seeking a highly skilled Distributed Training Systems Engineer to join our team at xAI. As a key member of our engineering team, you will design, build, and implement large-scale distributed training systems. You will be responsible for profiling, debugging, and optimizing multi-host GPU utilization, as well as...


  • Palo Alto, California, United States SambaNova Systems Full time

    SambaNova Systems is a leading provider of full-stack, generative AI platforms for enterprise and government organizations. As a Senior Software Engineer on our Runtime team, you will play a key role in designing and implementing next-generation high-performance compute systems for AI applications at scale. We are searching for an experienced embedded...


  • Palo Alto, California, United States Tesla Full time

    Job Overview: Tesla is looking for an experienced Power Distribution Engineer - Semi to work on the design and development of high voltage distribution systems for the Tesla Semi. This role involves working closely with cross-functional teams to identify and mitigate risks associated with new architectures and technologies. You will develop and implement...


  • East Palo Alto, California, United States Amazon Full time

    Job Description: This role involves translating functional and technical requirements into detailed architecture and design, coding and testing complex system components, participating in code and design reviews, working with other teams to deliver and operate large scale, distributed services in the cloud, and overall system architecture, scalability,...


  • Palo Alto, California, United States Tesla Full time

    Job Description: Come and contribute to the development of Tesla's innovative data platform, which processes vast amounts of IoT data daily. About the Role: We are seeking an experienced software engineer with a passion for distributed systems to join our team. As a Lead Data Engineer for Large-Scale Distributed Systems, you will be...


  • Palo Alto, California, United States Clockwork Inc Full time

    Company Overview: Clockwork Inc is a pioneering startup in Silicon Valley, revolutionizing computer networking and distributed systems. Founded in 2018 by a group of researchers from Stanford University, our high-precision network clock synchronization system delivers up to nanosecond accuracy at scale, powering mission-critical enterprise applications in...


  • Palo Alto, California, United States Tesla Full time

    Job Description: We are seeking a highly skilled High Voltage Reliability Specialist to join our team at Tesla. This role involves designing reliability into high voltage distribution systems for the Tesla Semi, with a strong focus on materials and high voltage applications. In this position, you will play a key role in the reliability lifecycle of these...


  • East Palo Alto, California, United States Amazon Full time

    About the Role: We are seeking a highly skilled Database Systems Engineer, Distributed SQL to join our team at Amazon. In this role, you will be responsible for designing and developing innovative database systems that meet the needs of our customers. As a member of our team, you will have the opportunity to work on complex technical problems and contribute to...


  • Palo Alto, California, United States Rubrik Full time

    About the Role: We are seeking a highly skilled High-Performance Software Systems Engineer to join our team at Rubrik. In this role, you will take full ownership of projects from design to implementation, test and deployment. Your primary focus will be on designing, developing, and delivering hardware and OS abstraction for Rubrik CDM software services. You...


  • Palo Alto, California, United States Amazon Full time

    About the Role: We are seeking a highly experienced Distributed Systems Performance Expert to join our Amazon Redshift team. As a key member of our performance engineering team, you will play a crucial role in identifying ways to improve performance and help set the direction and priorities of other engineering teams.


  • Palo Alto, California, United States Plume Design, Inc. Full time

    We're looking for a seasoned Technical Manager with extensive experience in Customer Facing environments to lead our Site Reliability Engineering Team. This team is focused on deployments, fixes, and sustainability. The ideal candidate will have strong technical knowledge in key areas while focusing on customer satisfaction. Key Responsibilities: Supervise a...


  • Palo Alto, California, United States SoftWash Systems Full time

    We are seeking a skilled Systems Biologist and Data Engineer to join our team at CZ Biohub SF. In this role, you will develop and apply advanced data science and engineering skills to drive innovative research and discovery in systems biology. Key Responsibilities: Design and implement high-performance data processing pipelines for large-scale biological...


  • Palo Alto, California, United States Broadcom Corporation Full time

    Simplify complex systems and empower innovation with us at Broadcom Corporation! We are seeking a visionary Distributed Systems Architect to spearhead the development of our next-generation platform. This exciting opportunity comes with an annual base salary range of $141,000 - $225,000, making it a highly competitive offer in the market. As a key architect,...