Staff Site Reliability Engineer, Fleetnet
2 weeks ago
We're a small, expert team at Tesla creating the next-generation server-side infrastructure to support the growing fleets of Tesla products. We're looking for seasoned SREs with domain expertise in one or more of: containers, public clouds, and cloud-native apps.
Responsibilities
- Design and write software that enables rapid prototyping by development teams, ensuring the highest levels of reliability and availability.
- Drive the migration of large-scale, distributed fleet applications towards cloud-native microservices.
- Influence architectural decisions with a focus on security, scalability, and high performance.
- Automate the build and deployment of infrastructure using Docker, Kubernetes, and other orchestration technologies in a hybrid-cloud environment.
- Set up and maintain monitoring, metrics, and reporting systems for fine-grained observability and actionable alerting.
Requirements
- Experience building and maintaining SaaS infrastructure.
- Expert skills with Linux, networking, storage, and virtualization automation with tools like Kubernetes, Terraform, Ansible, Chef, and others.
- Setting up and supporting CI/CD.
- Proficiency in a high-level language like Python, Go, Ruby, and/or Java.
- Scaling through data-driven capacity planning, within both physical data centers and cloud infrastructure (AWS, GCP, or Azure).
- Troubleshooting and full-cycle incident response (mitigation, correction, prevention).
- Strong belief in spreading and acquiring knowledge through mentorship and acting like an owner.
- Smart but humble, with a bias for action and for enabling others' success.
Benefits
- Competitive pay.
- Aetna PPO and HSA plans with $0 payroll deduction.
- Family-building, fertility, adoption, and surrogacy benefits.
- Dental and vision plans with options including $0 paycheck contribution.
- Company-paid HSA contribution when enrolled in the High Deductible Aetna medical plan with HSA.
- Healthcare and Dependent Care Flexible Spending Accounts (FSA).
- LGBTQ+ care concierge services.
- 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits.
- Company-paid Basic Life, AD&D, short-term, and long-term disability insurance.
- Employee Assistance Program.
- Sick and Vacation time (Flex time for salary positions), and Paid Holidays.
- Back-up childcare and parenting support resources.
- Critical illness, hospital indemnity, accident insurance, theft, and legal services.
- Pet insurance.
- Weight Loss and Tobacco Cessation Programs.
- Tesla Babies program.
- Commuter benefits.
- Employee discounts and perks program.
$104,000 - $348,000/annual salary, depending on level, plus cash and stock awards, and benefits.
-
Stanford, United States Tesla Full time
Job Summary
We are seeking a highly skilled Staff Site Reliability Engineer to join our PLM Operations team at Tesla. As a key member of our team, you will be responsible for ensuring the reliability and performance of our PLM systems, which are critical to the success of our engineering design tools.
Key Responsibilities
Define Service Level Objectives (SLOs)...
-
Senior Site Reliability Engineer
3 weeks ago
Stanford, United States Rubrik Job Board Full time
Job Title: Senior Site Reliability Engineer
Rubrik is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our infrastructure services and ensuring they have the capacity for future growth.
Key Responsibilities:
Ensure high availability and...
-
Site Reliability Engineer
4 weeks ago
Stanford, California, United States Rubrik Job Board Full time
Job Description
Rubrik is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud-based infrastructure and services.
Key Responsibilities:
Database Management: Ensure high availability and durability of our databases, and establish best...
-
Senior Site Reliability Engineer
3 weeks ago
Stanford, California, United States Rubrik Job Board Full time
About the Role
Rubrik is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our infrastructure services and ensuring they have the capacity for future growth.
Key Responsibilities
High Availability and Durability: Ensure the high...
-
Senior Site Reliability Engineer
4 weeks ago
Stanford, California, United States Rubrik Job Board Full time
About the Role
Rubrik is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for ensuring the high availability and durability of our databases, establishing best practices for internal teams to write performant SQL queries, and performing periodic database upgrades with...
-
Senior Site Reliability Engineer, PLM Operations
2 weeks ago
Stanford, United States Tesla Full time
About the Role
We are seeking a highly skilled Senior Site Reliability Engineer to join our PLM Operations team at Tesla. As a key member of our team, you will be responsible for ensuring the reliability and performance of our 3DExperience services running on on-prem Kubernetes.
Key Responsibilities
Define Service Level Objectives (SLOs) around latency,...
-
Stanford, United States Tesla Full time
About the Role
We are seeking a highly skilled Site Reliability Engineer to join our AI Infrastructure team at Tesla. As a key member of our team, you will be responsible for maintaining and improving our platform to ensure our Full-Self-Driving (FSD), Tesla Bot & Dojo engineering teams have the necessary tools and resources to be productive.
Key...
-
Senior Site Reliability Engineer, PLM Operations
3 weeks ago
Stanford, United States Tesla Full time
About the Role
We are seeking a highly skilled Senior Site Reliability Engineer to join our PLM Operations team at Tesla. As a key member of our team, you will be responsible for ensuring the reliability and performance of our 3DExperience services running on on-prem Kubernetes.
Key Responsibilities
Define Service Level Objectives (SLOs) around latency,...
-
Reliability Engineer for Power Distribution
2 weeks ago
Stanford, United States Tesla Full time
Job Title: Reliability Engineer for Power Distribution
We are seeking a highly skilled Reliability Engineer to join our team at Tesla. As a Reliability Engineer for Power Distribution, you will play a key role in designing and implementing reliability solutions for our high voltage distribution systems.
Key Responsibilities:
Design and implement accelerated...
-
Reliability Engineer for Drive Inverters
2 weeks ago
Stanford, United States Tesla Full time
Job Title: Reliability Engineer for Drive Inverters
We are seeking a highly skilled Reliability Engineer to join our team at Tesla. As a Reliability Engineer for Drive Inverters, you will play a critical role in designing and developing reliable high voltage power modules and components for our Tesla Semi.
Key Responsibilities:
Design and develop accelerated...
-
Reliability Characterization Engineer
2 weeks ago
Stanford, United States Tesla Full time
Job Title: Reliability Characterization Engineer
As a Reliability Characterization Engineer at Tesla, you will play a crucial role in enhancing the reliability of our innovative Industrial Energy, Residential Energy, Charging, and Solar products. Your primary responsibility will be to investigate underlying mechanisms of reliability test failures during the...
-
Senior Cloud Reliability Engineer
2 weeks ago
Stanford, United States Foundry Technologies, Inc. Full time
About Foundry
Foundry Technologies, Inc. is a pioneering company that aims to revolutionize the way we access and utilize compute capacity. Our mission is to make AI compute universally accessible and useful, and we're building a new breed of public cloud to achieve this goal.
We're a dynamic and rapidly growing organization, backed by top investors and...
-
Mechanical Reliability Engineer for Tesla Bot
3 weeks ago
Stanford, United States Tesla Full time
About the Role
We are seeking a highly skilled Mechanical Reliability Engineer to join our team at Tesla, working on the design and development of our humanoid robot, the Tesla Bot. As a key member of our Design for Reliability team, you will play a critical role in ensuring the reliability and performance of the bot's mechanical components and...
-
Stanford, United States Tesla Full time
Job Summary
We are seeking a highly skilled Electronics Reliability Engineer to join our team at Tesla. As an Electronics Reliability Engineer, you will play a critical role in enhancing the reliability of our innovative Energy and Charging products.
Key Responsibilities
Conduct in-depth failure analysis and investigate the underlying mechanisms of electronic...
-
Hardware Reliability Specialist
2 days ago
Stanford, United States Wing Aviation Full time
About Wing:
Wing is a pioneering company in the field of drone delivery, offering a safe, fast, and sustainable solution for last mile logistics. Our mission is to create the preferred means of delivery for the planet, and we're committed to building a workforce that's representative of the global communities we serve.
About the Role:
We're seeking a highly...
-
Electronics Reliability Specialist
2 weeks ago
Stanford, United States Tesla Full time
Job Summary
We are seeking a highly skilled Electronics Reliability Engineer to join our team at Tesla. As a key member of our Energy and Charging product development team, you will be responsible for ensuring the reliability and quality of our innovative products.
Key Responsibilities
Conduct in-depth failure analysis and investigate the underlying mechanisms...
-
Staff Electrical Engineer, Bot
3 weeks ago
Stanford, California, United States Tesla Full time
Job Summary
Tesla is seeking a highly skilled Electrical Engineer to join our Self-Driving Hardware team. As a key member of this team, you will be responsible for designing and developing the computing hardware that enables our autonomous driving capabilities.
Responsibilities
Design and develop complex high-speed boards that deliver compute performance while...
-
Senior Cloud Infrastructure Engineer
3 days ago
Stanford, United States Foundry Technologies, Inc. Full time
About Foundry
Foundry Technologies, Inc. is revolutionizing the cloud computing industry by making AI compute universally accessible and useful. Our mission is to orchestrate the world's compute capacity, rendering it accessible and useful for all.
We are a dynamic and rapidly growing organization, backed by Sequoia, Lightspeed, Jeff Dean, Eric Schmidt, and...