Senior Site Reliability Engineer, PLM Operations

2 weeks ago


Stanford, United States Tesla Full time
About the Role

We are seeking a highly skilled Senior Site Reliability Engineer to join our PLM Operations team at Tesla. As a key member of our team, you will be responsible for ensuring the reliability and performance of our 3DExperience services running on on-prem Kubernetes.

Key Responsibilities
  • Define Service Level Objectives (SLOs) around latency, traffic, errors, and saturation.
  • Maintain Tesla-custom Helm Charts to deploy highly customized and evolving 3DExperience services.
  • Modernize our deployment infrastructure using custom GitHub Actions, ArgoCD, Atlantis, and Terraform.
  • Achieve high performance services using tools like Prometheus, Grafana, Catchpoint, Splunk, and OpsGenie.
  • Be in an on-call rotation, manage incidents as Incident Commander, and write actionable incident reports.
  • Manage tasks via Jira for observability and human capacity planning.
  • Write and review design documents - testing frameworks, deployment models, environment definitions, etc.
Requirements
  • Deep networking experience, e.g., experience troubleshooting outages from L7 to L3, experience contributing to infra or networking GitHub repos or publications.
  • Deep Oracle Database experience, e.g., indexing deltas, schema migrations.
  • Docker/Kubernetes, e.g., performed kubelet upgrades in-situ, used skopeo or CRI-O intentionally, configured containerd.
  • Diagnosing problems in legacy enterprise Java stacks.
  • Installing, managing, or using 3DExperience, or similar experience with other PLM software.
  • Outstanding experience with Scientific computing or LIMS.
  • Deep understanding of hypervisor technology (VMware).
What We Offer

As a full-time Tesla employee, you will be eligible for a competitive salary, cash and stock awards, and a comprehensive benefits package, including medical, dental, and vision plans, 401(k) with employer match, and more.

Additionally, you will have access to Tesla's Employee Assistance Program, sick and vacation time, paid holidays, and a range of other benefits.

Voluntary benefits include critical illness, hospital indemnity, accident insurance, theft and legal services, and pet insurance.

Expected compensation: $104,000 - $348,000/annual salary, depending on level, plus cash and stock awards, and benefits.



  • Stanford, United States Tesla Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our PLM Operations team at Tesla. As a key member of our team, you will be responsible for ensuring the reliability and performance of our 3DExperience services running on on-prem Kubernetes.Key ResponsibilitiesDefine Service Level Objectives (SLOs) around latency,...


  • Stanford, United States Tesla Full time

    Job SummaryWe are seeking a highly skilled Staff Site Reliability Engineer to join our PLM Operations team at Tesla. As a key member of our team, you will be responsible for ensuring the reliability and performance of our PLM systems, which are critical to the success of our engineering design tools.Key ResponsibilitiesDefine Service Level Objectives (SLOs)...


  • Stanford, United States Rubrik Job Board Full time

    Job Title: Senior Site Reliability EngineerRubrik is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our infrastructure services and ensuring they have the capacity for future growth.Key Responsibilities:Ensure high availability and...


  • Stanford, California, United States Rubrik Job Board Full time

    About the RoleRubrik is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our infrastructure services and ensuring they have the capacity for future growth.Key ResponsibilitiesHigh Availability and Durability: Ensure the high...


  • Stanford, California, United States Rubrik Job Board Full time

    About the RoleRubrik is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for ensuring the high availability and durability of our databases, establishing best practices for internal teams to write performant SQL queries, and performing periodic database upgrades with...


  • Stanford, California, United States Rubrik Job Board Full time

    Job DescriptionRubrik is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the smooth operation of our cloud-based infrastructure and services.Key Responsibilities:Database Management: Ensure high availability and durability of our databases, and establish best...


  • Stanford, United States Foundry Technologies, Inc. Full time

    About FoundryFoundry Technologies, Inc. is a pioneering company that aims to revolutionize the way we access and utilize compute capacity. Our mission is to make AI compute universally accessible and useful, and we're building a new breed of public cloud to achieve this goal.We're a dynamic and rapidly growing organization, backed by top investors and...


  • Stanford, United States Tesla Full time

    About the RoleWe're seeking a seasoned Site Reliability Engineer to join our team at Tesla, where you'll play a critical role in designing and building the next-generation server-side infrastructure to support our growing fleets of electric vehicles.As a key member of our team, you'll be responsible for driving the migration of large-scale, distributed fleet...


  • Stanford, California, United States PsiQuantum Full time

    About the RoleWe are seeking an experienced Senior Mechanical Design Engineer to join our team at PsiQuantum, a pioneering company in the field of quantum computing. As a key member of our Mechanical Engineering team, you will play a crucial role in designing and specifying components and assemblies for our advanced electronic and optical modules.Key...


  • Stanford, United States Tesla Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our AI Infrastructure team at Tesla. As a key member of our team, you will be responsible for maintaining and improving our platform to ensure our Full-Self-Driving (FSD), Tesla Bot & Dojo engineering teams have the necessary tools and resources to be productive.Key...

  • Senior SCADA Engineer

    3 weeks ago


    Stanford, United States Tesla Full time

    Job Title: Senior SCADA EngineerAt Tesla, we're seeking a highly skilled Senior SCADA Engineer to join our Application and SCADA team. As a key member of our team, you will be responsible for designing and deploying SCADA solutions for utility sites, working closely with our engineering organization to develop a standardized platform for project cost...


  • Stanford, California, United States Tesla Full time

    About the RoleWe are seeking a highly skilled Mechanical Design Engineer to join our Power Electronics team at Tesla. As a key member of our team, you will be responsible for designing cutting-edge power converters used across various Tesla products, including vehicles, energy storage, solar, and manufacturing.Key ResponsibilitiesDesign and develop...


  • Stanford, United States Tesla Full time

    Job Title: Senior Service NPI EngineerTesla is seeking a highly skilled Senior Service NPI Engineer to join our Energy Service Engineering team. As a key member of our team, you will play a critical role in ensuring the smooth operation of our diverse fleet of Energy Products.Key Responsibilities:Proactive Problem Solver: Identify and resolve complex...


  • Stanford, United States Tesla Full time

    Job Title: Senior Battery Test EngineerAs a Senior Battery Test Engineer at Tesla, you will play a critical role in designing and implementing reliability testing for our battery systems. Your expertise will be essential in ensuring the quality and durability of our products.Responsibilities:Lead reliability test execution for all Industrial & Residential...


  • Stanford, United States Tesla Full time

    Job Title: Reliability Engineer for Power DistributionWe are seeking a highly skilled Reliability Engineer to join our team at Tesla. As a Reliability Engineer for Power Distribution, you will play a key role in designing and implementing reliability solutions for our high voltage distribution systems.Key Responsibilities:Design and implement accelerated...


  • Stanford, United States Foundry Technologies, Inc. Full time

    About FoundryFoundry Technologies, Inc. is revolutionizing the cloud computing industry by making AI compute universally accessible and useful. Our mission is to orchestrate the world's compute capacity, rendering it accessible and useful for all.We are a dynamic and rapidly growing organization, backed by Sequoia, Lightspeed, Jeff Dean, Eric Schmidt, and...


  • Stanford, United States Tesla Full time

    Job Title: Reliability Engineer for Drive InvertersWe are seeking a highly skilled Reliability Engineer to join our team at Tesla. As a Reliability Engineer for Drive Inverters, you will play a critical role in designing and developing reliable high voltage power modules and components for our Tesla Semi.Key Responsibilities:Design and develop accelerated...


  • Stanford, United States Tesla Full time

    Job Title: Reliability Characterization EngineerAs a Reliability Characterization Engineer at Tesla, you will play a crucial role in enhancing the reliability of our innovative Industrial Energy, Residential Energy, Charging, and Solar products. Your primary responsibility will be to investigate underlying mechanisms of reliability test failures during the...


  • Stanford, California, United States Supernal Full time

    Senior Battery Management EngineerAt Supernal, we're pushing the boundaries of human possibility with our electric vertical take-off and landing (eVTOL) vehicle and the ground-to-air ecosystem to support the emerging Advanced Air Mobility (AAM) industry. We're committed to creating a sustainable, integrated, and human-centered ecosystem that meets the high...


  • Stanford, United States Tesla Full time

    About the RoleWe are seeking a highly skilled Mechanical Reliability Engineer to join our team at Tesla, working on the design and development of our humanoid robot, the Tesla Bot. As a key member of our Design for Reliability team, you will play a critical role in ensuring the reliability and performance of the bot's mechanical components and...