**Senior Site Reliability Engineer, Data**

5 days ago


Hawthorne, California, United States | SpaceX | Full time
About SpaceX

SpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. Our mission is to reduce space transportation costs and enable the colonization of Mars.

Job Summary

We are seeking a highly skilled **Senior Site Reliability Engineer, Data** to join our team. As a key member of our Data Team, you will be responsible for designing, implementing, and maintaining scalable and reliable data systems that support our mission-critical applications.

Key Responsibilities
  • Design and implement sharded and geo-redundant distributed systems in multiple data centers
  • Advance existing deployment, monitoring, and alerting infrastructure to support a multi-region environment
  • Manage petabyte-scale bare metal compute clusters
  • Closely collaborate with engineers across all programs to create highly operable, scalable, and maintainable products
  • Engage throughout the whole software development lifecycle of services -- from inception to design, deployment, operation, and iterative refinement
  • Focus on performance bottlenecks and performance improvement techniques
Requirements
  • Bachelor's degree in computer science, engineering, math, or a scientific discipline and 5 years of software development experience, OR 7+ years of professional experience building software with site reliability or DevOps in lieu of a degree
  • Experience with Linux operating systems
Preferred Skills and Experience
  • 5+ years of rigorous experience with site reliability or DevOps
  • Experience with Kubernetes and Istio for on-premises deployment
  • Experience with in-stream data processing and analytics using open source platforms such as Apache Kafka, Spark, HBase, HDFS, and Flink
  • Experience troubleshooting hardware and network-layer issues
  • Programming experience in Python, C#, Java, Scala, or similar languages
  • Good understanding of version control, testing, continuous integration, build, deployment, and monitoring
Additional Requirements
  • Willing to work extended hours and weekends when needed
Compensation and Benefits

SpaceX offers a competitive salary and benefits package, including long-term incentives, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short- and long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks.


