**Senior Site Reliability Engineer, Data**
5 days ago
SpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. Our mission is to reduce space transportation costs and enable the colonization of Mars.
Job Summary
We are seeking a highly skilled **Senior Site Reliability Engineer, Data** to join our team. As a key member of our Data Team, you will be responsible for designing, implementing, and maintaining scalable and reliable data systems that support our mission-critical applications.
Key Responsibilities
- Design and implement sharded and geo-redundant distributed systems in multiple data centers
- Advance existing deployment, monitoring, and alerting infrastructure to support a multi-region environment
- Manage petabyte-scale bare metal compute clusters
- Closely collaborate with engineers across all programs to create highly operable, scalable, and maintainable products
- Engage throughout the whole software development lifecycle of services -- from inception to design, deployment, operation, and iterative refinement
- Identify performance bottlenecks and apply performance improvement techniques
Qualifications
- Bachelor's degree in computer science, engineering, math, or a scientific discipline and 5 years of software development experience, OR 7+ years of professional experience building software with site reliability or DevOps in lieu of a degree
- Experience with Linux operating systems
- 5+ years of rigorous experience with site reliability or DevOps
- Experience with Kubernetes and Istio for on-premise deployment
- Experience with in-stream data processing and analytics using open source platforms such as Apache Kafka, Spark, HBase, HDFS, and Flink
- Experience troubleshooting hardware and network-layer issues
- Programming experience in Python, C#, Java, Scala, or similar languages
- Good understanding of version control, testing, continuous integration, build, deployment, and monitoring
- Willing to work extended hours and weekends when needed
SpaceX offers a competitive salary and benefits package, including long-term incentives, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks.
-
Data Site Reliability Engineer
7 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: SpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. As a Site Reliability Engineer, Data, you will play a crucial role in ensuring the reliability and scalability of our mission-critical applications. Key Responsibilities: Design and implement sharded and geo-redundant distributed systems to...
-
Site Reliability Engineer, Data
4 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: At SpaceX, we're pushing the boundaries of space exploration and development. As a Site Reliability Engineer, Data, you'll play a critical role in ensuring the reliability and scalability of our mission-critical applications. Key Responsibilities: Design and implement sharded and geo-redundant distributed systems in multiple data centers. Advance...
-
Site Reliability Engineer, Data
7 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: SpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. As a Site Reliability Engineer, Data, you will play a crucial role in ensuring the reliability and scalability of our mission-critical applications. Key Responsibilities: Design and implement sharded and geo-redundant distributed systems to...
-
Data Center Engineer
7 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Data team at SpaceX. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our data systems. Key Responsibilities: Design and implement scalable and reliable data systems to support our mission-critical...
-
Reliability Engineer
7 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: At SpaceX, we're pushing the boundaries of what's possible in space exploration and development. As a Site Reliability Engineer - Data, you'll play a critical role in ensuring the reliability and scalability of our data systems, enabling us to accelerate launch vehicle production and flight, as well as support the growth of our Starlink...
-
Reliability Engineer, Data Systems
7 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: SpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. As a Site Reliability Engineer, Data, you will play a crucial role in ensuring the reliability and scalability of our mission-critical applications. Key Responsibilities: Design and implement sharded and geo-redundant distributed systems to...
-
Site Reliability Engineer
4 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: At SpaceX, we're pushing the boundaries of space exploration and development. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability and scalability of our systems. Key Responsibilities: Design, develop, and test automation tools to deploy and manage applications on-premises and in the cloud. Deploy and manage core...
-
Site Reliability Engineer
7 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: At SpaceX, we're pushing the boundaries of space exploration and development. As a Site Reliability Engineer, you'll play a critical role in designing, developing, and testing key aspects of our in-house solution for analysis, simulation, and prototyping of software in support of all SpaceX flight systems. Key Responsibilities: Automation and...
-
Senior Flight Reliability Engineer
7 days ago
Hawthorne, California, United States SpaceX Full time
Job Summary: SpaceX is seeking a highly skilled and experienced Senior Flight Reliability Engineer to join our team. As a key member of our Dragon Vehicle Reliability team, you will play a critical role in ensuring the safe and successful operation of our spacecraft. Key Responsibilities: Assess pre-flight risks and develop mitigation strategies in collaboration...
-
Senior Aerospace Reliability Engineer
2 weeks ago
Hawthorne, California, United States SpaceX Full time
SENIOR AEROSPACE RELIABILITY ENGINEER (DRAGON): SpaceX is on the lookout for a Senior Aerospace Reliability Engineer to become a vital part of our forward-thinking team. In this position, you will play a crucial role in influencing the Dragon spacecraft that facilitate missions to the International Space Station. Your duties will encompass evaluating...
-
Senior Vehicle Reliability Engineer
2 weeks ago
Hawthorne, California, United States SpaceX Full time
At SpaceX, we are driven by the vision of a future where humanity explores the cosmos, and we are dedicated to making that vision a reality. As a Senior Vehicle Reliability Engineer, you will play a crucial role in ensuring the safety and performance of our Dragon spacecraft, which are vital for transporting science, supplies, and crew to the International...
-
Hawthorne, California, United States SpaceX Full time
SpaceX was established with the vision that a future where humanity explores the cosmos is significantly more thrilling than one where we remain confined to Earth. Currently, SpaceX is at the forefront of developing technologies that will make this vision a reality, with the ultimate aim of facilitating human life on Mars. SENIOR FLIGHT RELIABILITY ENGINEER...
-
GNC Site Reliability Engineer
1 week ago
Hawthorne, California, United States SpaceX Full time
Job Summary: SpaceX is seeking a highly skilled GNC Site Reliability Engineer to play a critical role in the development of our mission-critical products. Key Responsibilities: Deploy, upgrade, operate, and scale our suite of mission-critical GNC products and services. Provision and maintain virtual and physical servers, ensuring high availability and...
-
Site Reliability Specialist
7 days ago
Hawthorne, California, United States SpaceX Full time
About the Role: At SpaceX, we're pushing the boundaries of space technology and innovation. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability and performance of our systems, enabling us to achieve our ambitious goals. Key Responsibilities: Develop automation scripts to deploy and manage compute resources on-premises and in...
-
GNC Site Reliability Engineer
1 week ago
Hawthorne, California, United States SpaceX Full time
Job Summary: SpaceX is seeking a highly skilled GNC Site Reliability Engineer to play a critical role in the development of our mission-critical products. Key Responsibilities: Deploy, upgrade, operate, and scale our suite of mission-critical GNC products and services. Provision and maintain virtual and physical servers. Work with the SpaceX HPC team to monitor and...
-
Hawthorne, California, United States SpaceX Full time
SpaceX - Senior Electrical Reliability Engineer for Avionics Systems: At SpaceX, our mission is to facilitate human existence on Mars through groundbreaking technological advancements. The Build Reliability division is on the lookout for a Senior Electrical Reliability Engineer to guarantee the dependability of avionics components. As a vital member of this...
-
Application Reliability Engineer
1 week ago
Hawthorne, California, United States SpaceX Full time
At SpaceX, we believe in a future where humanity explores the cosmos, and we are committed to developing the technologies that will make this vision a reality. Our mission is to enable human life on Mars. APPLICATION RELIABILITY ENGINEER: The application software division serves as the backbone of SpaceX, crafting essential applications that facilitate the...
-
Application Reliability Engineer
2 weeks ago
Hawthorne, California, United States SpaceX Full time
At SpaceX, we believe in a future where humanity explores the cosmos, and we are committed to developing the technologies that will make this vision a reality. Our mission is to enable human life on Mars. APPLICATION RELIABILITY ENGINEER: The application software division serves as the backbone of SpaceX, crafting essential applications that enhance launch...
-
Application Reliability Engineer
2 weeks ago
Hawthorne, California, United States SpaceX Full time
At SpaceX, we believe in a future where humanity explores the cosmos, and we are dedicated to developing the technologies that will make this vision a reality. Our mission is to enable human life on Mars. APPLICATION RELIABILITY ENGINEER: The application software division serves as the backbone of SpaceX, crafting essential applications that facilitate the...
-
Senior Flight Reliability Expert
4 days ago
Hawthorne, California, United States Space Exploration Technologies Corporation Full time
Job Summary: The Senior Flight Reliability Expert at Space Exploration Technologies Corporation is responsible for ensuring the safe and reliable operation of our flight vehicles. This individual will lead the technical efforts to mitigate risks and guarantee the success of every mission. Key Responsibilities: Develop and implement reliability engineering...