GNC Site Reliability Engineer

1 week ago


Hawthorne, California, United States SpaceX Full time
Job Summary

SpaceX is seeking a highly skilled GNC Site Reliability Engineer to play a critical role in the development of our mission-critical products.

Key Responsibilities
  • Deploy, upgrade, operate, and scale our suite of mission-critical GNC products and services.
  • Provision and maintain virtual and physical servers, ensuring high availability and performance.
  • Collaborate with the SpaceX HPC team to monitor and maintain a 4000+ thread HPC cluster, optimizing its performance and efficiency.
  • Work closely with GNC software engineers to create highly operable and maintainable products, ensuring seamless integration with our spacecraft systems.
  • Add monitoring for web applications and respond to outages, minimizing downtime and ensuring business continuity.
  • Manage the underlying computational infrastructure of GNC in collaboration with IT, ensuring scalability and reliability.
  • Engage in and improve the whole lifecycle of services, from inception and design, through deployment, operation, and refinement, ensuring continuous improvement and innovation.
  • Make recommendations for future hardware purchases, aligning with our company's strategic goals and objectives.
  • Practice sustainable incident response and postmortems, identifying root causes and implementing corrective actions to prevent future incidents.
  • Provide end-user support to GNC engineering for products, becoming an expert on analysis applications and supporting users in troubleshooting and pointing to features.
  • Configure automated deployment pipelines for web applications, ensuring efficient and reliable deployment.
  • Develop or improve GNC web applications and tools for better usability, maintainability, and robustness, enhancing the overall user experience.
  • Demonstrate and document new software changes, such as operating system upgrades, shared filesystem changes, or major tool rollouts, ensuring transparency and accountability.
  • Focus on performance bottlenecks and performance improvement techniques, optimizing our systems for maximum efficiency and performance.
Requirements
  • Bachelor's degree in computer science, information systems/IT, engineering, math, or scientific discipline and 2+ years of software development experience OR 4+ years of professional experience building software with site reliability or DevOps in lieu of a degree.
  • Experience with Linux operating systems, ensuring proficiency in system administration and troubleshooting.
  • Experience with Python and Python-based development frameworks, ensuring expertise in programming and software development.
Preferred Skills and Experience
  • 2+ years of systems administration, site reliability engineering, or DevOps experience, ensuring a strong foundation in system administration and software development.
  • 2+ years of experience with Python and Python-based development frameworks, ensuring expertise in programming and software development.
  • 2+ years of Linux experience, ensuring proficiency in system administration and troubleshooting.
  • Expertise with Docker, Vagrant, and Kubernetes or similar technologies, ensuring familiarity with containerization and orchestration.
  • Extensive experience with configuration management tools such as Ansible, Puppet, Terraform, ensuring proficiency in infrastructure automation.
  • Experience with build systems (Make, Bazel / Pants / Buck, Gradle) and package management tools (pip, npm), ensuring expertise in software development and deployment.
  • Strong understanding of virtualization and hypervisor technologies, ensuring familiarity with virtualization and cloud computing.
  • Understanding of databases and data modeling, ensuring expertise in data management and analysis.
  • Experience with automatically managing dozens or hundreds of servers, ensuring proficiency in system administration and scalability.
  • Strong networking knowledge of TCP/IP, ensuring expertise in network administration and troubleshooting.
  • Experience scaling web applications and optimizing applications for performance, ensuring expertise in software development and deployment.
  • Professional experience with standard front-end technologies like modern HTML, CSS, JavaScript (we use AngularJS, Polymer,, React, and more), REST, JSON, ensuring expertise in web development and user experience.
  • Solid understanding of UI/UX design to provide intuitive applications, ensuring expertise in user experience and interface design.
  • Experience with high-performance computing systems or large-scale data analysis systems, ensuring expertise in high-performance computing and data analysis.
Compensation and Benefits

SpaceX offers a competitive salary and benefits package, including comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks.

As an exempt employee, you will be eligible for 5 days of sick leave per year and 3 weeks of paid vacation. You will also have access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks.

SpaceX is an Equal Opportunity Employer and is committed to diversity and inclusion in the workplace. We welcome applications from qualified candidates who share our values and are passionate about making a difference in the space industry.



  • Hawthorne, California, United States SpaceX Full time

    Job SummarySpaceX is seeking a highly skilled GNC Site Reliability Engineer to play a critical role in the development of our mission-critical products.Key ResponsibilitiesDeploy, upgrade, operate, and scale our suite of mission-critical GNC products and servicesProvision and maintain virtual and physical serversWork with the SpaceX HPC team to monitor and...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleAt SpaceX, we're pushing the boundaries of space exploration and development. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability and scalability of our systems.Key ResponsibilitiesDesign, develop, and test automation tools to deploy and manage applications on-premises and in the cloud.Deploy and manage core...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleAt SpaceX, we're pushing the boundaries of space exploration and development. As a Site Reliability Engineer, you'll play a critical role in designing, developing, and testing key aspects of our in-house solution for analysis, simulation, and prototyping of software in support of all SpaceX flight systems.Key ResponsibilitiesAutomation and...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleSpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. As a Site Reliability Engineer, Data, you will play a crucial role in ensuring the reliability and scalability of our mission-critical applications.Key ResponsibilitiesDesign and implement sharded and geo-redundant distributed systems to...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleAt SpaceX, we're pushing the boundaries of space exploration and development. As a Site Reliability Engineer, Data, you'll play a critical role in ensuring the reliability and scalability of our mission-critical applications.Key ResponsibilitiesDesign and implement sharded and geo-redundant distributed systems in multiple data centersAdvance...


  • Hawthorne, California, United States SpaceX Full time

    Guidance Navigation and Control Operations Engineer at SpaceXSpaceX is on a mission to revolutionize space exploration, driven by the vision of making human life multi-planetary. We are actively developing cutting-edge technologies to facilitate this ambitious goal.As a Guidance Navigation and Control Operations Engineer, you will play a pivotal role in...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleSpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. As a Site Reliability Engineer, Data, you will play a crucial role in ensuring the reliability and scalability of our mission-critical applications.Key ResponsibilitiesDesign and implement sharded and geo-redundant distributed systems to...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Data team at SpaceX. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our data systems.Key ResponsibilitiesDesign and implement scalable and reliable data systems to support our mission-critical...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleSpaceX is seeking a highly skilled GNC Site Reliability Engineer to join our team. As a key member of our Guidance, Navigation, and Control (GNC) team, you will be responsible for operating and scaling custom-built mission-critical products.Key ResponsibilitiesDeploy, Upgrade, and Maintain Mission-Critical SystemsProvision and maintain virtual...


  • Hawthorne, California, United States SpaceX Full time

    About SpaceXSpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. Our mission is to reduce space transportation costs and enable the colonization of Mars.Job SummaryWe are seeking a highly skilled **Senior Site Reliability Engineer, Data** to join our team. As a key member of our Data Team, you will be...


  • Hawthorne, California, United States SpaceX Full time

    At SpaceX, we are driven by the vision of a future where humanity explores the cosmos. Our mission is to develop cutting-edge technologies that will enable human life on Mars and beyond. POSITION: GNC ENGINEER (STARSHIP)As a GNC Engineer, you will play a pivotal role in advancing the flight control systems for our state-of-the-art rockets, including the...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleAt SpaceX, we're pushing the boundaries of space technology and innovation. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability and performance of our systems, enabling us to achieve our ambitious goals.Key ResponsibilitiesDevelop automation scripts to deploy and manage compute resources on-premises and in...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleSpaceX is a pioneering space exploration company that aims to make humanity a multi-planetary species. As a Site Reliability Engineer, Data, you will play a crucial role in ensuring the reliability and scalability of our mission-critical applications.Key ResponsibilitiesDesign and implement sharded and geo-redundant distributed systems to...


  • Hawthorne, California, United States SpaceX Full time

    At SpaceX, we believe in a future where humanity explores the cosmos, and we are committed to developing the technologies that will make this vision a reality. Our mission is to enable human life on Mars.APPLICATION RELIABILITY ENGINEERThe application software division serves as the backbone of SpaceX, crafting essential applications that enhance launch...


  • Hawthorne, California, United States SpaceX Full time

    At SpaceX, we believe in a future where humanity explores the cosmos, and we are committed to developing the technologies that will make this vision a reality. Our mission is to enable human life on Mars.APPLICATION RELIABILITY ENGINEERThe application software division serves as the backbone of SpaceX, crafting essential applications that facilitate the...


  • Hawthorne, California, United States SpaceX Full time

    At SpaceX, we believe in a future where humanity explores the cosmos, and we are dedicated to developing the technologies that will make this vision a reality. Our mission is to enable human life on Mars.APPLICATION RELIABILITY ENGINEERThe application software division serves as the backbone of SpaceX, crafting essential applications that facilitate the...


  • Hawthorne, California, United States SpaceX Full time

    About the RoleAt SpaceX, we're pushing the boundaries of what's possible in space exploration and development. As a Site Reliability Engineer - Data, you'll play a critical role in ensuring the reliability and scalability of our data systems, enabling us to accelerate launch vehicle production and flight, as well as support the growth of our Starlink...


  • Hawthorne, California, United States SpaceX Full time

    Job SummarySpaceX is seeking a highly skilled Mission Control Systems Engineer to join our team. As a key member of our Guidance Navigation and Control (GNC) team, you will be responsible for designing and implementing mission trajectories for our Falcon launch vehicles.Key ResponsibilitiesDesign and optimize mission trajectories for Falcon 9 and Falcon...


  • Hawthorne, California, United States Space Exploration Technologies Corporation Full time

    Position OverviewAs a Senior Guidance, Navigation, and Control Engineer, you will play a pivotal role in advancing the technical frontiers of flight control systems for some of the most sophisticated rockets in the industry. Your expertise will contribute to the development of our next-generation spacecraft, designed for human spaceflight.Key...


  • Hawthorne, California, United States SpaceX Full time

    At SpaceX, we believe in a future where humanity explores the cosmos, and we are dedicated to developing the technologies that will make this vision a reality, with the ultimate aim of establishing human life on Mars. As a Senior Guidance, Navigation, and Control (GNC) Engineer on the Falcon team, you will play a pivotal role in mission design and the...