Site Reliability Engineer

4 days ago


Los Angeles, United States Adastra replica Full time
Job Description
Our client is looking for an experienced Site Reliability Engineer to design, operate, maintain, and scale mission-critical infrastructure and products. Products include (but are not limited to) automated Hardware-In-The-Loop (HITL) data analysis systems, vehicle configuration sign-off tools, continuous integration systems for simulation and flight software, and telemetry data analysis tooling. Working with a small team, the ideal candidate is a self-starter who has skills spanning software development and product operations and who thrives in a fast-paced, challenging environment.
Responsibilities:
    • Develop automation to deploy and manage compute resources both on-premises and in the cloud
    • Deploy, operate, maintain, and scale mission-critical products and services
    • Partner closely with cross-functional groups to build highly scalable, operable and maintainable products
    • Exercise extreme ownership of the problem space and provide end-user support for internal customers
    • Deploy and manage core infrastructure such as servers, databases, monitoring and storage
    • Engage in and improve the whole software development lifecycle of services -- from concept and design through test, deployment, and maintenance
    • Collaborate closely with IT to manage underlying infrastructure
    • Participate in and support off-site operations
    • Practice sustainable incident response
Basic Qualifications:
    • B.S. in Aerospace or Mechanical Engineering, Computer Science (CS), Computer Engineering (CE), Electrical Engineering (EE), or similar from an accredited university
    • 2+ years of experience in the development of aircraft, missile, spacecraft or similar electronic/avionic systems
    • 3+ years of Site Reliability or DevOps experience
    • 3+ years of experience with Linux operating systems
    • Experience with Puppet, Ansible, or other automation frameworks
    • Experience with source code and version control tools such as SVN and Git
    • Automation experience in Bash, Python, and other scripting languages
    • Must be comfortable working as part of a high-performance team to deliver mission-critical products and services on schedule
Preferred Qualifications:
    • 5+ years of DevOps, System Administration, or Site Reliability Engineering
    • 5+ years of experience with Linux operating systems
    • 3+ years of experience with scripting development frameworks in Bash or Python.
    • Strong understanding of Docker, Vagrant, and Kubernetes, or equivalent technologies.
    • Strong grasp of virtualization and hypervisor technologies (e.g. VirtualBox, VMware, Nutanix)
    • Experience with issue tracking and management systems such as JIRA
    • Understanding of databases and data modeling.
    • Experience maintaining and managing dozens of engineering workstations and servers
    • Ability to assess performance bottlenecks and identify performance improvement techniques.
    • Strong networking knowledge of TCP/IP.
    • Demonstrates exceptional communication skills with the ability to clearly communicate with internal customers, peers, and management
    • Demonstrated skills in application development in one or more high-level programming languages (e.g. C++, C#, Rust)
    • Day of Launch and systems testing support, including vehicle network, vehicle configuration sign-off, tool deployment, etc.
    • Demonstrated leadership of or within a small project team, either in a current role or on a project-based team in school
What We Look For
  • Critical thinking: Our client's engineers understand the "why" behind all design decisions, operational events and test outcomes.
  • Ability to deal with ambiguity: there is no roadmap, and our client's engineers must be comfortable defining their own pathway to an objective.
  • End-to-end ownership: projects are delivered fully complete and ready for flight; there is no one to pick up the slack of partially complete work.
ITAR Requirement
This position requires access to information protected under US export control laws, including the International Traffic in Arms Regulations and/or the Export Administration Regulations. As such, US person status (including US citizens, permanent residents, asylees, and refugees) is a required qualification for this position.
Equal Opportunity Employer
This client is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy), sexual orientation, gender identity and/or expression, national origin, protected veteran status, disability, genetics, or citizenship status (when otherwise legally authorized to work and access export-controlled data) and will not be discriminated against on the basis of such characteristics or any other status protected by the laws or regulations in the locations where we operate. We encourage applicants of all ages.
Benefits:
  • Unlimited PTO (and people actually take vacation time)
  • Great healthcare coverage (including Dental, Vision, Disability & Life Insurance)
  • Market Value Compensation
  • Working on incredible technology with brilliant people
Company Overview for AdAstra
This position is a direct hire, permanent placement for a client of AdAstra. AdAstra's mission is building thriving teams within elite aerospace organizations.
Our commitment is to curate and close premier talent for our partners, achieved via specialized technical screening, personalized culture assessment, and high-touch candidate engagement. We are motivated to foster boundless team satisfaction and catalyze innovation for future generations, enabling inconceivable technology from Earth to the stars.
Connect with us on LinkedIn
Connect with us on Facebook