Senior HPC Systems Engineer

2 months ago


Mountain View, California, United States ASRC Federal Holding Company Full time

Job Title

Senior HPC Systems Engineer

Location

NASA/AMES, MOFFETT FIELD-CA026

Job Description

ASRC Federal Holding Company is seeking a Senior HPC Systems Engineer to support Inuteq LLC.

ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and the assimilation of industry best practices. This role primarily provides development for Supercomputing Batch Scheduling, with secondary support for Supercomputing Systems Administration, on our NASA NACS High Performance Computing (HPC) contract.

Summary

The successful candidate will be an active supporting member of the ASRC Federal team, reporting directly to the Manager of the Application Performance and Productivity (APP) group and matrixed directly to the Supercomputing Systems Team Manager.

Key Responsibilities

  • Designs, deploys, and maintains production HPC clusters with more than 2,000 nodes, InfiniBand interconnects, and 100+ petabytes of data storage.
  • Writes and shepherds scalable feature designs through the entire software development process, from requirements and use cases to release.
  • Designs and develops scripts for system administration, monitoring, and usage reporting.
  • Modifies existing software to correct errors and/or improve performance.
  • Designs and develops scripts for system regression and performance testing of file systems (Lustre), the scheduler (PBS), interconnects (HDR/NDR, Slingshot), high availability, etc.
  • Troubleshoots, isolates and resolves application, system and other technical problems (hardware, software, and network).
  • Understands research use cases, researches and deploys new technologies, defining cost, performance and other trade-offs.
  • Manages and maintains tools for configuration management (HPCM, Ansible, and Git), resource management, scheduling, and all necessary aspects of HPC in accordance with best practices.
  • Researches, deploys and manages networking and security infrastructure, including development of policies and procedures.
  • Assists in developing and writing proposals and publications.
  • Creates and provides clear documentation.
  • Mentors junior staff and cross-trains peers.
  • Provides after-hours and weekend support as required.
  • Performs moderate Supercomputing System Administration that contributes to:
      • Day-to-day operations of the Linux HPC clusters and storage systems
      • Proactive monitoring, analysis, and correction of system issues
      • Development of scripts to automate repetitive tasks or tools to enhance support of the HPC systems
      • System performance analysis and tuning
      • Building, installing, and supporting user-requested software
      • Supporting evaluation and assessment of new HPC technology
      • Resolving user-reported issues and managing support ticket requests in Remedy

Requirements

  • Bachelor's degree in computer science or related field
  • Strong computer science background with in-depth systems-level knowledge in operating systems and networking
  • A minimum of 10 years of experience administering HPC systems and scheduling software (PBS, Slurm, or Moab/Torque)
  • A minimum of 10 years of experience in systems programming in heterogeneous, multi-platform HPC environments
  • Strong ability to analyze, debug and maintain the integrity of an existing code base
  • Demonstrated equivalent of 5 years of Linux/UNIX user support experience and hands-on experience with administration of Linux systems
  • Experience working with HPC applications and proficiency in at least one of C, C++, or Fortran
  • Superior scripting skills and excellent attention to detail; proficiency in at least one of Python, Perl, or Bash
  • Strong ability to interact with customers to understand needs, elicit requirements, and get feedback on prototype solutions
  • Excellent communication and people skills; excellent time management and organizational skills
  • Experience with system configuration management tools (e.g., Puppet, Chef, Ansible)
  • Experience with revision control software (e.g., CVS, SVN, Git)
  • Track record of delivering commercial-quality software on schedule through multiple release cycles
  • Proficiency at technical writing

Preferred Skills

  • Strong analysis and problem-solving skills for debugging and optimizing applications
  • Familiarity/proficiency with OpenMP and Message Passing Interface (MPI) programming
  • Experience with Lustre and InfiniBand
  • Experience with cloud technologies (AWS, Azure, GCP), OpenStack or Kubernetes is a plus