HPC Linux System Administrator

3 weeks ago


Greenbelt, United States ASRC Federal Holding Company Full time
Job Description

ASRC Federal InuTeq is seeking aLinux System Administrator (HPC)to join our team in support of NASA's Center for Climate Simulation (NCCS) project. ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry best practices.

  • Configures, installs, maintains, and upgrades Linux HPC clusters (compute, storage, and network) and applications in support of research computing environments.
  • Provides end-user support for problem resolution, and training on Linux and HPC usage best practices.
  • Diagnoses, isolates, and resolves application and system technical problems.
  • Develops scripts and automation to enhance operational services and service quality.
  • Develops, implements, and documents system architectures, new capabilities, and operational standards.
  • Supports compute, storage, and network technology evaluations and assessments.
  • Recommends and implements improvements to existing HPC system management tools and processes.
  • Provides technical expertise to improve HPC cluster performance and resiliency.
  • Leads and collaborates on projects to enhance functionality in areas such as systems monitoring, configuration management, and backups
This position will interact with the HPC Operations Manager, Program Manager, Site Lead, customer, users, and site staff, attending regularly scheduled customer meetings to keepstakeholdersinformed of activities and progress, and answer inquiries concerning all aspects ofthe program. An individual at this skill level should have demonstrated problem-solving ability in relevant areas of expertise and should have an interest in mentoring and leading others in small team environments.

Requirements

Requirements:
  • Bachelor's degree (B.A/B.S.) in Computer Science, Engineering, Physics, or related course of study, or equivalent combination of education and relevant experience
  • Minimum of 8 years of Linux System Administration experience
  • Experience Managing HPC Clusters
  • Vast knowledge in trouble shooting both hardware and software, with the ability to come on site and replace hardware if needed
  • Experience managing storage servers/hardware
  • Knowledge of at least one of CentOS or RedHat, and experience maintaining and upgrading Linux.
  • Experience with the use of configuration management and orchestration tools such as Puppet, Ansible, Chef, Cobbler.
  • Experience with system management, monitoring/alerting tools (e.g., Ganglia, Nagios, Prometheus, Zabbix).
  • Understanding of infrastructure technologies including server, storage, network, database, and virtualization.
  • Demonstrated ability to quantify, analyze, determine root cause, and resolve system and communication network issues, and develop preventive actions.
  • Ability to work independently as well as collaboratively within a team, to include the ability to lead moderately complex projects or small project teams.
  • Excellent written and oral communication skills for interacting with customers, team members, and management.
  • Proactive and innovative, with ability to foresee and prevent potential problems.
  • Organizational and time management skills, exceptional follow-through, and ability to manage multiple priorities.
  • Passion for providing excellent customer service.
  • Experience providing support for large Linux HPC clusters used for scientific computing.
  • Scripting/programming capabilities with Bash, Python, Perl.
  • Shows ability to execute and maintain a Standard Security Protocol
  • Willing to track tasks with persistent record keeping and project management
  • US Citizenship is Required and the ability to obtain a Public Trust Clearance
Preferred Skills:
  • Experience integrating systems or designing solutions for HPC workloads.
  • Experience with MPI and OpenMP.
  • Experience with performance benchmarking using profilers and debuggers to recommend code improvements for scalability and performance.
  • Experience with distributed and parallel file systems such as BeeGFS, GPFS, Lustre, NFS, Ceph.
  • Familiarity with high-performance networks such as Infiniband, and with network management.
  • Demonstrated ability to perform complex performance analysis including system processes, I/O subsystems, networks and other related components.
  • Experience installing, configuring, and maintaining workload management tools (such as Slurm, LSF, PBS, etc.).
  • Interest or previous experience in technologies including but not limited to Singularity, Docker, Spack and new emerging technologies.


ASRC Federal and its Subsidiaries are Equal Opportunity / Affirmative Action employers. All qualified applicants will receive consideration for employment without regard to race, gender, color.

EEO Statement

ASRC Federal and its Subsidiaries are Equal Opportunity / Affirmative Action employers. All qualified applicants will receive consideration for employment without regard to race, gender, color, age, sexual orientation, gender identification, national origin, religion, marital status, ancestry, citizenship, disability, protected veteran status, or any other factor prohibited by applicable law.

  • Greenbelt, United States ASRC Federal Holding Company Full time

    Job DescriptionASRC Federal is searching for a HPC Linux Systems Administrator to support Inuteq LLC out of Greenbelt, MD ASRC Federal InuTeq is seeking aLinux System Administrator (HPC)to join our team in support of NASA's Center for Climate Simulation (NCCS) project. ASRC Federal InuTeq provides High Performance Computing services throughout the HPC...


  • Greenbelt, United States Cornerstone Defense Full time

    Location: Greenbelt, Maryland Type: Contract Job #2746 Title: Linux System Administrator Location: Greenbelt, MD *Clearance: *Active TS/SCI w/ Polygraph needed to apply * Company Overview: Cornerstone Defense is the Employer of Choice within the Intelligence, Defense, and Space communities of the U.S. Government. Realizing early on that our...


  • Greenbelt, Maryland, United States GAMA-1 Technologies Full time

    Provide support for Linux systems running on Red Hat enterprise Linux. Activities includes troubleshooting, patching, backing up / restoring of VM's, deploying new VM's and working with application teams on Middleware support. Lead team of Junior admins in resolving system issuesTask Description:Linux Administrative activities include:Maintaining a stable...


  • Greenbelt, United States Vibrint Full time

    Vibrint is a trusted provider of mission-critical systems and analysis that transform our customers' capacity and capability in harvesting and harnessing data. Working alongside many of the most talented professionals in public service, we work tirelessly to create and sustain new solutions and services that meet the stringent demands across a variety of...


  • Greenbelt, United States Pearl River Technologies LLC Full time

    The IT System Administrator is responsible to help employ standards, methodologies, and technical solutions for a team maintaining Windows, Linux, VMWare, on-prem, and AWS environments, and a strong security posture. This position supports a group of flight dynamics engineers in the GSFC Flight Dynamics Facility (FDF). The FDF is a NASA Mission Essential...


  • Greenbelt, United States Pearl River Technologies LLC Full time

    Job Location: Greenbelt, MD (Hybrid) Description The IT System Administrator is responsible to help employ standards, methodologies, and technical solutions for a team maintaining Windows, Linux, VMWare, on-prem, and AWS environments, and a strong security posture. This position supports a group of flight dynamics engineers in the GSFC Flight Dynamics...


  • Greenbelt, United States Pearl River Technologies Full time

    Job DescriptionJob DescriptionSalary: Job Location: Greenbelt, MD (Hybrid)Description The IT System Administrator is responsible to help employ standards, methodologies, and technical solutions for a team maintaining Windows, Linux, VMWare, on-prem, and AWS environments, and a strong security posture.This position supports a group of flight dynamics...

  • System Engineer III

    1 week ago


    Greenbelt, United States Vibrint Full time

    Vibrint is a trusted provider of mission-critical systems and analysis that transform our customers' capacity and capability in harvesting and harnessing data. Working alongside many of the most talented professionals in public service, we work tirelessly to create and sustain new solutions and services that meet the stringent demands across a variety of...

  • System Engineer III

    58 minutes ago


    Greenbelt, United States Vibrint Full time

    Vibrint is a trusted provider of mission-critical systems and analysis that transform our customers' capacity and capability in harvesting and harnessing data. Working alongside many of the most talented professionals in public service, we work tirelessly to create and sustain new solutions and services that meet the stringent demands across a variety of...


  • Greenbelt, United States Adnet Systems Full time

    IT016 Systems Engineer The Computational and Information Sciences and Technology Office (CISTO) at the NASA Goddard Space Flight Center (GSFC) provides high-end information technology systems and services for science, including assessment and evaluation of new information technologies, hardware, and software and manages the NASA Center for Climate...


  • Greenbelt, United States Adnet Systems Full time

    IT016 Systems Engineer The Computational and Information Sciences and Technology Office (CISTO) at the NASA Goddard Space Flight Center (GSFC) provides high-end information technology systems and services for science, including assessment and evaluation of new information technologies, hardware, and software and manages the NASA Center for Climate Simulation...


  • Greenbelt, United States Halvik Full time

    Job DescriptionJob DescriptionHalvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special!SUMMARY:As a Systems Middleware Engineer with NASA...


  • Greenbelt, United States Halvik Full time

    Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA SEWP, you will be a key member...


  • Greenbelt, United States Halvik Full time

    Job DescriptionJob DescriptionHalvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special!SUMMARY:As a Systems Middleware Engineer with NASA...


  • Greenbelt, United States Halvik Full time

    Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA SEWP, you will be a key member...


  • Greenbelt, United States Halvik Full time

    Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA SEWP, you will be a key member...

  • Technical Manager I

    3 weeks ago


    Greenbelt, United States ASRC Federal Holding Company Full time

    Job DescriptionASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry...

  • Technical Manager I

    5 days ago


    Greenbelt, United States ASRC Federal Holding Company Full time

    Job DescriptionASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry...

  • Technical Manager I

    3 weeks ago


    Greenbelt, United States ASRC Federal Holding Company Full time

    Job Description ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of...

  • Technical Manager I

    6 days ago


    Greenbelt, United States ASRC Federal Holding Company Full time

    Job Description ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of...