HPC Linux System Administrator
3 weeks ago
ASRC Federal InuTeq is seeking a Linux System Administrator (HPC) to join our team in support of NASA's Center for Climate Simulation (NCCS) project. ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry best practices.
- Configures, installs, maintains, and upgrades Linux HPC clusters (compute, storage, and network) and applications in support of research computing environments.
- Provides end-user support for problem resolution, and training on Linux and HPC usage best practices.
- Diagnoses, isolates, and resolves application and system technical problems.
- Develops scripts and automation to enhance operational services and service quality.
- Develops, implements, and documents system architectures, new capabilities, and operational standards.
- Supports compute, storage, and network technology evaluations and assessments.
- Recommends and implements improvements to existing HPC system management tools and processes.
- Provides technical expertise to improve HPC cluster performance and resiliency.
- Leads and collaborates on projects to enhance functionality in areas such as systems monitoring, configuration management, and backups.
Requirements
- Bachelor's degree (B.A./B.S.) in Computer Science, Engineering, Physics, or a related course of study, or an equivalent combination of education and relevant experience
- Minimum of 8 years of Linux System Administration experience
- Experience managing HPC clusters
- Extensive knowledge of troubleshooting both hardware and software, with the ability to come on site and replace hardware if needed
- Experience managing storage servers/hardware
- Knowledge of at least one of CentOS or Red Hat, and experience maintaining and upgrading Linux.
- Experience with the use of configuration management and orchestration tools such as Puppet, Ansible, Chef, Cobbler.
- Experience with system management, monitoring/alerting tools (e.g., Ganglia, Nagios, Prometheus, Zabbix).
- Understanding of infrastructure technologies including server, storage, network, database, and virtualization.
- Demonstrated ability to quantify, analyze, determine root cause, and resolve system and communication network issues, and develop preventive actions.
- Ability to work independently as well as collaboratively within a team, to include the ability to lead moderately complex projects or small project teams.
- Excellent written and oral communication skills for interacting with customers, team members, and management.
- Proactive and innovative, with ability to foresee and prevent potential problems.
- Organizational and time management skills, exceptional follow-through, and ability to manage multiple priorities.
- Passion for providing excellent customer service.
- Experience providing support for large Linux HPC clusters used for scientific computing.
- Scripting/programming capabilities with Bash, Python, Perl.
- Demonstrated ability to execute and maintain a standard security protocol
- Willing to track tasks with persistent record keeping and project management
- US citizenship is required, along with the ability to obtain a Public Trust clearance
- Experience integrating systems or designing solutions for HPC workloads.
- Experience with MPI and OpenMP.
- Experience with performance benchmarking using profilers and debuggers to recommend code improvements for scalability and performance.
- Experience with distributed and parallel file systems such as BeeGFS, GPFS, Lustre, NFS, Ceph.
- Familiarity with high-performance networks such as InfiniBand, and with network management.
- Demonstrated ability to perform complex performance analysis including system processes, I/O subsystems, networks and other related components.
- Experience installing, configuring, and maintaining workload management tools (such as Slurm, LSF, PBS, etc.).
- Interest or previous experience in technologies including but not limited to Singularity, Docker, Spack, and other emerging technologies.
EEO Statement
ASRC Federal and its Subsidiaries are Equal Opportunity / Affirmative Action employers. All qualified applicants will receive consideration for employment without regard to race, gender, color, age, sexual orientation, gender identification, national origin, religion, marital status, ancestry, citizenship, disability, protected veteran status, or any other factor prohibited by applicable law.
-
HPC Linux System Administrator
7 days ago
Greenbelt, United States | ASRC Federal Holding Company | Full time
ASRC Federal is searching for an HPC Linux Systems Administrator to support InuTeq LLC out of Greenbelt, MD. ASRC Federal InuTeq is seeking a Linux System Administrator (HPC) to join our team in support of NASA's Center for Climate Simulation (NCCS) project. ASRC Federal InuTeq provides High Performance Computing services throughout the HPC...
-
Linux System Administrator
3 weeks ago
Greenbelt, United States | Cornerstone Defense | Full time
Location: Greenbelt, Maryland. Type: Contract. Job #2746. Title: Linux System Administrator. Clearance: Active TS/SCI w/ Polygraph needed to apply. Company Overview: Cornerstone Defense is the Employer of Choice within the Intelligence, Defense, and Space communities of the U.S. Government. Realizing early on that our...
-
Senior System Administrator
3 weeks ago
Greenbelt, Maryland, United States | GAMA-1 Technologies | Full time
Provide support for Linux systems running on Red Hat Enterprise Linux. Activities include troubleshooting, patching, backing up and restoring VMs, deploying new VMs, and working with application teams on middleware support. Lead a team of junior admins in resolving system issues. Task Description: Linux administrative activities include: Maintaining a stable...
-
System Administrator II
1 week ago
Greenbelt, United States | Vibrint | Full time
Vibrint is a trusted provider of mission-critical systems and analysis that transform our customers' capacity and capability in harvesting and harnessing data. Working alongside many of the most talented professionals in public service, we work tirelessly to create and sustain new solutions and services that meet the stringent demands across a variety of...
-
IT System Administrator, NASA Support
1 week ago
Greenbelt, United States | Pearl River Technologies LLC | Full time
The IT System Administrator is responsible for helping employ standards, methodologies, and technical solutions for a team maintaining Windows, Linux, VMware, on-prem, and AWS environments, and a strong security posture. This position supports a group of flight dynamics engineers in the GSFC Flight Dynamics Facility (FDF). The FDF is a NASA Mission Essential...
-
IT System Administrator, NASA Support
7 days ago
Greenbelt, United States | Pearl River Technologies LLC | Full time
Job Location: Greenbelt, MD (Hybrid). Description: The IT System Administrator is responsible for helping employ standards, methodologies, and technical solutions for a team maintaining Windows, Linux, VMware, on-prem, and AWS environments, and a strong security posture. This position supports a group of flight dynamics engineers in the GSFC Flight Dynamics...
-
IT System Administrator, NASA Support
6 days ago
Greenbelt, United States | Pearl River Technologies | Full time
Job Location: Greenbelt, MD (Hybrid). Description: The IT System Administrator is responsible for helping employ standards, methodologies, and technical solutions for a team maintaining Windows, Linux, VMware, on-prem, and AWS environments, and a strong security posture. This position supports a group of flight dynamics...
-
System Engineer III
1 week ago
Greenbelt, United States | Vibrint | Full time
Vibrint is a trusted provider of mission-critical systems and analysis that transform our customers' capacity and capability in harvesting and harnessing data. Working alongside many of the most talented professionals in public service, we work tirelessly to create and sustain new solutions and services that meet the stringent demands across a variety of...
-
System Engineer III
58 minutes ago
Greenbelt, United States | Vibrint | Full time
Vibrint is a trusted provider of mission-critical systems and analysis that transform our customers' capacity and capability in harvesting and harnessing data. Working alongside many of the most talented professionals in public service, we work tirelessly to create and sustain new solutions and services that meet the stringent demands across a variety of...
-
IT016 Systems Engineer
4 weeks ago
Greenbelt, United States | Adnet Systems | Full time
The Computational and Information Sciences and Technology Office (CISTO) at the NASA Goddard Space Flight Center (GSFC) provides high-end information technology systems and services for science, including assessment and evaluation of new information technologies, hardware, and software and manages the NASA Center for Climate...
-
IT016 Systems Engineer
2 weeks ago
Greenbelt, United States | Adnet Systems | Full time
The Computational and Information Sciences and Technology Office (CISTO) at the NASA Goddard Space Flight Center (GSFC) provides high-end information technology systems and services for science, including assessment and evaluation of new information technologies, hardware, and software and manages the NASA Center for Climate Simulation...
-
Linux Middleware Engineer
6 days ago
Greenbelt, United States | Halvik | Full time
Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA...
-
Linux Middleware Engineer
7 days ago
Greenbelt, United States | Halvik | Full time
Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA SEWP, you will be a key member...
-
Linux Middleware Engineer
1 day ago
Greenbelt, United States | Halvik | Full time
Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA...
-
Linux Middleware Engineer
3 days ago
Greenbelt, United States | Halvik | Full time
Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA SEWP, you will be a key member...
-
Linux Middleware Engineer
1 week ago
Greenbelt, United States | Halvik | Full time
Halvik is a highly successful company that puts people first, and we are looking for someone just like you. We are committed to delivering smarter IT-driven solutions bolstered by quality and innovation to help our customers succeed. Come be a part of something truly special! SUMMARY: As a Systems Middleware Engineer with NASA SEWP, you will be a key member...
-
Technical Manager I
3 weeks ago
Greenbelt, United States | ASRC Federal Holding Company | Full time
ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry...
-
Technical Manager I
5 days ago
Greenbelt, United States | ASRC Federal Holding Company | Full time
ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry...
-
Technical Manager I
3 weeks ago
Greenbelt, United States | ASRC Federal Holding Company | Full time
ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of...
-
Technical Manager I
6 days ago
Greenbelt, United States | ASRC Federal Holding Company | Full time
ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle for computational requirements, architecture, acquisition, and operations to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of...