Linux HPC Systems Engineer

7 days ago


Oak Park, Illinois, United States Oak Ridge National Laboratory Full time

Job Summary:

We are seeking a highly skilled Linux HPC Systems Engineer to join our team at Oak Ridge National Laboratory. As a key member of our Emerging Technologies & Computing Group, you will be responsible for designing, operating, and maintaining high-performance computing clusters, servers, and workstations that support scientific research and innovation.

Key Responsibilities:

  • Design and implement high-performance computing systems, including clusters, servers, and workstations, to support scientific research and innovation.
  • Collaborate with research organizations to identify and implement the best solutions for their high-performance computing needs.
  • Ensure the availability, performance, scalability, and security of production systems.
  • Leverage automation and monitoring solutions to minimize day-to-day maintenance and optimize system management practices.
  • Collaborate with technical POCs to install and tune the performance of various scientific toolsets.

Requirements:

  • Bachelor's degree in Computer Science, Computer Engineering, Information Technology, Science, Engineering, Business, or a related field of study.
  • 2-4 years of experience in high-performance computing, system administration, and automation.
  • 1+ year of experience managing UNIX/Linux systems.
  • 1+ year of experience utilizing configuration management and automation tools such as Git, Jenkins, Ansible, or Puppet.
  • Some proficiency in at least one scripting language such as Bash, Python, or equivalent.
  • Experience performing troubleshooting and system administration with Linux Servers.
  • Experience supporting large data systems.

Preferred Qualifications:

  • Understanding of multiple operating systems and cluster technologies.
  • Experience with Centos/RHEL, Ubuntu, VMware.
  • Understanding of HPC platforms to support users with SLURM job submissions and troubleshooting.
  • Experience building and running containerized applications in an HPC environment.
  • Experience with multiple deployment mechanisms like Diskless, Warewulf, and traditional deployment (Cobbler, PXEboot, and/or Bright).
  • Experience managing systems utilizing GPU (NVIDIA and AMD) clusters for AI/ML and/or image processing.
  • Knowledge of networking fundamentals including TCP/IP, traffic analysis, common protocols, and network diagnostics.
  • Experience with Infiniband networks and diagnostics.
  • Extensive experience with High Performance Parallel File Systems (Lustre, WEKA, GPFS, etc).
  • Experience with performance and diagnostic tools for benchmarking, analysis, and tuning of systems, networking, and storage.
  • Experience with Grafana, CheckMK, Nagios, Zabbix, SolarWinds, Ganglia, or other network and device monitoring systems.
  • Previous experience working in a government, scientific, or other highly technical environment.
  • Good documentation skills, including ability to prepare simple documentation web pages.

Special Requirements:

  • Visa sponsorship is not available for this position.
  • This position requires the ability to obtain and maintain a clearance from the Department of Energy.


  • Oak Park, Illinois, United States Oak Ridge National Laboratory Full time

    Job Summary:Oak Ridge National Laboratory is seeking a highly skilled Linux HPC Systems Engineer to join our team in the Emerging Technologies & Computing Group. As a key member of our team, you will be responsible for designing, operating, and maintaining clusters, servers, and workstations that support scientific research at ORNL.Key...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewHPC Linux Systems EngineerITR is in search of exceptional candidates to enhance the security, efficiency, and dependability of our advanced computing systems. This position plays a crucial role in supporting one of the leading supercomputers globally. As an HPC Linux Systems Engineer, you will be integrated into the Infrastructure team within the...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewHPC Linux Systems EngineerA leading research and development organization is in search of exceptional candidates to enhance the security, efficiency, and dependability of its computing systems. This position involves supporting one of the most advanced supercomputers globally. As an HPC Linux Systems Engineer, you will join the Infrastructure...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewHPC Linux Systems EngineerITR is in search of highly skilled professionals to enhance the security, efficiency, and dependability of our computational infrastructure. This position involves working with one of the leading supercomputers globally. As an HPC Linux Systems Engineer, you will be an integral part of the Infrastructure team within the...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewThe HPC Linux Systems Engineer position requires an individual with an active DOE Q Clearance or a DOD Top Secret Clearance that is convertible. This role necessitates onsite presence.Team ResponsibilitiesThe team is dedicated to supporting research and development initiatives. They are tasked with the design, deployment, optimization,...


  • Oak Park, Illinois, United States ITR Full time

    Position OverviewThe role of a Senior Linux Systems Engineer involves a variety of technical responsibilities that are crucial for maintaining the integrity and performance of our systems.Key ResponsibilitiesDiagnose and resolve issues related to Linux servers.Support Oracle ZFS systems effectively.Set up and configure Red Hat Linux Server...


  • Oak Park, Illinois, United States ITR Full time

    Job SummaryWe are seeking a highly skilled Linux Systems Engineer to join our team at ITR. The successful candidate will be responsible for facilitating R&D projects, providing Linux Systems deployment, automation, monitoring, and management for researchers.Key ResponsibilitiesDeploy, monitor, and manage research projects on Linux Systems.Ensure the...


  • Oak Park, Illinois, United States ITR Full time

    Job SummaryWe are seeking a highly skilled Linux Systems Management Engineer to join our team at ITR. As a key member of our Client Technologies Team, you will be responsible for the full life cycle of Linux systems, including deployment, management, and user documentation.Key ResponsibilitiesStreamline Processes: Improve the lifecycle of computer systems,...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewPosition: Linux Systems EngineerSpecial Requirements:Visa Sponsorship not availableQ clearance requiredNo Corp to CorpPosition Summary:We are seeking a skilled Linux Systems Engineer to play a pivotal role in our Systems Engineering division at ITR. This position is integral to our research and development initiatives, focusing on the management...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewPosition: Linux Systems EngineerSpecial Requirements:Visa Sponsorship not availableQ clearance requiredNo Corp to CorpPosition Summary:We are seeking a skilled Linux Systems Engineer to contribute to our R&D initiatives at ITR. This role is pivotal within our Systems Engineering division, which is committed to facilitating research activities...


  • Oak Park, Illinois, United States ITR Full time

    Linux Systems Management EngineerITR is in search of a skilled professional to enhance their Client Technologies Team as a Linux Systems Management Engineer. This role is pivotal in ensuring satisfaction among users within the Linux ecosystem across the organization. The selected candidate will oversee the entire lifecycle of devices, from initial deployment...


  • Oak Park, Illinois, United States ITR Full time

    Linux Systems Management EngineerITR is in search of a skilled professional to become a part of their Client Technologies Team as a Linux Systems Management Engineer. This role is vital in maintaining user satisfaction within the Linux ecosystem across the organization. The selected candidate will oversee the complete lifecycle of devices, from initial...


  • Oak Park, Illinois, United States ITR Full time

    Linux Systems Management EngineerITR is in search of a skilled professional to become a part of their Client Technologies Team as a Linux Systems Management Engineer. This role is vital in enhancing user satisfaction within the Linux community across the organization. The selected candidate will oversee the complete lifecycle of devices, from initial...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewPosition: Linux Systems EngineerSpecial Requirements:Visa Sponsorship not availableQ clearance requiredNo Corp to CorpRole Summary:We are seeking a skilled Linux Systems Engineer to play a pivotal role in our Systems Engineering division at ITR. This position is essential for supporting research initiatives by overseeing clusters, servers,...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewPosition Title: Linux Systems Management EngineerPurpose: The East Tennessee Research facility is seeking candidates for the role of Linux Systems Management Engineer within the Client Technologies Team. This role is pivotal as it serves as the primary owner of the Linux platform, ensuring optimal user satisfaction among the Linux community...


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewPosition Title: Linux Systems Management EngineerPurpose: The East Tennessee Research facility is seeking candidates for the role of Linux Systems Management Engineer within the Client Technologies Team. This role is pivotal as it oversees the Linux platform, ensuring high levels of user satisfaction within the Linux community at the client site....


  • Oak Park, Illinois, United States ITR Full time

    Job OverviewPosition Title: Linux Systems Management EngineerPurpose: The East Tennessee Research facility is seeking candidates for the role of Linux Systems Management Engineer within the Client Technologies Team. This role serves as the primary authority for Linux systems, ensuring user satisfaction among the Linux community across the client site. This...


  • Oak Park, Illinois, United States ITR Full time

    Infrastructure Linux EngineerITR is in search of a skilled professional to fill the role of Infrastructure Linux Engineer within the Enterprise Infrastructure Services Team. This dedicated team is responsible for providing robust compute and storage solutions essential for organizational operations. The position entails engineering a private research cloud...


  • Oak Park, Illinois, United States ITR Full time

    Infrastructure Linux EngineerITR is in search of a skilled professional to fill the role of Infrastructure Linux Engineer within the Enterprise Infrastructure Services Team. This team is responsible for providing essential compute and storage infrastructure that supports company operations. The position entails engineering the private research cloud and...


  • Oak Park, Illinois, United States ITR Full time

    Infrastructure Linux EngineerITR is in search of a skilled professional to fill the role of Infrastructure Linux Engineer within our Enterprise Infrastructure Services Team. This team is responsible for delivering compute and storage solutions essential for our operational needs. The position entails engineering a private research cloud and managing various...