Senior HPC Linux Systems Engineer
2 weeks ago
Requisition Id 15559 Overview Oak Ridge National Laboratory (ORNL) is seeking a Senior HPC Linux Systems Engineer to serve as a technical leader supporting some of the most advanced computing environments in the world. This evergreen posting represents multiple potential openings for senior-level roles across ORNL’s high-performance computing ecosystem. Senior HPC Linux Systems Engineers are recognized experts who lead the design, implementation, and optimization of complex HPC infrastructure. They manage large-scale technical projects, guide technical direction for their teams, and serve as trusted advisors to scientific and operational leadership across ORNL. Major Duties and Responsibilities Provide technical leadership in the design, integration, and administration of large-scale Linux-based HPC clusters, high-speed networks, and storage systems. Lead medium to large technical projects, coordinating requirements, schedules, and deliverables across internal and external stakeholders. Architect and deploy advanced infrastructure solutions supporting exascale-class and mission-critical computing environments. Serve as a technical mentor for HPC engineers, guiding best practices in automation, performance tuning, and system security. Develop, implement, and maintain configuration management and automation frameworks (e.g., Ansible, Puppet, Salt) to enhance reliability and reproducibility. Perform advanced system performance analysis, troubleshooting, and optimization, ensuring system scalability and long-term sustainability. Manage critical vendor and partner relationships, representing ORNL’s technical requirements during procurement, integration, and system acceptance. Contribute to strategic planning and technology roadmaps, influencing unit goals and technical direction. Collaborate closely with scientists, researchers, and IT specialists to align infrastructure capabilities with research and security objectives. Ensure compliance with DOE cybersecurity standards, configuration baselines, and operational policies. Author technical documentation, present internal briefings, and communicate complex issues and resolutions to management and stakeholders. Participate in on-call rotations, maintenance windows, and incident response as needed to support 24x7 operations. Basic Qualifications Bachelor’s degree in computer science, engineering, or a related technical field. A minimum of 8 years of relevant experience in Linux systems administration or HPC systems engineering. Preferred Qualifications Demonstrated experience leading the design and deployment of HPC or large-scale distributed computing systems. Expertise with batch schedulers (SLURM, PBS, LSF) and parallel file systems (Lustre, GPFS/Spectrum Scale). Proven ability to lead technical projects from concept through implementation, balancing technical depth with project delivery. Strong proficiency in automation and infrastructure-as-code frameworks (Ansible, Puppet, Salt). Advanced scripting or programming skills (Python, Bash, Go) for automation and operational tooling. In-depth understanding of high-speed interconnects (InfiniBand, Slingshot, Ethernet) and storage architectures. Experience managing identity and access management systems, including MFA, SSO, and zero-trust frameworks (PingFederate, RSA SecureID, Entra ID). Experience integrating virtualization or containerization solutions (VMware, KVM, Apptainer, Podman) into HPC environments. Ability to manage client and stakeholder relationships across multiple directorates and technical disciplines. Excellent written and verbal communication skills, including the ability to present complex technical concepts to diverse audiences. Proven ability to influence technical strategy and mentor staff in a collaborative research environment. Special Requirement This position requires the ability to obtain and maintain clearance from the Department of Energy. As such, this position is a Workplace Substance Abuse (WSAP) testing designated position. WSAP positions require passing a pre-placement drug test and participation in an ongoing random drug testing program. About ORNL As a U.S. Department of Energy (DOE) Office of Science national laboratory, ORNL has an impressive 80-year legacy of addressing the nation’s most pressing challenges. Our team is made up of over 7,000 dedicated and innovative individuals Our goal is to create an environment where a variety of perspectives and backgrounds are valued, ensuring ORNL is known as a top choice for employment. These principles are essential for supporting our broader mission to drive scientific breakthroughs and translate them into solutions for energy, environmental, and security challenges facing the nation. Why Join Us Work on the world’s most powerful supercomputers, including Frontier, the first system to achieve exascale performance. Enable breakthrough science in fields like fusion energy, climate modeling, AI, and national security. Collaborate with diverse teams of scientists, engineers, and technologists from across the DOE complex and academia. Grow your career in a mission-driven, innovation-focused environment with access to professional development and leadership opportunities. Enjoy life in East Tennessee, with a thriving research community, scenic outdoor recreation, and a high quality of life. This position will remain open for a minimum of 5 days after which it will close when a qualified candidate is identified and/or hired. We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment. ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.
-
HPC Linux Systems Engineer
5 days ago
Oak Ridge, United States Oak Ridge National Laboratory Full timeRequisition Id 15558 Overview Oak Ridge National Laboratory (ORNL) is seeking highly motivated HPC Linux Systems Engineers to join teams operating some of the most advanced computing environments in the world.This evergreen posting represents multiple potential openings across ORNL’s high-performance computing ecosystem.Successful candidates will help...
-
HPC Linux Systems Engineer
1 week ago
Oak Ridge, United States Oak Ridge National Laboratory Full timeRequisition Id 15558 Overview Oak Ridge National Laboratory (ORNL) is seeking highly motivated HPC Linux Systems Engineers to join teams operating some of the most advanced computing environments in the world. This evergreen posting represents multiple potential openings across ORNL’s high-performance computing ecosystem. Successful candidates will help...
-
HPC Linux Systems Engineer
2 weeks ago
Oak Ridge, TN, United States Oak Ridge National Laboratory Full timeRequisition Id 15558 Overview Oak Ridge National Laboratory (ORNL) is seeking highly motivated HPC Linux Systems Engineers to join teams operating some of the most advanced computing environments in the world. This evergreen posting represents multiple potential openings across ORNL's high-performance computing ecosystem. Successful candidates will help...
-
HPC Linux Systems Engineer
4 days ago
Oak Ridge, TN, United States Oak Ridge National Laboratory Full timeRequisition Id 15558 Overview Oak Ridge National Laboratory (ORNL) is seeking highly motivated HPC Linux Systems Engineers to join teams operating some of the most advanced computing environments in the world. This evergreen posting represents multiple potential openings across ORNL's high-performance computing ecosystem. Successful candidates will help...
-
Senior HPC Linux Systems Engineer
2 days ago
Oak Ridge, Tennessee, United States Oak Ridge National Laboratory Full time $120,000 - $180,000 per yearRequisition Id 15310Overview:The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts several of the world's most powerful computer systems, is seeking a highly qualified individual to play a key role in improving the security, performance, and reliability of the NCCS computing environments. This includes supporting...
-
Senior Linux HPC Systems Engineer
2 weeks ago
Oak Ridge, Tennessee, United States ITR Full time $120,000 - $180,000 per yearMust be able to work a hybrid work schedule in Oak Ridge, TNMust be eligible to obtain a federal security clearance (US Citizen)Major Duties/Responsibilities:1. Advocate and promote HPC and clustered computing services to researchers who process large data sets and/or develop code as a part of their project.Ensure the availability, performance, scalability,...
-
Senior Linux HPC Storage Engineer
4 weeks ago
Oak Ridge, United States Oak Ridge National Laboratory Full timeWe are hiring a Senior Linux HPC Storage Engineer to design, operate and maintain clusters, servers, and workstations storage supporting services where science happens at ORNL! This position resides in the Emerging Technologies & Computing team in the Research Computing group in the Information Technology Services Directorate at Oak Ridge National Laboratory...
-
Senior HPC Storage Systems Engineer
3 days ago
Oak Ridge, United States Xcel Engineering Full timeCOMPANY OVERVIEWXCEL Engineering, Inc. is an award-winning small business that provides trusted information technology, engineering, consulting and project management solutions and services to federal agencies and organizations. Originally founded in 1971 by professional engineers at the University of Tennessee, XCEL was acquired in 2003 by U.S. Army and...
-
Lead Linux HPC Systems Engineer, 8+ Yrs Exp
2 days ago
Wheat Ridge, United States Aspen Systems Inc Full timeHigh Performance Computing (HPC) Linux Engineering - In-House Only - DenverCandidates should take the time to read all the elements of this job advert carefully Please make your application promptly.We're seeking an experienced Sr. HPC Linux Engineer to join our talented engineering team in Denver, Colorado. Our best-in-class engineering team thrives on...
-
Lead Linux HPC Systems Engineer, 8+ Yrs Exp
2 days ago
Wheat Ridge, CO, United States Aspen Systems Inc Full timeHigh Performance Computing (HPC) Linux Engineering - In-House Only - Denver Candidates should take the time to read all the elements of this job advert carefully Please make your application promptly. HPC Linux Engineer to join our talented engineering team in Denver, Colorado. Our best-in-class engineering team thrives on staying up-to-date with the...