HPC Linux Engineer

1 week ago


Oak Park, Illinois, United States Xcel Engineering Full time

COMPANY OVERVIEW

XCEL Engineering, Inc. is an award-winning small business that provides trusted information technology, engineering, consulting and project management solutions and services to federal agencies and organizations. Originally founded in 1971 by professional engineers at the University of Tennessee, XCEL was acquired in 2003 by U.S. Army and Navy veterans and in 2023 became a MartinFed company.

XCEL Engineering is a part of IT Lab Partners (ITLP) which was created to support a leading research facility in the East Tennessee region in recruiting the best and the brightest technical talent. Considering joining our impressive team today

This position requires working onsite in Oak Ridge, TN.

JOB OVERVIEW

XCEL Engineering has an opening for a HPC Linux Engineer to join our team of talented and diverse individuals. The team is responsible for facilitating R&D projects. They provide design, deployment, optimization, monitoring, and tooling support across multiple clustered infrastructures for the research organizations that they support. They support clusters that range in scope from just a handful of nodes to 40,000+ cores.

This role will advocate and promote HPC and clustered computing services to researchers who process large data sets and/or develop code as a part of their project. You will be responsible for the availability, performance, scalability, and security of production systems. You should have a strong desire to push the envelope and identify new technologies and opportunities and be able to communicate the potential benefits of those choices to others within the team and our research partners. The team heavily utilizes automation and monitoring solutions to minimize day-to-day maintenance and you should always be looking for opportunities to optimize system management practices or system performance. As the primary SMEs for these systems, you will work with technical POCs for the programs supported to install and help tune the performance of various scientific toolsets. You should be a collaborative and energetic team member who thrives on the opportunity to build trust and credibility, and ultimately become a trusted advisor to the research teams.

The group optimizes workflows and monitoring solutions to take advantage of the 24/7 operations staff, which significantly reduces the need for off-hours support. They also offer a flexible work schedule and utilize Email, Jira, Confluence, Teams, Slack, and other collaboration solutions to stay in contact.

BASIC QUALIFICATIONS

  • Bachelor's degree in Computer Science (or related field) or combined work experience plus education of 5 years.
  • A minimum of 3 years of experience managing UNIX/Linux Systems.
  • A minimum of 2 years utilizing configuration management and automation tools such as Git, Jenkins, Ansible, or Puppet.
  • Moderate fluency in at least one scripting language such as Bash, Python, or equivalent.
  • Experience performing advanced troubleshooting and system administration with Linux Servers.
  • Experience supporting large data systems
  • Demonstrated capabilities to work in a dynamic environment.
  • Preferably candidates will have an active DOE Q or DOE Top Secret Clearance, which requires US Citizenship. Candidates that have held a Q or Top Secret in the past 7 years will be considered.

DESIRED QUALIFICATIONS

  • 7+ years of experience managing UNIX/Linux Systems.
  • Strong knowledge of multiple operating systems.
  • Experience with Centos/RHEL 7+, Centos/RHEL 8+, Ubuntu 18+
  • Understanding of HPC platforms to support users with job submissions and troubleshooting.
  • Experience managing systems utilizing GPU/Cuda clusters for AI/ML and/or image processing.
  • Knowledge of networking fundamentals including TCP/IP, traffic analysis, common protocols, and network diagnostics.
  • Experience with performance and diagnostic tools for benchmarking, analysis and tuning of systems, networking, and storage.
  • Experience with Grafana, CheckMK, Nagios, Zabbix, SolarWinds, Ganglia, or other network and device monitoring systems.
  • Previous experience working in a government, scientific or other highly technical environment.
  • Good documentation skills, including ability to prepare simple documentation web pages.

PHYSICAL REQUIREMENTS & ENVIRONMENTAL CONDITIONS

  • Inside office environment.
  • Working on a computer for long periods of time.
  • May involve long period of sitting at a desk.
  • The work environment is fast-paced and sometimes involves extreme deadline pressures.

OTHER DUTIES

This job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change at any time with or without notice.


Xcel Engineering is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regards to race, color, religion, religious creed, gender, sexual orientation, gender identity, gender expression, transgender, pregnancy, marital status, national origin, ancestry, citizenship status, age, disability, protected Veteran Status, genetics or any other characteristics protected by applicable federal, state or local law.

If you are a qualified individual with a disability or disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access Xcel Engineering's current openings as a result of your disability. You can request reasonable accommodations by calling Thank you for your interest in Xcel Engineering.



All positions at Xcel Engineering, Inc. are contingent upon passing both a background check and drug screening prior to a start date and are subject to random drug screenings during the employment period. In addition, Xcel Engineering is an E-Verify employer.

Job Posted by ApplicantPro

  • Oak Park, Illinois, United States ITR Full time

    Job DescriptionJob Description­Position TitleLinux Systems Management EngineerPurposeEast Tennessee client Information Technology Services Division invites applications for the position of Linux Systems Management Engineer in the Client Technologies Team. This position acts as a platform owner for Linux and works to ensure user satisfaction among the Linux...


  • Oak Park, Illinois, United States Oak Ridge National Laboratory Full time

    Requisition Id 13181 Overview: As a U.S. Department of Energy (DOE) Office of Science national laboratory, ORNL has an extraordinary 80-year history of solving the nation's biggest problems. We have a dedicated and creative staff of over 6,000 people Our vision for diversity, equity, inclusion, and accessibility (DEIA) is to cultivate an environment and...


  • Tinley Park, Illinois, United States Randstad USA Full time

    job summary: Your responsibilities as a Vision Software Engineer include: Writing software to solve complex technology interoperability problemsAssisting with design concepts of new products and applicationsTesting the designs of equipment, products, and systemsEvaluating and reporting on new vendor products and prototypesDeveloping, setting up, and testing...


  • Tinley Park, Illinois, United States Randstad USA Full time

    job summary: Your responsibilities as a Vision Software Engineer include: Writing software to solve complex technology interoperability problemsAssisting with design concepts of new products and applicationsTesting the designs of equipment, products, and systemsEvaluating and reporting on new vendor products and prototypesDeveloping, setting up, and testing...


  • Oak Brook, Illinois, United States CNH Industrial Full time

    Senior Embedded Software Engineer Location US-IL-Oak Brook IDCategory Engineering Position Type Full-time Overview CNH Industrial is a world-class equipment and services company dedicated to advancing the noble work of agriculture and construction workers. Driven by our shared purpose of Breaking New Ground, we are passionate about bringing Innovation,...


  • Tinley Park, Illinois, United States W. H. Leary Company Inc Full time

    Vision Software Engineer at W. H. LearyWe are excited to get in touch with you!W. H. Leary is looking forward to having a chat with you to learn more about your dream job and work environment. We invite you to consider becoming a full-time Vision Software Engineer with us. In this role, you will be the project leader and technical guru responsible for...


  • Oak Park, Illinois, United States Oak Ridge National Laboratory Full time

    Requisition Id 13104 Overview:As a U.S. Department of Energy (DOE) Office of Science national laboratory, Oak Ridge National Laboratory (ORNL) has an extraordinary 80-year history of solving the nation's biggest problems. We have a dedicated and creative staff of over 6,000 people Our vision for diversity, equity, inclusion, and accessibility (DEIA) is to...


  • Oak Park, Illinois, United States Oak Ridge National Laboratory Full time

    Requisition Id 13104 Overview:As a U.S. Department of Energy (DOE) Office of Science national laboratory, Oak Ridge National Laboratory (ORNL) has an extraordinary 80-year history of solving the nation's biggest problems. We have a dedicated and creative staff of over 6,000 people Our vision for diversity, equity, inclusion, and accessibility (DEIA) is to...


  • Villa Park, Illinois, United States Insperity Full time

    Senior Software Engineer Are you a passionate Senior Software Engineer looking to make a significant impact in the world of supply chain execution? Do you thrive in an innovative environment where your skills can shape the future of logistics? Look no further Our client is a leading provider of supply chain execution software solutions. Their innovative...


  • Oak Brook, Illinois, United States Inspira Financial Full time

    Take the next step in your journey at Inspira Financial. You will help businesses and individuals thrive today, tomorrow, and into retirement. Become part of a company that is people centric and client obsessed in every interaction; a community of forward-thinking individuals focused on driving results to deliver our mission with an unwavering commitment to...


  • Oak Brook, Illinois, United States Inspira Financial Full time

    Take the next step in your journey at Inspira Financial. You will help businesses and individuals thrive today, tomorrow, and into retirement. Become part of a company that is people centric and client obsessed in every interaction; a community of forward-thinking individuals focused on driving results to deliver our mission with an unwavering commitment to...

  • HPC Linux Engineer

    4 weeks ago


    Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionHPC Linux Systems EngineerAn East Tennessee DOE Research and Development facility, which hosts several of the world’s most powerful computer systems, is seeking highly qualified individuals to play a key role in improving the security, performance, and reliability of the computing infrastructure. This includes supporting the...


  • Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionHPC Linux Engineer - Cleared Must have an active DOE Q Clearance or active DOD Top Secret Clearance that can be convertedMust be able to work onsite in Oak Ridge, TNThe TeamThe team is responsible for facilitating R&D projects. They provide design, deployment, optimization, monitoring, and tooling support across multiple...


  • Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionHPC Linux Engineer - Cleared Must have an active DOE Q Clearance or active DOD Top Secret Clearance that can be convertedMust be able to work onsite in Oak Ridge, TNThe TeamThe team is responsible for facilitating R&D projects. They provide design, deployment, optimization, monitoring, and tooling support across multiple...


  • Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionHPC Linux Engineer - Cleared Must have an active DOE Q Clearance or active DOD Top Secret Clearance that can be convertedMust be able to work onsite in Oak Ridge, TNThe TeamThe team is responsible for facilitating R&D projects. They provide design, deployment, optimization, monitoring, and tooling support across multiple...


  • Oak Ridge, United States Oak Ridge National Laboratory Full time

    Press Tab to Move to Skip to Content Link Select how often (in days) to receive an alert: Select how often (in days) to receive an alert: The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts several of the world’s most powerful computer systems, is seeking highly qualified individuals to play a key role in...


  • Oak Ridge, Tennessee, United States Oak Ridge National Laboratory Full time

    Requisition Id13074 Visa sponsorship of any kind is unavailable for this position (H1B, F1, OPT, H4, J1, etc.). Overview: We are seeking a Linux Systems Engineer who will play a crucial role in managing and supporting Linux swerver and large storage systems Areas of focus include maintaining system health, analyzing system performance, and ensuring all...


  • Menlo Park, United States GenomeWeb Full time

    Senior Software Engineer, C++ / HPC System Job Description At Pacific Biosciences, our R&D team is committed to developing innovative products that enable scientists to excel in a wide variety of life science research fields, including human biomedical, plant and animal sciences, and microbiology and infectious disease. Our unique Single Molecule, Real-Time...


  • Menlo Park, United States GenomeWeb Full time

    Senior Software Engineer, C++ / HPC System Job Description At Pacific Biosciences, our R&D team is committed to developing innovative products that enable scientists to excel in a wide variety of life science research fields, including human biomedical, plant and animal sciences, and microbiology and infectious disease. Our unique Single Molecule, Real-Time...


  • Menlo Park, United States Meta Inc Full time

    Summary: Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI Infrastructure related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist on the web. Some aspects of this role as...


  • Oak Ridge, United States Oak Ridge National Laboratory Full time

    Requisition Id11976 Overview: Are you looking for a way to use your hard-earned SRE skills in a more ambitious environment where you can also help protect national security? The National Center for Computational Sciences (NCCS) at Oak Ridge National Lab (ORNL), which hosts several of the world's most powerful computer systems, is seeking highly qualified...

  • Linux Engineer

    2 months ago


    Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionJob ActivitiesThere are various technical tasks that will need to be performed in this role. Some of these tasks can include:Troubleshoot various Linux server related issues.Provide Oracle ZFS system support.Install and configure Red Hat Linux Server systems.Build out both physical and virtual systems.Engineer and provide...

  • Linux Engineer

    1 week ago


    Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionJob ActivitiesThere are various technical tasks that will need to be performed in this role. Some of these tasks can include:Troubleshoot various Linux server related issues.Provide Oracle ZFS system support.Install and configure Red Hat Linux Server systems.Build out both physical and virtual systems.Engineer and provide...

  • Linux Engineer

    3 weeks ago


    Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionJob ActivitiesThere are various technical tasks that will need to be performed in this role. Some of these tasks can include:Troubleshoot various Linux server related issues.Provide Oracle ZFS system support.Install and configure Red Hat Linux Server systems.Build out both physical and virtual systems.Engineer and provide...


  • Oak Creek, United States ASTRONAUTICS CORPAMERICA Full time

    JOB REQUIREMENTS: Tracking Code 2017980 Job Description What You Will Do: We are seeking a Linux Software Engineer to support the development of new products and maintain existing products in the AeroSync product line. In this role, you will join an Agile software team that designs Linux applications for avionics communications products.You will participate...


  • Oak Creek, United States ASTRONAUTICS CORPAMERICA Full time

    JOB REQUIREMENTS: Tracking Code 2017980 Job Description What You Will Do: We are seeking a Linux Software Engineer to support the development of new products and maintain existing products in the AeroSync product line. In this role, you will join an Agile software team that designs Linux applications for avionics communications products.You will participate...

  • Linux Systems Engineer

    2 months ago


    Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionLinux Systems EngineerSpecial Requirements:Visa Sponsorship: Visa sponsorship is not available for this position.Q clearance: This position requires the ability to obtain and maintain a clearance from the Department of Energy. As such, this position is a Workplace Substance Abuse (WSAP) testing designated position. WSAP...


  • Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionInfrastructure Linux EngineerEast Tennessee company is seeking a remote qualified applicants for an Infrastructure Linux Engineer on the Enterprise Infrastructure Services Team. This team exists to provide compute and storage infrastructure for the enterprise operations of the company. This position will assist in the...


  • Oak Ridge, United States ITR Full time

    Job DescriptionJob DescriptionLinux Systems Management EngineerPurposeEast Tennessee company is seeking applications for the position of Linux Systems Management Engineer in the Client Technologies Team. This position acts as a platform owner for Linux and works to ensure user satisfaction among the Linux community. Responsible for the full life cycle of...


  • Oak Ridge, United States ITR Full time

    Job DescriptionJob Description­Position TitleLinux Systems Management EngineerPurposeEast Tennessee company invites applications for the position of Linux Systems Management Engineer in the Client Technologies Team. This position acts as a platform owner for Linux and works to ensure user satisfaction among the Linux community across client site....


  • Oak Ridge, United States ITR Full time

    Job DescriptionJob Description­Position TitleLinux Systems Management EngineerPurposeEast Tennessee Research facility invites applications for the position of Linux Systems Management Engineer in the Client Technologies Team. This position acts as a platform owner for Linux and works to ensure user satisfaction among the Linux community across client site....