HPC Systems Administrator, School of Engineering

2 weeks ago


Hempstead NY United States Hofstra University Full time
Job DescriptionAbout Hofstra:

Hofstra University is nationally ranked and recognized as Long Island’s largest private university located in Hempstead, N.Y. When you work at Hofstra, you join a team of talented professionals committed to preparing students for the challenges of tomorrow, in an environment that cultivates learning through the free and open exchange of ideas for the betterment of humankind. The work we do at Hofstra supports the education and well-being of our students, and the workforce of the future. While working towards this mission, employees can take advantage of many enriching experiences on campus. Whether it’s a lunchtime lecture, a Division I NCAA athletics game, a musical concert, a theatre performance, or a visit to one of our two accredited museums, there is always something exciting to do at Hofstra. Enjoy the ease of going to the fitness center, taking a swim, or grabbing a bite to eat without having to leave our beautiful campus Hofstra University is dedicated to recruiting and retaining a highly qualified and diverse academic community of students, faculty, staff, and administrators respectful of the contributions and dignity of each of its members. We especially encourage women, people of color, members of the LGBTQ+ community, veterans, and people with disabilities to apply.

Position Title:

HPC Systems Administrator, School of Engineering

Position Number:

896422

School/Division:

School of Engineering

Full-Time or Part-Time:

Full-Time

Description:

Reporting to the National Science Foundation Principal Investigator, the HPC Systems Administrator is responsible for the management of the new ‘Star’ high-performance computing (HPC) environment at Hofstra University. Responsibilities will include managing configuration, monitoring, optimizing performance, troubleshooting, and ensuring security and high availability of the cluster, as well as providing technical assistance and supporting the research community by collaborating with researchers to understand their needs and facilitating training to support effective use of the cluster. The HPC Systems Administrator will manage the Linux environment, hardware, network infrastructure, and software components of the HPC systems. This role requires expertise in systems administration and effective communication skills in engaging with researchers and providing technical guidance to students and faculty to help them explore, assess, and pinpoint technology solutions. Daily operations will include logging changes, managing job scheduler policies, monitoring job performance, advising and communicating with clients, installing software, applying patches, auditing, testing, troubleshooting, repair, maintenance, etc. The HRC Systems Administrator must be responsive to evolving research and class needs, and able to manage or provide services that have a broad business impact. This position is on-site during normal business hours but may require periodic remote work and occasional work during non-standard hours for system maintenance and urgent issues. This is a grant-funded position for a two-year period. This position is contingent upon grant funding.

Responsibilities include, but are not limited to:

  • Manages all operational aspects of the cluster, including installing, configuring, maintaining, and administering software and hardware components.
  • Designs and plans for the implementation of future expansions and integrations.
  • Administers the scheduler and its policies to ensure efficient job scheduling and resource allocation.
  • Performs regular patching, updates, and maintenance of the cluster to ensure optimal performance and security.
  • Performs Linux administration tasks, including software installation, managing system configuration, writing/maintaining shell scripts, analyzing logs, diagnosing, and resolving issues, performance tuning, and system maintenance.
  • Deploys, administers, and monitors containerized applications.
  • Conducts management and monitoring of cluster nodes.
  • Serves as the primary contact for HPC inquiries, requests, and technical support.
  • Supports clients and their applications through consultation and project planning.
  • Communicates with researchers, faculty, and students to provide support and understand their needs.
  • Advises clients on technical design and implementation of technology solutions for classes and research.
  • Troubleshoots and resolves a variety of complex system and job execution issues.
  • Conducts system monitoring to assess system health, proactively identify potential issues, optimizing performance, and maintaining overall system integrity, stability, and availability.
  • Meets with stakeholders to review cluster usage, understand challenges, discuss policies, plan changes, and anticipate upcoming computing needs.
  • Develops and manages processes to streamline operations.
  • Implements and oversees security measures to protect data and uphold privacy standards.
  • Coordinates and maintains periodic backups of systems.
  • Maintains documentation of system configuration, changes, operating procedures, cluster components, troubleshooting instructions, and resolutions to promote knowledge transfer and teamwork.
  • Writes guides to direct users through common tasks and procedures.
  • Onboards new users, including account setup, access control, and customized user environment configurations.
  • Collaborates with partners and vendors to operate and maintain the cluster and resolve issues.
  • Provides user consultation and training to support effective use of the HPC resources and enhance research outcomes.
  • Handles periodic on-call duty as well as out-of-band requests.
  • Performs other related duties as assigned.
Qualifications:

  • Bachelor’s degree in Computer Science or related field required.
  • At least 3 years of relevant experience with Linux systems administration, including working with kernel modules.
  • Fluency in multiple programming languages, including solid skills in shell scripting.
  • Experience with Slurm administration and HPC clusters.
  • Strong problem-solving and troubleshooting skills.
  • Effective written and oral communication skills.
  • Proficiency with data center GPU management and domain-specific tools.
  • Proficiency with command line tools.
  • Ability to work independently as well as collaboratively within a team.
Preferred Qualifications:

  • Familiarity with Message Passing Interface (MPI) and parallel computing environments preferred.
  • Experience with containerization technologies such as Singularity/Apptainer.
  • Knowledge of networking, storage, and parallel file systems in a clustered environment.
Special Instructions:

Interested applicants should submit a resume detailing their relevant work experience and qualifications, a cover letter explaining their interest in the position, and references. Inquiries can be emailed to starhpc@hofstra.edu .

Deadline:

Open Until Filled

Date Posted:

06/03/2024

EEO Statement:

Hofstra University is an equal opportunity employer, committed to fostering diversity in its faculty, administrative staff and student body, and encourages applications from the entire spectrum of a diverse community.

Salary/Salary Range:

$85,000



  • Hempstead, United States Hofstra University Full time

    Job DescriptionAbout Hofstra:Hofstra University is nationally ranked and recognized as Long Island’s largest private university located in Hempstead, N.Y. When you work at Hofstra, you join a team of talented professionals committed to preparing students for the challenges of tomorrow, in an environment that cultivates learning through the free and open...


  • Hempstead, United States InsideHigherEd Full time

    About Hofstra:Hofstra University is nationally ranked and recognized as Long Island’s largest private university located in Hempstead, N.Y. When you work at Hofstra, you join a team of talented professionals committed to preparing students for the challenges of tomorrow, in an environment that cultivates learning through the free and open exchange of ideas...


  • Hempstead, United States InsideHigherEd Full time

    About InsideHigherEdInsideHigherEd is a leading online source for news, opinion, and resources for professionals in higher education. Our mission is to provide accurate, unbiased, and timely information to help institutions and individuals succeed in an ever-changing landscape.Position Title:HPC Systems Administrator, School of EngineeringJob Summary:We are...


  • Hempstead, United States Hofstra University Full time

    Position InformationAbout HofstraHofstra University is nationally ranked and recognized as Long Island's largest private university located in Hempstead, N.Y. When you work at Hofstra, you join a team of talented professionals committed to preparing students for the challenges of tomorrow, in an environment that cultivates learning through the free and open...

  • HPC Engineer III

    2 weeks ago


    Atlanta, GA, United States TEKsystems Full time

    Description:We are looking for a highly motivated team member to help support our clients innovative, new AWS HPC computing platform and service. This position will work with a variety of technical teams and with faculty to help engineer well-designed high performance computing solutions that advance knowledge discovery, tackle challenging scientific...


  • Hanover, NH, United States Dartmouth College Full time

    Posting date: 06/10/2024Open Until Filled: YesPosition Number: Position Title: Research Cyberinfrastructure Engineer II, HPC and GPU Cluster (RCIEII)Department this Position Reports to: Research CyberinfrastructureHiring Range Minimum: $99,400Hiring Range Maximum: $114,300Union Type: Not a Union PositionSEIU Level: Not an SEIU PositionFLSA Status:...

  • System Administrator

    2 weeks ago


    Worcester, MA, United States University of Massachusetts Medical School Full time

    OverviewGENERAL SUMMARY OF POSITION: The Enterprise Client Solution Engineer is a critical role in UMass Chan Medical School's efforts to build a world class Information Technology support organization. The chief responsibility is to administer the enterprise computer management system(s) as part of the standardization of operating systems and software...

  • Systems Administrator

    2 weeks ago


    Florida, NY, United States University of Miami Full time

    Current Employees:If you are a current Staff, Faculty or Temporary employee at the University of Miami, please click here to log in to Workday to use the internal application process. To learn how to apply for a faculty or staff position using the Career worklet, please review this tip sheet .The department of Clinical Engineering has an exciting opportunity...


  • Florida, NY, United States University of Miami Full time

    Current Employees:If you are a current Staff, Faculty or Temporary employee at the University of Miami, please click here to log in to Workday to use the internal application process. To learn how to apply for a faculty or staff position using the Career worklet, please review this tip sheet .The University of Miami Health System, "UHealth", Information...


  • Tucson, AZ, United States Raytheon Full time

    Date Posted:2024-04-18Country:United States of AmericaLocation:AZ855: RMS AP Bldg M East Hermans Road Building M05, Tucson, AZ, 85756 USAPosition Role Type:OnsiteSystem Administrator, High Performance Computing (HPC) - P3Principal Specialist levelRaytheon Digital TechnologyTucson AZ - onsite roleAt Raytheon, the foundation of everything we do is rooted in...


  • Tucson, AZ, United States Raytheon Full time

    Date Posted:2024-08-16Country:United States of AmericaLocation:AZ805: RMS AP Bldg East Hermans Road Building 805, Tucson, AZ, 85756 USAPosition Role Type:OnsiteAt Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We bring the strength of more than...

  • Systems Administrator

    2 weeks ago


    Florida, NY, United States University of Miami Full time

    Current Employees:If you are a current Staff, Faculty or Temporary employee at the University of Miami, please click here to log in to Workday to use the internal application process. To learn how to apply for a faculty or staff position using the Career worklet, please review this tip sheet .As member of the Digital Transformation and Ancillary Applications...


  • Providence, RI, United States Brown University Full time

    Senior Systems Administrator/Lead Systems Administrator REQ195753 Barus & Holley Job Description:The School of Engineering at Brown University has a new and exciting opportunity to hire a Senior Systems Administrator / Lead Systems Administrator.Manage, maintain and support the School of Engineering (SoE), and related, AD domain-based computing...


  • Tucson, AZ, United States Raytheon Full time

    Date Posted:2024-07-30 Country:United States of America Location:AZ855: RMS AP Bldg M East Hermans Road Building M05, Tucson, AZ, 85756 USA Position Role Type:Onsite System Administrator, High Performance Computing (HPC) - P2 Raytheon Digital Technology Tucson AZ - onsite role Ability to obtain a US Clearance At Raytheon, the foundation of everything we do...


  • Hempstead, United States InsideHigherEd Full time

    About InsideHigherEdInsideHigherEd is a leading source of news, opinion, and data for the higher education community. We are dedicated to providing accurate and unbiased information to help institutions and individuals succeed in an ever-changing landscape.Job SummaryWe are seeking a highly skilled and detail-oriented Senior Support Specialist to join our...


  • Marlborough, MA, United States Raytheon Full time

    Date Posted:2024-08-12Country:United States of AmericaLocation:MA803: Marlborough, MA Building 3 1001 Boston Post Road Building 3, Marlborough, MA, 01752 USAPosition Role Type:OnsiteAt Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We bring the...


  • El Segundo, CA, United States Raytheon Full time

    Date Posted:2024-08-29Country:United States of AmericaLocation:CA220: 2202 E El Segundo Blvd BldgE East El Segundo Boulevard Building E02, El Segundo, CA, 90245 USAPosition Role Type:OnsiteAt Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We bring...


  • Tewksbury, MA, United States Raytheon Full time

    Date Posted:2024-08-13Country:United States of AmericaLocation:MA131: Tewksbury, MA Bldg 1 Assabet 50 Apple Hill Drive Assabet - Building 1, Tewksbury, MA, 01876 USAPosition Role Type:OnsiteAt Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We...

  • Systems Administrator

    2 weeks ago


    Maryland, NY, United States College of Southern Maryland Full time

    Position SummaryLocated 45 minutes from the Nation's Capital, nestled in a history-rich community of southern Maryland, The College of Southern Maryland (CSM) is a two-time Aspen Award-winning institution (top 15% of Community Colleges) with academic programs in over 100 disciplines. CSM is among America's top 100 producers of Minority Associate Degrees in...


  • Worcester, MA, United States University of Massachusetts Medical School Full time

    Job DescriptionOverviewGENERAL SUMMARY OF POSITION:     Under the general supervision of the Lead Operating Engineer or designee, the Building System Operating Engineer is responsible for the safe and reliable operation of the building and its operating systems.  Respond to all building related problems reported by the staff, patients and visitors. ...