High Performance Computing Engineer

3 weeks ago


Washington, United States Howard University Full time

Location: Interdisciplinary Research Building
Time type: Full time
Job requisition ID: JR104450

The Talent Acquisition department hires qualified candidates to fill positions which contribute to the overall strategic success of Howard University. Hiring staff “for fit” makes significant contributions to Howard University’s overall mission.

BASIC FUNCTION:

The purpose of this position is to serve as the High-Performance Computing Engineer for the Research Institute for Tactical Autonomy (RITA), University Affiliated Research Center (UARC).

SUPERVISORY ACCOUNTABILITY:

Typically responsible for performing some non-supervisory duties in addition to supervisory responsibilities.

NATURE AND SCOPE:

Serves as the High-Performance Computing Engineer for the Research Institute for Tactical Autonomy (RITA). Internal contacts include executives, administrators, faculty, staff, and students of the departments and the University at large, with special emphasis on HR, ETS, Finance, the Office of Research, and the Office of the CFO. External contacts include representatives from the federal government, other colleges and universities, professional associations, consultants, vendors, and the general public.

PRINCIPAL ACCOUNTABILITIES:

Understands and applies the concepts, procedures, and guidelines needed to solve highly complex problems in the maintenance of the hardware/software network infrastructure.

Performs system set-up, experiments, and diagnostics to evaluate printed circuit board exchanges; troubleshoots and makes component repairs based on test results.

Configures and operates multipurpose, multitasking computer systems.

Supports day-to-day operations for the Computing team by monitoring computing resource performance, managing configurations, and addressing security administration. Applies revisions to system firmware and software. Engages and collaborates with vendors to assist with support activities as required.

Performs training and support for technical staff in the use of new software and hardware, either developed or acquired.

Creates, deletes, maintains and manages HPC researcher accounts and logins for RITA staff, performs system back-ups, and maintains system configuration files.

Installs, configures, modifies, tunes and maintains various research software applications for access on HPC clusters.

Performs researcher support and documentation for software applications, programs and enterprise services.

Designs, installs, configures, and performs document management for cluster infrastructure, including operating systems, job schedulers, resource managers, provisioning managers, configuration managers, network devices, and other components.

Investigates, debugs, and addresses researcher inquiries and requests efficiently through a customer issue ticketing system. Communicates complex technical concepts in simple, straightforward language.

Explores emerging technologies and technical developments to address expanding analytical requirements. Identifies new services and develops implementation plans. Stays current with best practices in the HPC field. Maintains collaborative relationships with peer HPC research organizations.

Performs other related duties as assigned or requested.

CORE COMPETENCIES:

Familiarity with low-latency/high-bandwidth, interconnected infrastructure (including InfiniBand, 10/100GigE, and others).

Expertise with HPC system software cluster management tools, job schedulers, and other HPC tools including Slurm, Ansible, and more.

Proficiency with fundamental programming skills (TensorFlow, PyTorch, ML/AI tools, Python, C/C++, or similar languages). Expertise with administration, monitoring, and maintaining secure Linux/Unix operating systems (CentOS).

Knowledge of HPC storage (FC, SAS) principles, file systems (NFS, Lustre, BeeGFS, ZFS, etc.), compute node storage, and network-attached storage.

Proficiency with web interfacing of ML/AI tools such as TensorFlow and PyTorch.

Ability to drive technical leadership and management of complex, large-scale computing system projects.

Proficiency with multi-vendor management, security and network/Internet protocols.

Demonstrated expertise in design configuration and planning, with excellent organization skills, and the ability to identify and resolve problems and manage performance.

Excellent written and oral communication skills, with experience presenting technical topics to nontechnical audiences.

Ability to establish processes for maintaining system performance and managing best-in-class standards.

Knowledge of computer applications and experience with accompanying user-friendly software, e.g., Workday, word processing, spreadsheet, database, Outlook, presentation, etc.

Excellent leadership, training and developmental skills.

Skill in oral and written (English) communications, with the ability to explain complicated fiscal and budgetary processes to laypersons and to make public presentations.

Strong organizational skills to establish priorities, meet deadlines, and perform in a responsible, professional manner.

Ability to maintain harmonious working relationships with staff, students, faculty, University officials, and the general public.

Skill in leadership with ability to delegate tasks and assignments appropriately.

Ability to manage cross-functional teams, delegate tasks, and promote and direct staff development.

Ability to conduct research, compile, and prepare comprehensive complex financial and budget reports.

Ability to keep abreast of and adhere to new policies initiated by changes in federal, District of Columbia or University regulations and to communicate this information to others.

Strong decision-making skills.

MINIMUM REQUIREMENTS:

Bachelor's degree (foreign equivalent or higher) in a relevant field, such as computer science, computer information systems, etc.

Four or more years of experience in one of the following fields: information technology, HPC system administration, network engineering, or large-scale HPC file systems.

Relevant experience must include Linux, scripting (for example, Python), and ML/AI programming such as TensorFlow, PyTorch, etc. In addition, it may include maintaining hardware or software over its lifecycle (i.e., requirements analysis, implementation, testing, integration, deployment/installation, and maintenance), computer/network security, or high-performance computing.

Experience with cloud computing and container technologies.



