High Performance Computing Engineer

3 weeks ago


Washington, Washington, D.C., United States Howard University Full time

High Performance Computing Engineer

locations
Interdisciplinary Research Building
time type
Full time

job requisition id
JR104450
The Talent Acquisition department hires qualified candidates to fill positions which contribute to the overall strategic success of Howard University. Hiring staff "for fit" makes significant contributions to Howard University's overall mission.

BASIC FUNCTION:

The purpose of this position is to serve as the High-Performance Computing Engineer for the Research Institute for Tactical Autonomy (RITA), University Affiliated Research Center (UARC).

SUPERVISORY ACCOUNTABILITY:

Typically responsible for performing some non-supervisory duties in addition to supervisory responsibilities.

NATURE AND SCOPE:

Serve as the High-Performance Computing Engineer for the Research Institute for Tactical Autonomy (RITA), internal contacts include executives, administrators, faculty, staff and students of the departments and the University at large with special emphasis on the HR, ETS, Finance, Office of Research and The Office of the CFO. External contacts include representatives from federal government, other colleges and universities, professional associations, consultants, vendors, and the general public.

PRINCIPAL ACCOUNTABILITIES:

Responsible for understanding the concepts, procedures, and guidelines to solve highly complex problems in the maintenance and hardware/software network infrastructure.

Experience performing system set-up, experiments, and diagnostics to evaluate printed circuit board exchanges, and troubleshoot and make component repairs based on test results.

Performs configurations and operates multipurpose, multi-tasking computer systems.

Supports day-to-day operations for the Computing team by monitoring computing resource performance, managing configurations, and addressing security administration. Applies revisions to system firmware and software. Engages and collaborates with vendors to assist with support activities as required.

Performs training and support for technical staff in the use of new software and hardware, either developed or acquired.

Creates, deletes, maintains and manages HPC researcher accounts and logins for RITA staff, performs system back-ups, and maintains system configuration files.

Installs, configures, modifies, tunes and maintains various research software applications for access on HPC clusters.

Performs researcher support and documentation for software applications, programs and enterprise services.

Designs, installs, configures, and performs document management for cluster infrastructure, including operating systems, job schedulers, resource managers, provisioning managers, configuration managers, network devices, and other components.

Investigates, debugs, and addresses researcher inquiries and requests efficiently through a customer issue ticketing system. Communicates complex technical concepts in simple, straightforward language.

Explores emerging technologies and technical developments to address expanding analytical requirements. Identifies new services and develops implementation plans. Stays current with best practices in the HPC field. Maintains collaborative relationships with peer HPC research organizations.

Performs other related duties as assigned or requested.

CORE COMPETENCIES:

Familiarity with low-latency/high-bandwidth, interconnected infrastructure (including InfiniBand, 10/100GigE, and others).

Expertise with HPC system software cluster management tools, job schedulers, and other HPC tools including Slurm, Ansible, and more.

Proficiency with fundamental programming skills (Tensorflow, PyTorch, ML/AI Tools, Python, C/C++ or similar languages). Expertise with administration, monitoring, and maintaining secure Linux/Unix operating systems (CentOS).

Knowledge of HPC storage (FC, SAS) principles, file systems (NFS, Lustre, BeegFS, ZFS, etc.), and compute node storage, Network Attached Storage.

Proficiency with web interfacing of ML/AI tools such as Tensorflow, PyTorch

Ability to drive technical leadership and management of complex, large-scale computing system projects.

Proficiency with multi-vendor management, security and network/Internet protocols.

Demonstrated expertise in design configuration and planning, with excellent organization skills, and the ability to identify and resolve problems and manage performance.

Excellent written and oral communication skills, with experience presenting technical topics to nontechnical audiences.

Ability to establish processes for maintaining system performance and managing best-in-class standards.

Knowledge of computer applications and experience with accompanying user-friendly software, e.g., Workday, word processing, spreadsheet, data base, outlook, presentation, etc.

Excellent leadership, training and developmental skills.

Skill in oral and written (English) communications with the ability to explain complicated, fiscal and budgetary processes to lay persons, and the ability to make public presentations.

Strong organizational skills to establish priorities meet deadlines and perform in a responsible, professional manner.

Ability to maintain harmonious working relationship with staff, students, faculty and University officials and the general public.

Skill in leadership with ability to delegate tasks and assignments appropriately.

Ability to manage cross-functional teams, delegate tasks, and promote and direct staff development.

Ability to conduct research, compile, and prepare comprehensive complex financial and budget reports.

Ability to keep abreast of and adhere to new policies initiated by changes in federal, District of Columbia or University regulations and to communicate this information to others.

Strong decision-making skills.

MINIMUM REQUIREMENTS:

Bachelor's degree (foreign equivalent or higher) in a relevant field, such as computer science, computer information systems, etc.

Four or more years of experience in one of the following fields: information technology, HPC system administration, network engineering, or large-scale HPC file systems.

Relevant experience must include Linux and scripting (for example, Python) and ML/AI programming such as Tensotflow, PyTorch, etc. In addition, it may also include maintaining hardware or software over their lifecycle (i.e., requirements analysis, implementation, testing, integration, deployment/installation, and maintenance), or computer/network security, or high-performance computing.

Experience with cloud computing and container technologies.



  • Washington, Washington, D.C., United States Howard University Full time

    Associate ChairlocationsDowning Hall (College of Engineering)time typePart timejob requisition idJR105278The Talent Acquisition department hires qualified candidates to fill positions which contribute to the overall strategic success of Howard University. Hiring staff "for fit" makes significant contributions to Howard University's overall mission.JOB...

  • Computer Technician

    4 weeks ago


    Washington, Washington, D.C., United States Unisys Full time

    We Believe in Better We are a global information technology company that builds high-performance, security-centric solutions that can help change the world. Enhancing people's lives through secure, reliable advanced technology is our vision.At Unisys, we believe in better Here, you have the opportunity to learn new skills, apply your expertise, and solve...

  • lead engineer

    4 weeks ago


    Washington, Washington, D.C., United States COLLEGE BOARD Full time

    College Board seeks Lead Engineer to use the best Cloud technologies to design and implement high-quality solutions in support of programs across the organization. Requires Bachelor's degree or foreign education equivalent in CS, Computer Apps or Engg + 7 years' Software development experience. 100% remote position performed from anywhere in U.S.To apply,...


  • Washington, Washington, D.C., United States Legislative Branch Full time

    Summary This position is located within the Architect of the Capitol (AOC), Utilities and Power Plant Operations. The High Voltage Technician is responsible for installing, repairing, and maintaining high voltage electric power-controlling equipment and/or distribution lines. The ability to work on a rotating shift will be required.The ideal candidate will...


  • Washington, Washington, D.C., United States MEDSTAR HEALTH Full time

    General Summary of Position***$10,000 Sign-on Bonus***MedStar Health is looking for a Computed Tomography Technologist to join our team Schedule: Week 1: M-W-FWeek 2: T-Th-Sat As a Computed Tomography Technologist, you will perform C.T. examinations in accordance with established protocols, as requested by referring physicians. These functions are performed...


  • Washington, Washington, D.C., United States Atechstar Full time

    What we are looking forExperience with building real time inference systems for deploying Machine Learning models. Proficiency in Python (preferred) or another high level programming language (e.g. Java C Scala) and familiarity with Linux/Unix/Shell environments. Advanced knowledge of complex software design distributed system design design patterns data...

  • Full stack engineer

    1 month ago


    Washington, Washington, D.C., United States menschforce Full time

    Objectives of this Role Work across the full stack building highly scalable distributed solutions that enable positive user experiences and measurable business growth Develop new features and infrastructure development in support of rapidly emerging business and project requirements Assume leadership of new projects from conceptualization to deployment...

  • Field IT Engineer

    1 day ago


    Washington, Washington, D.C., United States Non-Departmental Agency Full time

    Summary Field IT Technicians are responsible for the installation, operation, and maintenance of computer, network, and telecommunications systems in Agency data centers worldwide. Duties As a Field IT Engineer for CIA, you are responsible for the installation, operation, and maintenance of computer, network and telecommunications systems in Agency data...


  • Washington, Washington, D.C., United States ManTech Full time

    Secure our Nation, Ignite your FutureBecome an integral part of a diverse team while working at an Industry Leading Organization, where our employees come first. At ManTech , you'll help protect our national security while working on innovative projects that offer opportunities for advancement. Currently, ManTech is seeking a motivated, career and...

  • Software Engineer

    1 day ago


    Washington, Washington, D.C., United States ECS Full time

    ECS is seeking a Software Engineer to work in our Fairfax, VA office.Job Description:ECS is seeking a Software Engineer to work in our Fairfax, VA office.Job Description:ECS is seeking a Software Engineer to support the execution of a variety of projects including Artificial Intelligence/Machine Learning and Big Data/Cloud Solutions, with a focus supporting...


  • Washington, Washington, D.C., United States Allyon Full time

    Summary: Allyon, Inc. is an established IT and Healthcare Services firm and we love what we do It makes our day when we are able to help talented individuals achieve their career goals while at the same time helping our clients build quality teams. If you are interested in joining the Allyon Team, please apply or submit your resume for review today Job...

  • Chief Engineer

    1 day ago


    Washington, Washington, D.C., United States Cushman Wakefield Multifamily Full time

    Job Title [UNION] Chief Engineer Job Description Summary Chief Engineer is responsible for the effective daily leadership of his/her staff, managing the engineering program to the highest level of quality work and customer service as well as the administration of the engineering department in alignment with the management team, the C&W engineering...

  • Network Engineer

    1 day ago


    Washington, Washington, D.C., United States Honu Services Full time

    Security Clearance:Must be a US Citizen eligible for a DoD Secret Clearance.Must be a U.S. Citizen. A high-level Department of Defense (DoD) active security clearance may be required. Applicants selected will be subject to a security investigation and may need to meet eligibility requirements for access to government information.Job Summary: Network Engineer...

  • cloud engineer

    1 month ago


    Washington, Washington, D.C., United States Atechstar Full time

    Job DescriptionWhat you will needGraduate or Post Graduate degree in Computer Science or equivalent qualification Minimum 1-6 years of software engineering experience developing and delivering products and solutions in a commercial environment Enterprise architecture expertise; designing application ecosystems consisting of many sub-components Experience...


  • Washington, Washington, D.C., United States USAJobs Full time

    DutiesYou will serve as a working leader overseeing the High Voltage Electricians. You will check work in progress, reports to the supervisor on status and causes of work delay and when complete makes sure procedures, methods and deadlines have been met. You will perform duties in construction, maintenance and repair of electrical distribution systems;...


  • Washington, Washington, D.C., United States The Washington Post Full time

    Job DescriptionDuties: Design and build highly scalable applications to support Digital Subscriptions mission of acquiring and retaining digital subscribers. Optimize applications for maximum speed and scalability. Perform story grooming, plan work backlog, work estimation, sprint planning, retros, and demos. Ensure the technical feasibility of UI/UX...

  • Computer Scientist

    1 month ago


    Washington, Washington, D.C., United States Department Of Homeland Security Full time

    Summary The ideal candidate must have expertise in data science, mathematics, and statistics to evaluate data and understand network design. They should be able to identify potential assistance and review processes to ensure proper fraud, waste, and abuse data controls. The candidate must also possess excellent communication skills to present formal...

  • electrical engineer

    1 week ago


    Washington, Washington, D.C., United States USAJobs Full time

    DutiesYou will prepare electrical design drawings, calculations, and specifications to accompany design models generated on electrical engineering design software (e.g., AutoCAD). You will develop solutions to eliminate obstructions to engineering program goals. You will develop project plans (e.g., engineering design and construction project) to include...

  • General Engineer

    1 month ago


    Washington, Washington, D.C., United States USAJobs Full time

    DutiesYou will develop VHDL targeting Xilinx Field Programmable Gate Arrays (FPGA's) and System on a Chip (SOC's). You will interact with an interdisciplinary team consisting of Electrical Engineers and Software Engineers. You will analyze and perform experiments to develop prototype information security systems. You will use in-circuit emulators (ICE's) and...

  • engineer/scientist

    7 days ago


    Washington, Washington, D.C., United States USAJobs Full time

    DutiesYou will advance innovation and adoption of modern software technologies to rapidly build secure, resilient software systems. You will advance the innovation and adoption of modern software processes, practices and methods such as Agile and DevSecOps through collaboration across the Naval Research and Development Environment (NR and DE) and DON...