Current jobs related to Research Cyberinfrastructure Engineer II, HPC and GPU Cluster - Hanover NH United States - Dartmouth College
-
HPC Research Software Engineer
3 weeks ago
, TX, United States Amentum Full timeAbout the RoleWe are seeking a highly skilled HPC Research Software Engineer to join our team at Johnson Space Center (JSC) in support of the Flight Sciences Laboratory (FSL). This position will focus on developing, deploying, and optimizing software for use in the FSL and other NASA HPC resources.Key ResponsibilitiesPartner with FSL users to evaluate and...
-
HPC System Administrator I
2 weeks ago
Troy, MI, United States Roush Full timeRoushTitle HPC System Administrator ILocation Troy, MICategory Engineering & DesignHiring Type Full Time At Roush, we fuse technology and engineering to provide product development solutions to customers in a diverse range of industries. Widely recognized for providing engineering, testing, prototype, and manufacturing services to the transportation...
-
Senior Principal Systems Development Engineer
3 weeks ago
Austin, TX, United States Dell Full timeSenior Principal Systems Development EngineerOur customers’ system requirements are usually highly complex. Bringing together hardware and software systems design, Systems Development Engineering operates at the very cutting edge of technology to meet them. We design and develop electronic and electro-mechanical or systems-orientated products, conduct...
-
Austin, TX, United States Dell Full timeSenior Principal Systems Development Engineer Our customers' system requirements are usually highly complex. Bringing together hardware and software systems design, Systems Development Engineering operates at the very cutting edge of technology to meet them. We design and develop electronic and electro-mechanical or systems-orientated products, conduct...
-
Senior Solutions Architect, NIC and DPU
23 hours ago
San Francisco, CA, United States NVIDIA Full timeNVIDIA is the world leader in computer graphics, artificial intelligence, and accelerated computing. For over 25 years, we have been at the forefront of research and engineering around the greatest advances in technology. Our history of innovation drives us to solve the worlds hardest problems.NVIDIA is looking for Senior NIC/DPU Solutions Architect to join...
-
Senior Principal Systems Development Engineer
3 weeks ago
Anderson Mill, TX, United States Dell Full timeSenior Principal Systems Development Engineer Our customers’ system requirements are usually highly complex. Bringing together hardware and software systems design, Systems Development Engineering operates at the very cutting edge of technology to meet them. We design and develop electronic and electro-mechanical or systems-orientated products, conduct...
-
Round Rock, TX, United States Dell Full timeSenior Principal Systems Development EngineerOur customers’ system requirements are usually highly complex. Bringing together hardware and software systems design, Systems Development Engineering operates at the very cutting edge of technology to meet them. We design and develop electronic and electro-mechanical or systems-orientated products, conduct...
-
San Francisco, CA, United States NVIDIA Full timeNVIDIA is the world leader in computer graphics, artificial intelligence, and accelerated computing. For over 25 years, we have been at the forefront of research and engineering around the greatest advances in technology. Our history of innovation drives us to solve the worlds hardest problems.NVIDIA is looking for Senior Networking (ETH/IB) Solutions...
-
Senior Principal Systems Development Engineer
3 weeks ago
Austin, TX, United States Dell Full timeSenior Principal Systems Development EngineerOur customers’ system requirements are usually highly complex. Bringing together hardware and software systems design, Systems Development Engineering operates at the very cutting edge of technology to meet them. We design and develop electronic and electro-mechanical or systems-orientated products, conduct...
-
Orlando, FL, United States University of Central Florida Full timeThe OpportunityThe Faculty Cluster Initiative (FCI) at the University of Central Florida (UCF) is recruiting for a 9-month tenured professor who will act as the lead for the Genomics and Bioinformatics Cluster (). This position has an anticipated start date of August 8, 2025. An ideal candidate should have a strong background in genomics and bioinformatics,...
-
Montgomery, AL, United States TEKsystems Full timeDescription:TEKsystems is seeking a HPC Systems Administrator to support High Performance Computing systems at client sites in the Montgomery, AL and Oklahoma City, OK. This position will sit remotely (ideally in South region) but does require monthly travel to the client sites.Responsibilitieso HPE Performance Cluster Manager (HPCM) and Parallel File...
-
Systems/GPU Programmer
3 months ago
Los Angeles, CA, United States Vast.ai Full timeAbout Us: At Vast.ai, we value creativity, energy, drive and teamwork. We are looking for talented people who share these values to join as we grow our team. Our vision is to widely distribute AI computing to reshape our future for the good of humanity. Come join us. If witnessing the birth of AGI excites you, we can't wait to hear from you! Vast.ai is a...
-
Systems/GPU Programmer
2 weeks ago
Los Angeles, CA, United States Vast.ai Full timeAbout Us: At -, we value creativity, energy, drive and teamwork. We are looking for talented people who share these values to join as we grow our team. Our vision is to widely distribute AI computing to reshape our future for the good of humanity. Come join us. If witnessing the birth of AGI excites you, we can't wait to hear from you! -is a market based...
-
Sustaining Engineer II
4 weeks ago
, NH, United States SigSauer Full timeJob Title: Sustaining Engineer IISIG SAUER, Inc. is a leading provider of firearms, electro-optics, ammunition, and other products. We are seeking a highly skilled Sustaining Engineer II to join our team.Job Summary:The Sustaining Engineer II will be responsible for maintaining and improving existing firearms products to ensure consistent quality,...
-
Research Engineer Internship Opportunity
2 weeks ago
, OH, United States Matrix Research, Inc. Full timeResearch Engineer Internship OpportunityMatrix Research, Inc. is seeking highly motivated and talented students to join our team as Research Engineer Interns. As a Research Engineer Intern, you will have the opportunity to work on cutting-edge projects related to radar systems, radio frequency, and sensor exploitation technologies.ResponsibilitiesAssist with...
-
Research Scientist, Neural Code
5 months ago
Hanover, United States InsideHigherEd Full timeResearch Scientist, Neural Code Location:Hanover, NHOpen Date:Nov 27, 2023Description:The Dartmouth Presidential Cluster on Breaking the Neural Code invites applications for a staff Research Scientist. The Cluster is dedicated to the computational and neurophysiological underpinnings of human behavior and health. Our goal is to combine analyses of...
-
Environmental Technician II
23 hours ago
Denver, CO, United States Clean Harbors Full timeHPC-Industrial Powered by Clean Harbors in Denver, CO is looking for an Environmental Technician II to join their safety conscious team! The Environmental Technician II is responsible for working at an industrial site in Denver, Colorado, with the possibility of traveling to various locations across the country. The position encompasses a wide range of...
-
Environmental Technician II
2 days ago
United States, CO, Denver Clean Harbors Full timeHPC-Industrial Powered by Clean Harbors in Denver, CO is looking for an Environmental Technician II to join their safety conscious team! The Environmental Technician II is responsible for working at an industrial site in Denver, Colorado, with the possibility of traveling to various locations across the country. The position encompasses a wide range of...
-
Mechanical Engineer II
2 weeks ago
, MA, United States Johnson and Johnson Full timeJob Title: Mechanical Engineer IIAbiomed, a member of the Johnson & Johnson Family of Companies, is currently recruiting for a Mechanical Engineer II, to be located in Danvers, MA.About the RoleAs a Mechanical Engineer II in the Post-Market Engineering group, you will play an integral role in Abiomed's Quality Department as a hands-on engineering role...
-
Research and Development Engineer
2 weeks ago
Hanover, New Hampshire, United States Creare Full timeJoin Creare's R&D TeamWe are seeking exceptional engineers to pursue varied technical research in a multi-disciplinary environment.About CreareCreare is a leader in cutting-edge R&D since 1961, conducting applied research, developing new technologies, and providing analysis, design, experimental, computational, product, and consulting services to industry...
Research Cyberinfrastructure Engineer II, HPC and GPU Cluster
2 months ago
Open Until Filled: Yes
Position Number:
Position Title: Research Cyberinfrastructure Engineer II, HPC and GPU Cluster (RCIEII)
Department this Position Reports to: Research Cyberinfrastructure
Hiring Range Minimum: $99,400
Hiring Range Maximum: $114,300
Union Type: Not a Union Position
SEIU Level: Not an SEIU Position
FLSA Status: Exempt
Employment Category: Regular Full Time w/end date
Scheduled Months per Year: 12
Scheduled Hours per Week: 40
Schedule: M-F, 8a-5p
Location of Position: Hanover, NH
Remote Work Eligibility?: Hybrid
Is this a term position?: Yes
If yes, length of term in months.: 36
Is this a grant funded position?: No
Position Purpose: The Research Cyberinfrastructure Engineer II (RCIEII) enhances research computing infrastructure, focusing on administration, High-Performance Computing (HPC), cloud, and advanced computing solutions. Responsibilities include building and maintaining a graphical processing unit (GPU) cluster primarily used for artificial intelligence (AI) and machine learning (ML) workloads. This role increases infrastructure security, availability, and scalability, leading automation and system optimization initiatives to advance research capabilities. The RCIEII provides advanced support, develops innovative solutions, and leads projects to enhance research success.
Description:
Join Our Team as a Research Cyberinfrastructure Engineer II, HPC and GPU Cluster at Dartmouth
Are you ready to enhance the future of research computing? Dartmouth is looking for a dynamic Research Cyberinfrastructure Engineer II (RCIEII) to innovate and lead in HPC and GPU cluster administration.
About the Role:
As an RCIEII, you will enhance research computing infrastructure, focusing on building and maintaining a GPU cluster for AI and ML workloads. You will ensure infrastructure security, availability, and scalability while leading automation and system optimization initiatives.
What You'll Do:
Lead Projects: Manage and optimize HPC environments and cloud-based infrastructures, focusing on high availability and performance.
Innovate: Implement cutting-edge computing services and applications, integrating GPU technologies into HPC environments.
Collaborate: Build strategic partnerships with IT departments, technology providers, and research groups to foster collaboration.
Mentor and Train: Create knowledge-sharing platforms, coordinate hackathons and workshops, and promote continuous development.
Your Skills and Expertise:
- Bachelor's degree in Computer Science/IT or equivalent experience.
- 3+ years in research computing, focusing on HPC system optimization and security.
- Proficiency in scripting (Python, Bash) and automation tools (Ansible, Terraform).
- Expertise in Linux, Windows server management, and container technologies (Docker, Kubernetes).
- Skilled in cloud platforms (AWS, Azure, Google Cloud) and HPC software deployment.
Why Dartmouth?
Impactful Work: Contribute to groundbreaking research and innovative projects.
Collaborative Environment: Work with a diverse and interdisciplinary team of experts.
Professional Growth: Continuous learning and professional development opportunities.
Join Us:
Be a part of a team driving innovation in research computing. Apply now to lead the future of research cyberinfrastructure at Dartmouth
Required Qualifications - Education and Yrs Exp: Bachelors plus 3-5 years' experience or equivalent combination of education and experience
Required Qualifications - Skills, Knowledge and Abilities:
- Bachelor's degree or equivalent experience in Computer Science/IT.
- 3+ years in research computing, focusing on HPC system optimization and security.
- Proficient in scripting (Python, Bash) and automation tools.
- Proven project success in enhancing research computing environments.
- Expertise in Linux and Windows server management.
- Experienced in Docker and Kubernetes.
- Familiar with Ansible, Terraform, Puppet for automation.
- Strong analytical and problem-solving skills.
- Skilled in cloud platforms (AWS, Azure, Google Cloud).
- Effective communication and teamwork skills.
- Leadership experience in mentoring and team development.
Preferred Qualifications:
- Advanced degree or certifications in relevant fields.
- Expertise in AI/ML software and frameworks.
- Experience with CUDA programming and/or C/C++.
- Professional certifications (e.g., AWS Certified Solutions Architect, Google Cloud Professional Cloud Architect).
- Experience in academic/research IT environments.
- Hands-on data center operations experience.
- Proficient in HPC software deployment and troubleshooting.
- Skilled in cloud services for HPC workloads.
- Experience in developing and maintaining infrastructure documentation.
- Innovative in developing new services and applications.
- Comprehensive understanding of security in computing environments.
- Excellent troubleshooting skills using command-line tools and vendor support.
Department Contact for Recruitment Inquiries: Jonathan Kulp
Department Contact Phone Number:
Department Contact for Cover Letter and Title: Elijah Gagne
Department Contact's Phone Number:
Equal Opportunity Employer: Dartmouth College is an equal opportunity/affirmative action employer with a strong commitment to diversity and inclusion. We prohibit discrimination on the basis of race, color, religion, sex, age, national origin, sexual orientation, gender identity or expression, disability, veteran status, marital status, or any other legally protected status. Applications by members of all underrepresented groups are encouraged.
Background Check: Employment in this position is contingent upon consent to and successful completion of a pre-employment background check, which may include a criminal background check, reference checks, verification of work history, conduct review, and verification of any required academic credentials, licenses, and/or certifications, with results acceptable to Dartmouth College. A criminal conviction will not automatically disqualify an applicant from employment. Background check information will be used in a confidential, non-discriminatory manner consistent with state and federal law.
Is driving a vehicle (e.g. Dartmouth vehicle or off road vehicle, rental car, personal car) an essential function of this job?: Not an essential function
Special Instructions to Applicants: This position is a 36-month term position.
Dartmouth College has a Tobacco-Free Policy. Smoking and the use of tobacco-based products (including smokeless tobacco) are prohibited in all facilities, grounds, vehicles or other areas owned, operated or occupied by Dartmouth College with no exceptions. For details, please see our policy.
Quick Link:
Description: Cyberinfrastructure Operations
- Integrates GPU technologies into HPC environments, collaborating with researchers and HPC programmers.
- Acts as a Subject Matter Expert (SME) in cloud services, HPC, automation, storage, and container technologies (e.g., Docker, Kubernetes), providing advanced support and consultancy.
- Manages and optimizes HPC environments and cloud-based infrastructures, focusing on high availability, efficient load balancing, and performance across platforms such as AWS and GCP.
- Designs and implements networking configurations, maintaining security compliance (e.g., FISMA, PCI, GDPR, HIPAA).
- Develops and refines automation scripts and workflows using tools like Ansible, Terraform, Python, and PowerShell.
- Coordinates disaster recovery plans, data integrity strategies, oversees hypervisor environments, and ensures computing services' resilience.
- Provides on-call support, showcasing problem-solving capabilities and promoting knowledge sharing within the team.
- Implements security measures to protect HPC environments, applications, servers, and storage from cyber threats.
- Utilizes scalability techniques to ensure HPC systems can accommodate growing research demands.
- Monitors system availability, implementing redundancy and failover strategies.
Percentage Of Time: 40%
Description: Computing and HPC Initiatives
- Leads initiatives to design and implement computing services and applications addressing specific research challenges.
- Collaborates with researchers to understand computational needs, translating these into practical, scalable solutions.
- Oversees the integration of cloud-based solutions for HPC workloads.
- Designs and manages data storage infrastructures ensuring data integrity, availability, and compliance with policies and regulations.
Percentage Of Time: 20%
Description: Collaboration and Relationship Management
- Builds and nurtures strategic partnerships with IT departments, technology providers, and research groups click apply for full job details