HPC admin
2 days ago
Inviting applications for the role of Consultant - HPC Admin
Responsibilities:
• Install, configure, and maintain Linux operating systems on HPC clusters.
• Manage job schedulers such as Slurm or LSF.
• Utilize node provisioning tools like Werewolf.
• Troubleshoot system issues and provide technical support to users.
• Monitor system performance and ensure optimal operation of the HPC environment.
• Collaborate with other IT professionals to integrate new technologies into the existing infrastructure.
• Progressive experience in HPC system administration, preferably in a Redhat/CentOS Linux environment.
• Expertise in troubleshooting complex system issues.
• Experience with parallel file systems and storage solutions.
• Strong knowledge of job schedulers such as Slurm or LSF.
• Familiarity with node provisioning tools like Werewolf.
• Proficiency in Linux OS administration
• Knowledge of job scheduling tools (e.g., Slurm)
• Understanding of node provisioning tools (e.g., Werewolf)
• Excellent problem-solving abilities
• Strong communication skills
• Ability to work collaboratively in a team-oriented environment
• Security+ certification preferred
• Linux+ certification preferred
• Top Secret Clearance: TS/SCI preferred
• On-site presence at customer location in Stennis, MS
• Availability for some on-call/weekend work
• Hands on experience setting up HPC compute cluster.
• Install Nvidia drivers
• Install manage configure GPU software stack like Pytorch, tensorflow, cuda Python
• Setup PBS job scheduler and supporting PBS servers
• Experience with Redhat and Rocky Linux; bash scripting
• Nice to have Docker, Kubernetes experience
• Nice to have Storage knowledge
• Nice to have networking and devops knowledge.
Qualifications we seek in you
Minimum Qualifications / Skills
Bachelor's Degree required. Preferably in Computer Science, Information Systems, or related field
Preferred qualifications / Skills
Very good written and presentation / verbal communication skills with experience of customer interfacing role. In-depth requirement understanding skills with good analytical and problem solving ability, interpersonal efficiency, and positive attitude
-
IT Systems Administrator
1 month ago
San Francisco, United States Altair Engineering Full timeTransforming the Future with Convergence of Simulation and Data IT Systems Administrator Job Summary: Our client in San Francisco, CA is looking for an IT Systems Administrator. This is a contract position. What You Will Do: Responsible for supporting and maintaining Slack and collaboration VW Technology Group workspaces. Serve as a technical expert for the...
-
IT Systems Administrator
3 weeks ago
San Francisco, United States Altair Full timeTransforming the Future with Convergence of Simulation and Data IT Systems Administrator Job Summary: Our client in San Francisco, CA is looking for an IT Systems Administrator. This is a contract position. What You Will Do: Responsible for supporting and maintaining Slack and collaboration VW Technology Group workspaces. Serve as a technical expert for the...
-
Site Reliability Engineer
2 weeks ago
San Francisco, CA, United States Mistral AI Full timeAbout Mistral At Mistral AI, we are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world. Our mission is to make AI ubiquitous and open. We are creative, low-ego, team-spirited, and have been passionate about AI for years. We hire people who thrive in competitive environments, because they find them more fun to work...
-
IT Systems Administrator
2 weeks ago
San Francisco, CA, United States Altair Engineering Full timeTransforming the Future with Convergence of Simulation and Data IT Systems Administrator Job Summary: Our client in San Francisco, CA is looking for an IT Systems Administrator. This is a contract position. What You Will Do: Responsible for supporting and maintaining Slack and collaboration VW Technology Group workspaces. Serve as a technical expert for...