HPC System Admin
3 weeks ago
Job Title: HPC System Admin
Work Location: Austin, TX
Position Type: Contract with possible extension
Duration: 12 + Months
Job Description:
Project Details:
Responsible for architecting and implementing Linux High Performance Computing (HPC) clusters. Performs system architecture duties on a Linux High performance computing (HPC) cluster including cluster management, virtualization, cluster usage monitoring, health monitoring, job scheduling, application integration/installation (open source as well as vendor supported), and application performance. Improve cluster performance through kernel changes, firmware updates, library stack changes, and application container management such as docker.
Mandatory Skills and Technologies, framework, and Methodologies:
Knowledge of Linux and UNIX operating systems, including scripting and programming proficiencies.
Experience with cloud bursting technologies.
Knowledge of cloud services like AWS SCOCA, Parallel Cluster, and Azure CycleCloud
Knowledge of HPC tools and storage: AWS Elastic Fabric Adapter, Azure ANF, Apache Spark, or Apache Ignite, Lustre, BeeFS
Demonstrate experience in programming system maintenance tasks in C, Java, Perl, batch/shell, or another general-purpose programming language.
Knowledge of NUMA and understanding of NUMA related APIs.
Be able to perform complex performance analysis including system processes, I/O subsystems, networks and other related components.
Must have experience with multi-threading and parallel processing tools and environments.
Must have experience as a systems administrator. Must have advanced ability to analyze complex IT systems.
Experience with high-performance servers and associated high-performance networks.
Experience installing and maintaining clustered environments, including automated installation methods.
Knowledge of common server hardware architectures including servers (CPU, bus, memory), SANS, disk arrays, network hardware.
Understanding of Red Hat Linux Operating system including processes, files, memory management and I/O systems; networking services and protocols (e.g., TCP/IP, SSL, FTP, Telnet, LDAP).
Understanding of IP networking, basic routing, TCP ports and network services, including SSH, LDAP, SFTP and HTTP(S). Ability to design, promote, and implement change control and configuration management, patch management, high availability systems, structured design and support methodologies.
Must be organized with a strong ability to deliver tasks on time, manage multiple efforts and be able to work with minimal supervision.
Demonstrated ability to proactively learn, adapt to and use new hardware/software technologies.
Good to have skills, Technologies, framework, and Methodologies
Performs system administration duties on a linux HPC Cluster, cluster management, virtualization, cluster usage monitoring, health monitoring, job scheduling, and application integration/installation.
Responsible for system implementation/integration and systems performance analysis.
Manages hardware and software applications in the production environment provided to HPC users.
Install software and updates
Coordinates with vendors to resolve hardware and software problems in HPC Cluster.
Facilitates the acquisition of hardware and software products and services for the HPC Cluster.
Knowledge of LSF or other open-source job schedulers.
Compile, configure, and integrate open source applications into HPC environment.
ble to learn and use internal software systems.
Monitors the availability of patches and updates and evaluates the importance to the environment and schedules installations accordingly.
Keeps abreast of the latest HPC hardware and software technology, evaluating technologies as needed.
Designs, implements and administers high performance computing cluster, performing proof of concepts such as software containers (ex. Docker).
Interacts effectively with a broad range of colleagues such as Applied Materials researchers and other IT staff.
Other duties may be assigned.
-
HPC Systems Engineer/ DELL
3 weeks ago
Austin, United States ShiftCode Analytics Full timeVisa: USC, GC or GC-EAD Duration: 9 months with potential extension Location: Onsite in Austin, TX They'll give preference to someone who is currently local to Austin and then will consider people willing to relocate. Requirements: -Experience with HPC Systems environments and Infrastructures technologies and workloads -B.S. in CS/CE/EE, or at...
-
HPC Engineer
1 week ago
Austin, United States Optiver Full timeOptiver is seeking a Research Infrastructure Engineer to contribute significantly to the development and management of our research infrastructure across both on-premises and cloud platforms. This role involves hands-on work in scaling and supporting high-performance computing (HPC) and storage systems, which are critical for our growing demand in research...
-
HPC Engineer
5 days ago
Austin, United States Optiver Full timeOptiver is seeking a Research Infrastructure Engineer to contribute significantly to the development and management of our research infrastructure across both on-premises and cloud platforms. This role involves hands-on work in scaling and supporting high-performance computing (HPC) and storage systems, which are critical for our growing demand in research...
-
HPC Network Engineer
1 month ago
Austin, United States Algo Capital Group Full timeHPC Network EngineerStanding at the forefront of industry excellence, this Algorithmic Trading firm is renowned for its cutting-edge technology and innovative approach to financial markets. Leveraging sophisticated algorithms and high-performance computing systems, they execute trades with precision and efficiency; pioneering new strategies and techniques to...
-
HPC Network Engineer
1 month ago
Austin, United States Algo Capital Group Full timeHPC Network EngineerStanding at the forefront of industry excellence, this Algorithmic Trading firm is renowned for its cutting-edge technology and innovative approach to financial markets. Leveraging sophisticated algorithms and high-performance computing systems, they execute trades with precision and efficiency; pioneering new strategies and techniques to...
-
HPC Network Engineer
2 weeks ago
Austin, United States Algo Capital Group Full timeHPC Network EngineerStanding at the forefront of industry excellence, this Algorithmic Trading firm is renowned for its cutting-edge technology and innovative approach to financial markets. Leveraging sophisticated algorithms and high-performance computing systems, they execute trades with precision and efficiency; pioneering new strategies and techniques to...
-
HPC Data Center Technician
2 days ago
Austin, United States Core Scientific Full timeWho We Are Bold. Unapologetic. Hardworking. We are building something special. We transform energy into high-value compute with superior efficiency at scale. Today, that means powering and securing the Bitcoin Network. Tomorrow, that could also include powering workloads in AI, HPC and other forms of high-value compute. Core Scientific is one of the...
-
Senior AI-HPC Cluster Engineer
2 weeks ago
Austin, United States NVIDIA Full timeNVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by...
-
Senior AI-HPC Cluster Engineer
4 weeks ago
Austin, United States NVIDIA Full timeNVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by...
-
Product Manager, AI/HPC
1 week ago
Austin, Texas, United States Meta Full timeSummary: We are hiring a Product Manager to help build the next generation of Meta's AI infrastructure. The ideal candidate for this role will be passionate about identifying, delivering and scaling the next generation of AI systems (hardware accelerators, compute, network fabrics) and have a proven track record of effectively guiding product teams,...
-
HPC Senior DevOps Engineer
3 days ago
Austin, United States NXP Semiconductors Full timeHPC DevOps Engineer Austin, US (Hybrid) This is what you will do as HPC DevOps engineer at NXP You are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility...
-
HPC Senior DevOps Engineer
5 days ago
Austin, United States NXP Semiconductors N.V. Full timeHPC DevOps Engineer Austin, US This is what you will do as HPC DevOps engineer at NXP You are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility for all...
-
Systems Admin
2 weeks ago
Austin, United States Kaygen Full timeKAYGEN is an emerging leader in providing top talent for technology based staffing services. We specialize in providing high-volume contingent staffing, direct hire staffing and project based solutions to companies worldwide ranging from startups to Fortune 500 and Managed Service Providers (MSP) across a wide variety of industries. Job Title: Systems...
-
Systems Admin
7 days ago
Austin, United States Kaygen Full timeKAYGEN is an emerging leader in providing top talent for technology based staffing services. We specialize in providing high-volume contingent staffing, direct hire staffing and project based solutions to companies worldwide ranging from startups to Fortune 500 and Managed Service Providers (MSP) across a wide variety of industries. Job Title: Systems Admin...
-
PLM Teamcenter Admin
2 weeks ago
Austin, United States Big Apple Infotech Full timeHi,Greetings from Bigapple Infotech,Hope you are doing well, Title: PLM Teamcenter AdminLocation: Austin, TX or Sunnyvale, CAContract Required QualificationsCandidate must be located within commuting distance of Atlanta, GA or willing to relocate.Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years...
-
HPC Senior DevOps Engineer
7 days ago
Austin, United States NXP Semiconductors Full timeHPC DevOps EngineerAustin, US (Hybrid)This is what you will do as HPC DevOps engineer at NXPYou are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility for...
-
HPC Senior DevOps Engineer
1 week ago
Austin, United States NXP Semiconductors Full timeHPC DevOps EngineerAustin, US (Hybrid)This is what you will do as HPC DevOps engineer at NXPYou are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility for...
-
Systems Engineer
4 weeks ago
Austin, United States Hudson River Trading Full timeHudson River Trading (HRT) is looking for Systems Engineers to join our growing Research & Development team. This team builds and maintains an exceptionally large and growing distributed compute cluster, a petabyte-scale storage layer, operating systems, automation software, and development tools. Much of our hardware layer and operating system layer reflect...
-
Austin, United States Advanced Micro Devices , Inc. Full timeOverview: WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....
-
High Performance Computing Systems Engineer
4 weeks ago
Austin, Texas, United States Selby Jennings Full timeAn Elite Proprietary trading firm, based in Austin, is seeking an experienced High-Performance Computing (HPC) engineer who can make significant contributions to their team. Key Responsibilities:Design, optimize, and support high-performance computing and storage solutions in a hybrid cloud environment.Directly support the Research domain, driving innovation...