HPC System Admin

3 weeks ago


Austin, United States NR Consulting Full time

Job Title: HPC System Admin
Work Location: Austin, TX
Position Type: Contract with possible extension
Duration: 12 + Months

Job Description:
Project Details:
Responsible for architecting and implementing Linux High Performance Computing (HPC) clusters. Performs system architecture duties on a Linux High performance computing (HPC) cluster including cluster management, virtualization, cluster usage monitoring, health monitoring, job scheduling, application integration/installation (open source as well as vendor supported), and application performance. Improve cluster performance through kernel changes, firmware updates, library stack changes, and application container management such as docker.

Mandatory Skills and Technologies, framework, and Methodologies:
Knowledge of Linux and UNIX operating systems, including scripting and programming proficiencies.
Experience with cloud bursting technologies.
Knowledge of cloud services like AWS SCOCA, Parallel Cluster, and Azure CycleCloud
Knowledge of HPC tools and storage: AWS Elastic Fabric Adapter, Azure ANF, Apache Spark, or Apache Ignite, Lustre, BeeFS
Demonstrate experience in programming system maintenance tasks in C, Java, Perl, batch/shell, or another general-purpose programming language.
Knowledge of NUMA and understanding of NUMA related APIs.
Be able to perform complex performance analysis including system processes, I/O subsystems, networks and other related components.
Must have experience with multi-threading and parallel processing tools and environments.
Must have experience as a systems administrator. Must have advanced ability to analyze complex IT systems.
Experience with high-performance servers and associated high-performance networks.
Experience installing and maintaining clustered environments, including automated installation methods.
Knowledge of common server hardware architectures including servers (CPU, bus, memory), SANS, disk arrays, network hardware.
Understanding of Red Hat Linux Operating system including processes, files, memory management and I/O systems; networking services and protocols (e.g., TCP/IP, SSL, FTP, Telnet, LDAP).
Understanding of IP networking, basic routing, TCP ports and network services, including SSH, LDAP, SFTP and HTTP(S). Ability to design, promote, and implement change control and configuration management, patch management, high availability systems, structured design and support methodologies.
Must be organized with a strong ability to deliver tasks on time, manage multiple efforts and be able to work with minimal supervision.
Demonstrated ability to proactively learn, adapt to and use new hardware/software technologies.

Good to have skills, Technologies, framework, and Methodologies
Performs system administration duties on a linux HPC Cluster, cluster management, virtualization, cluster usage monitoring, health monitoring, job scheduling, and application integration/installation.
Responsible for system implementation/integration and systems performance analysis.
Manages hardware and software applications in the production environment provided to HPC users.
Install software and updates
Coordinates with vendors to resolve hardware and software problems in HPC Cluster.
Facilitates the acquisition of hardware and software products and services for the HPC Cluster.
Knowledge of LSF or other open-source job schedulers.
Compile, configure, and integrate open source applications into HPC environment.
ble to learn and use internal software systems.
Monitors the availability of patches and updates and evaluates the importance to the environment and schedules installations accordingly.
Keeps abreast of the latest HPC hardware and software technology, evaluating technologies as needed.
Designs, implements and administers high performance computing cluster, performing proof of concepts such as software containers (ex. Docker).
Interacts effectively with a broad range of colleagues such as Applied Materials researchers and other IT staff.
Other duties may be assigned.



  • Austin, United States ShiftCode Analytics Full time

    Visa: USC, GC or GC-EAD Duration: 9 months with potential extension Location: Onsite in Austin, TX They'll give preference to someone who is currently local to Austin and then will consider people willing to relocate. Requirements: -Experience with HPC Systems environments and Infrastructures technologies and workloads -B.S. in CS/CE/EE, or at...

  • HPC Engineer

    1 week ago


    Austin, United States Optiver Full time

    Optiver is seeking a Research Infrastructure Engineer to contribute significantly to the development and management of our research infrastructure across both on-premises and cloud platforms. This role involves hands-on work in scaling and supporting high-performance computing (HPC) and storage systems, which are critical for our growing demand in research...

  • HPC Engineer

    5 days ago


    Austin, United States Optiver Full time

    Optiver is seeking a Research Infrastructure Engineer to contribute significantly to the development and management of our research infrastructure across both on-premises and cloud platforms. This role involves hands-on work in scaling and supporting high-performance computing (HPC) and storage systems, which are critical for our growing demand in research...

  • HPC Network Engineer

    1 month ago


    Austin, United States Algo Capital Group Full time

    HPC Network EngineerStanding at the forefront of industry excellence, this Algorithmic Trading firm is renowned for its cutting-edge technology and innovative approach to financial markets. Leveraging sophisticated algorithms and high-performance computing systems, they execute trades with precision and efficiency; pioneering new strategies and techniques to...

  • HPC Network Engineer

    1 month ago


    Austin, United States Algo Capital Group Full time

    HPC Network EngineerStanding at the forefront of industry excellence, this Algorithmic Trading firm is renowned for its cutting-edge technology and innovative approach to financial markets. Leveraging sophisticated algorithms and high-performance computing systems, they execute trades with precision and efficiency; pioneering new strategies and techniques to...

  • HPC Network Engineer

    2 weeks ago


    Austin, United States Algo Capital Group Full time

    HPC Network EngineerStanding at the forefront of industry excellence, this Algorithmic Trading firm is renowned for its cutting-edge technology and innovative approach to financial markets. Leveraging sophisticated algorithms and high-performance computing systems, they execute trades with precision and efficiency; pioneering new strategies and techniques to...


  • Austin, United States Core Scientific Full time

    Who We Are Bold. Unapologetic. Hardworking. We are building something special. We transform energy into high-value compute with superior efficiency at scale. Today, that means powering and securing the Bitcoin Network. Tomorrow, that could also include powering workloads in AI, HPC and other forms of high-value compute. Core Scientific is one of the...


  • Austin, United States NVIDIA Full time

    NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by...


  • Austin, United States NVIDIA Full time

    NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by...


  • Austin, Texas, United States Meta Full time

    Summary: We are hiring a Product Manager to help build the next generation of Meta's AI infrastructure. The ideal candidate for this role will be passionate about identifying, delivering and scaling the next generation of AI systems (hardware accelerators, compute, network fabrics) and have a proven track record of effectively guiding product teams,...


  • Austin, United States NXP Semiconductors Full time

    HPC DevOps Engineer Austin, US (Hybrid) This is what you will do as HPC DevOps engineer at NXP You are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility...


  • Austin, United States NXP Semiconductors N.V. Full time

    HPC DevOps Engineer Austin, US This is what you will do as HPC DevOps engineer at NXP You are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility for all...

  • Systems Admin

    2 weeks ago


    Austin, United States Kaygen Full time

    KAYGEN is an emerging leader in providing top talent for technology based staffing services. We specialize in providing high-volume contingent staffing, direct hire staffing and project based solutions to companies worldwide ranging from startups to Fortune 500 and Managed Service Providers (MSP) across a wide variety of industries. Job Title: Systems...

  • Systems Admin

    7 days ago


    Austin, United States Kaygen Full time

    KAYGEN is an emerging leader in providing top talent for technology based staffing services. We specialize in providing high-volume contingent staffing, direct hire staffing and project based solutions to companies worldwide ranging from startups to Fortune 500 and Managed Service Providers (MSP) across a wide variety of industries. Job Title: Systems Admin...

  • PLM Teamcenter Admin

    2 weeks ago


    Austin, United States Big Apple Infotech Full time

    Hi,Greetings from Bigapple Infotech,Hope you are doing well, Title: PLM Teamcenter AdminLocation: Austin, TX or Sunnyvale, CAContract Required QualificationsCandidate must be located within commuting distance of Atlanta, GA or willing to relocate.Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years...


  • Austin, United States NXP Semiconductors Full time

    HPC DevOps EngineerAustin, US (Hybrid)This is what you will do as HPC DevOps engineer at NXPYou are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility for...


  • Austin, United States NXP Semiconductors Full time

    HPC DevOps EngineerAustin, US (Hybrid)This is what you will do as HPC DevOps engineer at NXPYou are expected to work very closely with your global colleagues within R&D IT and help deliver the HPC services (High Performance Computing and Virtual Desktop Infrastructure) to our engineering and R&D customers. Your AMEC team has operational responsibility for...

  • Systems Engineer

    4 weeks ago


    Austin, United States Hudson River Trading Full time

    Hudson River Trading (HRT) is looking for Systems Engineers to join our growing Research & Development team. This team builds and maintains an exceptionally large and growing distributed compute cluster, a petabyte-scale storage layer, operating systems, automation software, and development tools. Much of our hardware layer and operating system layer reflect...


  • Austin, United States Advanced Micro Devices , Inc. Full time

    Overview: WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • Austin, Texas, United States Selby Jennings Full time

    An Elite Proprietary trading firm, based in Austin, is seeking an experienced High-Performance Computing (HPC) engineer who can make significant contributions to their team. Key Responsibilities:Design, optimize, and support high-performance computing and storage solutions in a hybrid cloud environment.Directly support the Research domain, driving innovation...