HPC Engineer

3 weeks ago


Dallas TX United States American Systems Corporation Full time
THIS POSITION COMES WITH A 10K SIGNING BONUS
Are you an HPC Engineer looking to be part of something that is truly unique - not just a job, but a mission? We are in search of an application engineer with specialties and background in large-scale, high-performance computing, supercomputing, and parallel processing design to build and optimize applications and tune them for speed and efficiency in an R&D setting.
AMERICAN SYSTEMS is building the next generation of high-speed analytics needed to protect our nation, and the environments needed to support them.
We need you to join us on the ground floor.
As an HPC Engineer and member of our select team, you will:
• Apply comprehensive knowledge of High Performance Computing (HPC) systems comprising high-speed, multi-petabyte Lustre file systems, Red Hat Enterprise Linux (RHEL) servers, CPU/GPU compute nodes, and high-performance storage arrays, using Ethernet, fiber, Omni-Path, and InfiniBand interconnections.
• Provide functional and technical expertise in support of user-developed software, and technical advice and leadership to other technical staff.
• Join us at an exciting time as we introduce next-generation technologies.
• Be part of a group that provides game-changing capabilities to the nation.
• Receive a robust benefits package that includes our Employee Stock Ownership Plan.
A week in the life of an HPC Engineer:
• Utilize a wide variety of skills in system and network monitoring; large-scale systems administration; scripting and automation; security compliance; network distributed services; storage and backups; and hardware and software problem diagnosis and resolution.
• Diagnose and troubleshoot technical problems, often of a complex nature, associated with computer hardware and software interrelationships and dependencies.
• Conduct needs analysis, and plan and schedule the installation of a wide variety of new or modified hardware/software.
• Develop functional and technical IT system requirements and specifications.
• Configure and optimize system tools and applications, to include job schedulers (Slurm and PBS Pro) and system resources (GitLab, Lua/Tcl modules, and system support applications).
• Create and brief technical presentations to technical and non-technical stakeholders.
• Maintain detailed documentation of system configurations, procedures, and troubleshooting guides.
• Develop user-facing documentation.
Job Requirements
• DoD Top Secret (TS) clearance with SCI eligibility
• Bachelor's degree in Computer Engineering, Computer Science, or a related field and ten or more years of job-related experience.
• Thorough knowledge of complex concepts, practices, and troubleshooting associated with HPC cluster systems design, installation, and maintenance.
• Advanced knowledge in distributed computing theory, parallel processing, applications, and associated infrastructure is required.
• Extensive experience with Linux/Unix systems including installation, configuration, networking, backups, updates and patching, data archiving, and system security.
• Functional knowledge of HPC middleware and platform managers such as Bright Cluster Manager; of employing job schedulers such as PBS, Slurm, and Torque; and of optimizing job queues.
• Experience with HPC or large-scale distributed computing environments and technologies such as high-speed, low-latency interconnects (e.g., InfiniBand), parallel file systems (e.g., Lustre), and virtualization environments and tools (e.g., VMware).
• Experience developing Python/bash/Perl scripts and employing automation frameworks such as Ansible.
• General knowledge employing Docker containers and Kubernetes ecosystems.
• Working knowledge of one or more programming languages (e.g., C/C++, Fortran).
• Must be able and willing to travel to northern Virginia approximately 25% of the time.
Founded in 1975, AMERICAN SYSTEMS is one of the largest employee-owned companies in the United States.
We are a government services contractor focused on delivering Strategic Solutions to complex national priority programs with 100+ locations worldwide.
Through our focus on quality, strong cultural beliefs, and innovation, we deliver excellence every day.
Company Awards:
• Forbes National Best Midsize Companies
• Energage National Best Workplaces, National
• Washington Post Best Workplaces
Veteran Hiring Awards:
• U.S. Department of Labor Hire Vets Medallion
• BEST FOR VETS by Military Times
• TOP 10 MILITARY FRIENDLY COMPANY by MilitaryFriendly.com
AMERICAN SYSTEMS is committed to pay transparency for our applicants and employee-owners.
The salary range for this position is $150,000 - $200,000.
Actual compensation will be determined based on several factors permitted by law.
AMERICAN SYSTEMS provides for the welfare of its employees and their dependents through a comprehensive benefits program by offering healthcare benefits, paid leave, retirement plans (including ESOP and 401k), insurance programs, and education and training assistance.
EOE Minorities/Women/Disabled/Veterans/Gender Identity/Sexual Orientation
