High Performance Computing

3 weeks ago


Atlanta, United States Brooksource Full time

HPC Engineer (HPC and AWS Environment)

100% Remote (9AM-5PM EST Work Hours)

Direct Hire (Full-Time Employment)

We are hiring a High-Performance Computing (HPC) Engineer with experience working in a hybrid on-premises HPC and AWS cloud environment. As an HPC Engineer, you will join an innovative HPC team responsible for configuring, integrating, and managing HPC clusters on AWS cloud for our prestigious client, a private research university based in Atlanta, GA.

You will play a pivotal role in supporting the university's hybrid on-prem HPC infrastructure and AWS cloud-based HPC, while continually expanding and integrating HPC clusters with AWS services to meet the growing scientific computing needs of its researchers. This work allows researchers to run computationally intensive workloads more quickly and securely, particularly in the multi-disciplinary field of Artificial Intelligence (AI).

Key Responsibilities:

  • Design, implement, and maintain high-performance computing (HPC) infrastructure on both AWS cloud and on-premises platforms.
  • Manage HPC clusters on AWS cloud using AWS ParallelCluster and all related AWS services, including Amazon EC2, AWS CloudFormation, Amazon FSx, and Amazon EFS.
  • Implement and optimize the use of Slurm cluster management software for efficient HPC job scheduling and management.
  • Collaborate with researchers and faculty to understand their scientific computing and machine learning (ML) needs and provide tailored solutions.
  • Actively seek to understand the latest AI research computing requirements and plan infrastructure upgrades to keep up with evolving trends.
  • Provide training, scripting assistance, software installation, and technical troubleshooting services to end users.
  • Document use cases, reusable patterns, and technical guidelines.
  • Ensure quality outcomes through best practices in security, infrastructure as code, streamlined release processes, and thorough testing and validation.

Minimum Requirements:

  • 3+ years of experience in Linux administration.
  • 2+ years as an HPC Engineer with HPC cluster user support and troubleshooting.
  • 1+ year of AWS cloud infrastructure experience with AWS services used for managing HPC clusters, including AWS ParallelCluster, EC2, CloudFormation, FSx, and EFS.
  • Experience with Slurm cluster management software.
  • Scripting experience with Python or Bash, as well as related tools such as Ansible and Git.
  • Knowledge of scientific computing and machine learning.

Preferred Qualifications:

  • Experience working with researchers within an academic, research, or scientific institution.
  • Experience with specialized computing, including GPU utilization, parallelization, and DevOps aspects such as containerization and automation.
  • Knowledge of scientific data, bioinformatics packages, big data analysis methods, and machine learning algorithms.
  • AWS Certified Solutions Architect certification.




  • Atlanta, United States Page Mechanical Group Inc Full time

    About Our Company: Delmock Technologies, Inc. (DTI) is seeking a Performance Engineer to explore exciting career opportunities. DTI is a leading HUBZone business in Baltimore, known for delivering innovative IT and Health solutions with a commitment to ethics, excellence, and superior customer service. At DTI, we balance continuous growth and innovation with a...


  • Atlanta, United States Snowflake Computing Full time

    **Director, Customer Experience Engineering** Location Atlanta, Georgia, USA Category Engineering REQ4840 JOB DESCRIPTION There is only one Data Cloud. Snowflake's founders started from scratch and designed a data platform built for the cloud that is effective, affordable, and accessible to all data users. But it didn't stop there. They engineered Snowflake to...


  • Performance Engineer

    1 month ago


    Atlanta, United States BOO Full time

    Boo: Performance Engineer – Remote Boo is a personality-based social/dating app that allows you to deeply understand anyone and connect with people who intuitively understand you. Role Description: We are seeking a skilled Performance Engineer to join our team. The Performance Engineer will be responsible for ensuring the high performance and scalability...


  • Atlanta, United States Snowflake Computing Full time

    Build the future of data. Join the Snowflake team. Snowflake Support is committed to providing high-quality resolutions to help deliver data-driven business insights and results. We are a team of subject matter experts collectively working toward our customers' success. We form partnerships with customers by listening, learning and building connections. ...


  • Atlanta, Georgia, United States Howmet Aerospace Full time

    Howmet Aerospace is hiring a Computer AI Engineer for our Research and Development facility in Whitehall, Michigan. The primary responsibilities include: Working with stakeholders throughout the organization to identify opportunities for leveraging artificial intelligence and machine vision systems to drive business solutions. Developing vision models and data...


  • Atlanta, United States Snowflake Computing Full time

    Build the future of data. Join the Snowflake team. Snowflake Support is committed to providing high-quality resolutions to help deliver data-driven business insights and results. We are a team of subject matter experts collectively working toward our customers' success. We form partnerships with customers by listening, learning, and building connections....


  • Atlanta, United States Spearsoft Tech Solutions Pvt. Ltd. Full time

    Title: Performance Test Engineer (Need Strong UI Coding Experience and JavaScript) Location: Remote Duration: 6+ Months Company Overview Spearsoft Tech Solutions Pvt. Ltd. is a dynamic start-up specializing in providing high-quality IT solutions to clients at a competitive price. We are also involved in designing and developing cutting-edge robots. With...


  • Atlanta, United States Snowflake Computing Full time

    Build the future of data. Join the Snowflake team. Snowflake Support is committed to providing high-quality resolutions to help deliver data-driven business insights and results. We are a team of subject matter experts collectively working toward our customers' success. We form partnerships with customers by listening, learning, and building connections. ...


  • Atlanta, GA, United States Crash Champions Full time

    Crash Champions is one of the fastest growing and most exciting brands in the collision repair industry. The company is the largest founder-led multi-shop operator (MSO) of high-quality collision repair service in the U.S., serving customers and business partners at more than 600 state-of-the-art repair centers in 36 states across the U.S. Crash Champions was...

  • Computer Science

    4 weeks ago


    Atlanta, United States Purpose Built Schools Atlanta Full time

    Job Description: Computer Science, Atlanta, GA, US. Who is Purpose Built Schools Atlanta? Purpose Built Schools, in partnership with Atlanta Public Schools (APS), operates three neighborhood schools in south Atlanta: Slater Elementary School, Price Middle School and Carver STEAM Academy (high school). Purpose Built Schools works to...

  • Performance Analyst

    1 month ago


    Atlanta, Georgia, United States SCP Health Full time

    Description: At SCP Health, what you do matters. As part of the SCP Health team, you have an opportunity to make a difference. At our core, we work to bring hospitals and healers together in the pursuit of clinical effectiveness. With a portfolio of over 8 million patients, 7500 providers, 30 states, and 400 healthcare facilities, SCP Health is a leader in...


  • Atlanta, United States Performance Foodservice Full time

    Company Description Performance Foodservice, PFG's broadline distributor, maintains a unique relationship with a variety of local customers, including independent restaurants and hotels, healthcare facilities, schools, and quick-service eateries. A team of sales reps, chefs, consultants, and other experts builds close relationships with customers -...


  • Atlanta, United States The Salvation Army USA Southern Territory Full time

    The Salvation Army South Territorial Headquarters, 1424 Northeast Expy, Atlanta, GA 30329, USA Req #30628 Monday, April 22, 2024 About this opportunity: This position is responsible for providing technical support to Territorial Headquarters staff in the use/application of computer technology for the daily functional activities of their respective...


  • Atlanta, United States Totally Joined for Achieving Collaborative Techniques Full time

    About Us: Totally Joined For Achieving Collaborative Techniques (TJFACT) is a minority-owned, CVE-verified Service Disabled Veteran Owned Small Business (SDVOSB) performance driven professional services government contracting company that provides a broad spectrum of services and solutions to the U.S. government agencies and organizations. About the...

  • Insurance Sales Agent

    2 weeks ago


    Atlanta, United States InCite Performance Group Full time

    Job Description: We are eager to take our business to the next level by hiring an experienced insurance agent with a proven track record of maintaining and growing customer portfolios. You’ll hone your sales skills by forging strong relationships that serve as the foundation for our firm’s prestige, and we’ll give you the support you need...


  • Atlanta, United States Avery Partners Full time

    Manager, End-User Computing: The End-User Computing Manager is responsible for the supervision, technical development, and guidance of the End-User Computing team across the organization. This role manages the day-to-day activities of the corporate offices, the geographically distributed End-User Computing team, and end-user devices across all locations. Manage...


  • Atlanta, United States Datum Technologies Group Full time

    Role: Computer System Analyst Location: Atlanta, GA Duration: Full Time Qualifications: Bachelor's degree in CS or MIS strongly preferred; five (5) years of relevant technical experience is an acceptable substitute. A minimum of five (5) to eight (8) years of service (depending on job level), providing technical and organizational support for in-scope...


  • Atlanta, United States Datum Software, Inc. Full time

    Role: Computer System Analyst Location: Atlanta, GA or Birmingham, AL Duration: Full Time Qualifications: Bachelor's degree in CS or MIS strongly preferred; five (5) years of relevant technical experience is an acceptable substitute. A minimum of five (5) to eight (8) years of service (depending on job level), providing technical and organizational support for...


  • Atlanta, United States Datum Software, Inc. Full time

    Job Details: Job Title: Computer System Analyst Duration: Long-Term Contract Location: Atlanta, GA || On-Site Job Description: Qualifications: Bachelor's degree in CS or MIS strongly preferred; five (5) years of relevant technical experience is an acceptable substitute. A minimum of five (5) to eight (8) years of service (depending on job level), providing...