Current jobs related to Lead HPC Systems Engineer - Remote, Oregon - St. Jude Children's Research Hospital


  • Remote, Oregon, United States St. Jude Children's Research Hospital Full time

    Job SummaryWe are seeking a highly skilled Senior HPC Infrastructure Engineer to join our team at St. Jude Children's Research Hospital. As a key member of our research computing infrastructure team, you will be responsible for designing, implementing, and optimizing our state-of-the-art HPC clusters and servers.Key Responsibilities:Lead the architecture,...


  • Remote, Oregon, United States Sterling Engineering Inc. Full time

    Job OverviewSterling Engineering Inc. is seeking a highly skilled Safety Instrumentation Systems Engineer to join our team. As a key member of our team, you will be responsible for ensuring the safe and efficient operation of our industrial processes.Key ResponsibilitiesCollaborate with the Maintenance department to maintain and troubleshoot installed Safety...


  • Remote, Oregon, United States Parsons Corporation Full time

    About the Role:We are seeking a highly skilled Electrical Security Systems Engineer to join our team at Parsons Corporation. In this role, you will play a key part in designing and implementing complex electronic security systems for our clients.Key Responsibilities:Develop and implement technical solutions for electronic security systems, ensuring...

  • Electrical Engineer

    2 weeks ago


    Remote, Oregon, United States Sargent & Lundy Full time

    Job SummaryAs a Senior Electrical Engineer - Design Manager at Sargent & Lundy, you will lead a team of engineers in designing and implementing electrical engineering systems for power plants. You will be responsible for managing project scope, budget, and schedule, as well as ensuring that all deliverables meet technical, quality, and financial...


  • Remote, Oregon, United States Genesis10 Full time

    Job DescriptionGenesis10 is seeking an Electrical Engineer to join our team in Milwaukee, WI.Job Summary:We are looking for an experienced Electrical Engineer to provide professional engineering support for reviewing, designing, troubleshooting, managing, and inspecting current and proposed electrical systems. The ideal candidate will have a strong...


  • Remote, Oregon, United States NVIDIA Full time

    NVIDIA is a leader in computer graphics, artificial intelligence, and accelerated computing. We are seeking a Senior Cloud Infrastructure and DevOps Solutions Architect to join our team.The ideal candidate will have a strong background in cloud computing platforms, such as AWS, Azure, and Google Cloud, and experience with job scheduling workloads and...

  • Java Lead Developer

    3 weeks ago


    Remote, Oregon, United States Softcom Systems Inc Full time

    Job Title: Java LeadWe are seeking a highly skilled Java Lead to join our team at Softcom Systems Inc. The ideal candidate will have extensive experience in Java development, with a strong focus on Spring Boot Microservices and Loyalty (Crowdtwist) experience.Key Responsibilities:Proven expertise in Java development with a strong focus on Spring Boot...


  • Remote, Oregon, United States Plato Systems Full time

    About UsWe are a cutting-edge startup, Plato Systems, revolutionizing perception systems for autonomy. Our team of experts, based in the San Francisco Bay Area, has a proven track record of delivering innovative solutions in signal processing and machine learning.Our core product is the result of 5+ years of university R&D by our co-founders. We are...


  • Remote, Oregon, United States Unreal Gigs Full time

    Transform Industries with Advanced RoboticsAt Unreal Gigs, we're seeking a talented Robotics Engineer to join our team and push the boundaries of robotics technology. As a key member of our robotics team, you'll design, develop, and deploy advanced robotic systems that transform industries.Key Responsibilities:Robotic System Design and Development:Design and...


  • Remote, Oregon, United States EOS Aircraft Inc Full time

    Job Title: Sr. Principal Engineer System Integration Staff ScientistEOS Aircraft, Inc. is seeking a highly experienced and skilled Sr. Principal Engineer System Integration Staff Scientist to join our Advance Design Group. This role will provide guidance to the engineering team in system development, testing, and integration of new technology and define the...


  • Remote, Oregon, United States DraftKings Full time

    About the RoleWe're seeking a highly skilled Senior Lead Software Engineer, Android to lead our engineering teams and influence our product roadmap. As a key member of our Android development team, you'll be responsible for designing, developing, and maintaining high-quality Android apps written primarily in Kotlin, with SQLite databases and other NoSQL...

  • Electrical Engineer

    4 weeks ago


    Remote, Oregon, United States Symbotic Full time

    About the RoleWe are seeking an experienced Electrical Engineer to join our Industrial Controls team within the Hardware R&D organization. As a key member of our team, you will be responsible for designing, developing, and implementing advanced electrical systems for our warehouse automation platform.Key ResponsibilitiesDesign and develop industrial...

  • Lead AI ML Engineer

    3 days ago


    Remote, Oregon, United States Burq, Inc. Full time

    About Burq, Inc.Burq, Inc. is a leading provider of innovative delivery solutions. We're on a mission to simplify the complex process of offering delivery, and we're looking for a talented Lead AI ML Engineer to join our team.As a Lead AI ML Engineer at Burq, Inc., you will play a crucial role in designing, developing, and deploying machine learning models...


  • Remote, Oregon, United States GE Vernova Full time

    The Embedded Software Engineer for I&C Systems at GE Vernova works within the I&C Engineering team of the GE Vernova Engineering organization.The I&C team is responsible for designing and implementing I&C electronic hardware and software for I&C systems for Nuclear Power Plants.This role requires technical problem solving, strong software skills, and...


  • Remote, Oregon, United States GE Vernova Full time

    Job Description SummaryThe Embedded Software Engineer for I&C Systems at GE Vernova works within the I&C Engineering team of the GE Vernova Engineering organization.The I&C team is responsible for designing and implementing I&C electronic hardware and software for I&C systems for Nuclear Power Plants.This role requires technical problem solving, strong...

  • IT Systems Manager

    3 days ago


    Remote, Oregon, United States Unreal Gigs Full time

    Unlock Your Potential as an IT Systems Manager at Unreal GigsAre you a seasoned IT professional looking to take your career to the next level? Do you have a passion for leading and managing IT infrastructure teams? We're seeking a highly skilled and experienced IT Systems Manager to join our dynamic team at Unreal Gigs.Key Responsibilities:Systems Design and...

  • Data Engineer

    4 weeks ago


    Remote, Oregon, United States Dealer Tire LLC Full time

    About Dealer TireDealer Tire is a family-owned, international distributor of tires and parts established in 1918 in Cleveland, OH. We're laser focused on helping the world's largest and most trusted auto manufacturers grow their tire business—in fact, we've sold more than 60 million tires to date. We're a thriving company, and we're looking for driven...

  • Chief Engineer

    2 months ago


    Remote, Oregon, United States Parsons Corporation Full time

    Job Summary:We are seeking a highly skilled Chief Engineer to join our team at Parsons Corporation. As a key member of our Federal Solutions segment, you will play a critical role in delivering resources to our US government customers.Key Responsibilities:Lead the development of win themes, discriminators, and strategies to capture new business...


  • Remote, Oregon, United States Intelerad Full time

    Job OverviewIntelerad is seeking a highly skilled System Technology Specialist to join our team. As a key member of our support team, you will be responsible for resolving complex technical issues and providing exceptional customer service to our clients.The ideal candidate will have a strong background in technical support, with experience in medical...


  • Remote, Oregon, United States System Innovation Full time

    Job Title: Modernization Chief EngineerSystem Innovation is seeking a highly skilled Modernization Chief Engineer to join our team. As a key member of our organization, you will play a critical role in driving innovation and transformation within the DoD Test and Evaluation Enterprise.Key Responsibilities:Lead a team of modernization experts to develop and...

Lead HPC Systems Engineer

2 months ago


Remote, Oregon, United States St. Jude Children's Research Hospital Full time

Overview:

As a pivotal member of our innovative team, the Senior HPC Infrastructure Engineer will be instrumental in advancing our high-performance computing (HPC) and artificial intelligence (AI) frameworks. This role focuses on the design, implementation, and enhancement of our sophisticated HPC clusters and servers, ensuring optimal performance and reliability in our research computing environment.

Key Responsibilities:

  • Architect and develop cutting-edge HPC/AI systems to facilitate transformative research initiatives.
  • Manage the continuous monitoring, support, and upkeep of HPC/AI clusters to guarantee maximum efficiency and dependability.
  • Lead system enhancements, customizations, and integration efforts with database administrators, software developers, network operations, and data center teams.
  • Oversee a variety of computer systems and application software, ensuring they operate at peak functionality and efficiency.
  • Provide ongoing support and monitoring of our research computing infrastructure, delivering exceptional service around the clock.

What We Offer:

  • Opportunity to engage with state-of-the-art technology in a collaborative and dynamic setting.
  • A role that significantly contributes to the success of pioneering research projects.
  • Collaboration with leading professionals across multiple disciplines.

If you possess a passion for HPC technology and excel in a fast-paced, innovative environment, we invite you to consider this opportunity.

Job Responsibilities:

  • Oversee the configuration and management of IT infrastructure to meet diverse requirements, including data retention, security, business continuity, and disaster recovery.
  • Assess the efficiency and effectiveness of infrastructure service delivery methods and procedures.
  • Lead and manage internal infrastructure in accordance with established regulations and standards.
  • Implement and monitor incident/problem management and disaster recovery for infrastructure support.
  • Provide current systems usage statistics and future growth projections based on demand.
  • Collaborate with internal teams to establish prioritization, metrics, and processes for capacity planning and infrastructure availability.
  • Present capacity planning and performance reports to senior leadership during meetings.
  • Benchmark, analyze, and recommend improvements for IT infrastructure.
  • Perform additional duties as assigned to achieve departmental and institutional goals.
  • Maintain regular and predictable attendance.

Minimum Education and/or Training:

  • Bachelor's degree in Computer Science, Engineering, Business, or a related field is required.
  • A Master's degree is preferred.

Minimum Experience:

  • A minimum of four (4) years of IT experience, particularly in infrastructure operations and engineering environments.
  • Experience with Red Hat Enterprise Linux (RHEL) is highly preferred.
  • Proficiency in supporting Linux within a high-performance computing (HPC) cluster and research computing environment is highly preferred.
  • Experience managing an HPC cluster is essential.
  • Familiarity with Slurm and/or LSF is highly preferred.
  • Experience with Kubernetes (e.g., Rancher, OpenShift, etc.) is a plus.
  • Experience with HPC cluster management tools is highly preferred.
  • Knowledge of IBM Spectrum Scale (GPFS) is required; experience with Lustre is a plus.
  • Experience with Message Passing Interface (MPI) is highly preferred.
  • Familiarity with networking technologies such as InfiniBand, Ethernet, and TCP/IP is highly preferred.
  • Experience with HPE Aruba Ethernet switches is preferred.
  • Experience with NVIDIA GPUs is required; experience with AMD GPUs is a plus.
  • Knowledge of NVIDIA GPUDirect Storage is a plus.
  • Advanced understanding of HPC technologies and principles is essential.
  • Strong knowledge of Linux security and shell scripting is required.
  • Demonstrated performance in a comparable role is necessary.

Compensation:

In accordance with U.S. state and municipal pay transparency laws, a reasonable estimate of the compensation range for this role is $94,640 - $169,520 per year for the position of Senior HPC Infrastructure Engineer.

Diversity, Equity, and Inclusion:

St. Jude Children's Research Hospital is committed to a diverse, global workforce built on the principles of diversity, equity, and inclusion. Our mission is to advance cures and means of prevention for pediatric catastrophic diseases through research and treatment.

No Search Firms:

St. Jude Children's Research Hospital does not accept unsolicited assistance from search firms for employment opportunities.