High-Performance Computing Datacenter Engineer

2 weeks ago


Dallas, Texas, United States Alcority Full time
About the Role:

As an HPC Datacenter Engineer at Alcority, you will be responsible for designing and implementing state-of-the-art datacenter infrastructure solutions that support high-performance computing and scientific research. You will collaborate with cross-functional teams to understand their requirements and create efficient and scalable datacenter solutions. Your expertise in HPC technologies will be instrumental in driving innovation and optimizing performance within the datacenter environment.

Responsibilities:
  • Datacenter Architecture Design: Develop and refine datacenter architecture blueprints and guidelines, considering performance, scalability, security, and efficiency aspects. Design and implement solutions for compute, storage, networking, and cooling infrastructure that align with HPC requirements.
  • HPC Infrastructure Optimization: Continuously evaluate and enhance the datacenter infrastructure to maximize HPC performance and resource utilization. Identify and address potential bottlenecks and performance gaps, employing industry best practices and cutting-edge technologies.
  • System Integration and Deployment: Collaborate with system administrators and engineers to ensure seamless integration and deployment of HPC systems. Oversee hardware and software installation, configuration, and testing activities.
  • Research and Evaluation: Stay up to date with emerging HPC technologies, tools, and methodologies. Conduct research and feasibility studies on new hardware and software solutions to enhance datacenter capabilities. Evaluate vendor offerings and provide recommendations for procurement.
  • Performance Monitoring and Troubleshooting: Monitor and analyze datacenter performance metrics to identify issues and implement necessary optimizations. Troubleshoot complex system problems, working closely with technical teams to ensure efficient resolution and minimal impact on operations.
  • Security and Compliance: Collaborate with security teams to design and implement robust security measures within the datacenter infrastructure. Ensure compliance with relevant industry standards and regulations, such as HIPAA or GDPR, in data handling and storage.
  • Documentation and Reporting: Create comprehensive technical documentation, including architectural diagrams, standard operating procedures, and configuration guidelines. Prepare regular reports on datacenter performance, capacity planning, and future infrastructure requirements.
  • Team Collaboration and Leadership: Collaborate effectively with cross-functional teams, fostering a culture of knowledge sharing and innovation. Provide technical leadership and mentorship to junior team members, guiding them in adopting best practices and enhancing their skill sets.
Requirements:
  • Bachelor's or master's degree in computer science, engineering, or a related field, or equivalent experience
  • Minimum 5 years of experience as an HPC engineer or similar role, with a strong focus on engineering and optimization
  • In-depth knowledge of HPC technologies, including parallel computing, distributed storage systems, job scheduling frameworks, InfiniBand and Ethernet networking, and GPU acceleration
  • Experience with ZFS and NiFi is a plus
  • Experience with automation tools: Python, Ansible, Puppet/Chef
  • Experience with monitoring tools: Prometheus, Ganglia, Nagios, SNMP, and Telegraf
  • Experience with CFD (Computational Fluid Dynamics) workloads and associated HPC optimization is a plus
  • Must have familiarity with industry-standard tools and software used in HPC environments, such as Slurm, PBS Pro, Lustre, GPFS, OpenStack, and containerization technologies (e.g., Docker, Kubernetes)
  • Strong problem-solving and analytical skills, with the ability to identify and resolve complex technical issues
  • Excellent communication and interpersonal skills, with the ability to collaborate effectively with diverse teams and stakeholders
  • Detail-oriented mindset with a strong focus on documentation and adherence to standards
  • Familiarity with security protocols and compliance requirements in the context of datacenter operations
  • Ability to adapt to a fast-paced and rapidly evolving technological landscape
Benefits & Perks:
  • Time Off: 25 days of PTO for full-time employees and 12 company holidays
  • Company Paid Benefits: Life insurance, Short-term disability, Long-term disability, Paid parental leave, Employee Assistance Program, and medical insurance in our high deductible health plan
  • Optional Employee Paid Benefits: Medical insurance in our EPO plan, Dental benefits, and Vision benefits. We also offer Health Savings Accounts, Flexible Spending Accounts, Supplemental Life insurance, and more
  • 401(k): Eligible after 60 days. Discretionary company match of 50% up to the first 6% of contributions
EQUAL OPPORTUNITY EMPLOYER

Alcority is an equal employment opportunity employer. The company's policy is not to discriminate against any applicant or employee based on race, color, religion, national origin, gender, age, sexual orientation, gender identity or expression, marital status, mental or physical disability, genetic information, or any other basis protected by applicable law. The company also prohibits harassment of applicants or employees based on any of these protected categories.
