HPC Kubernetes Engineering Manager
3 weeks ago
**The Company**NorthMark Compute & Cloud (NMC²) is backed by dedicated leadership and investment, with a clear mission as it operates at the bleeding edge of technology. Its goal is to scale and enhance the high-performance computing (HPC) and cloud infrastructure that supports its clients' research, production, and delivery, enabling breakthroughs that shape the industries of tomorrow. Its engineers build critical infrastructure to eliminate friction in scientific research, simulations, analysis, and decision-making, accelerating discovery and driving faster innovation.**The Position**We are seeking a highly skilled Kubernetes Engineering Manager with a focus on HPC to join our Platform Engineering function in Dallas. Kubernetes underpin all facets of our Research platforms and HPC estate here atNMC². As the HPC Kubernetes Engineering Manager you will take ownership of the strategic roadmap, design and delivery of our Kubernetes platform. In addition, you will focus on continuous optimizations and performance enhancements of our kubernetes platform as Research demands augment. We are looking for a highly experienced technical manager who can lead the significant scaling up our existing compute platforms and who excels working on the bleeding edge of technology; pushing the boundaries of HPC compute performance and providing an innovative approach to solving complex technical challenges that arise. The HPC Kubernetes Engineering Manager will collaborate closely with the Kubernetes Platform Management team to ensure a smooth transition of new engineering capabilities, with a strong focus on operational excellence in all aspects of design and implementation. **Responsibilities:*** Strong leadership and strategic vision in the design, deployment and scaling of a high-performance kubernetes platform* Pro-active stakeholder engagement, ensuring the Kubernetes platform supports broader business outcomes and research demands* Confident communication and collaboration, you will help drive cross functional engineering initiatives across the Technology and Research organizations* Vendor Management experience, working closely with our key vendors providing continuous feedback to leverage and influence roadmaps and ensuring efficient and timely deployment, support and maintenance of critical platforms* People leadership, managing and developing engineers and a high performing team across the UK and US* A deep understanding of emerging trends and technologies in the Kubernetes ecosystems, working closely with Architecture and Innovation Teams to appraise and adopt* Ensuring platforms are reliable, highly available and secure, managed with a DevOps mindset and Infrastructure-as-Code toolset* Budget control, capacity forecasting and management**Requirements:*** Extensive technical experience with Kubernetes tailored for HPC/ML workloads in a complex distributed environment* Contribute to performance tuning of ML workloads across GPU/CPU clusters - optimizations for workload scheduling, GPU integration, and resource management for distributed training jobs* Experience scaling a high performance kubernetes platforms geographically at scale* Implement and manage multi-tenant compute environments ensuring isolation and performance* Integrate with distributed file systems and high-speed interconnects (e.g., InfiniBand, RoCE)* Ability to collaborate effectively across teams to deliver engineering solutions with a strong emphasis on operational excellence and seamless capability handover* Confident stakeholder management and communication skills, aligning to value driven outcomes* Excellent team leadership, project management skills and promoting a high performance culture* Drive engineering best practices across CI/CD, automation & tooling, configuration management and SRE concepts* A commitment to security by designing and building secure, high-integrity systemsNorthMark Strategies is a leading investment firm, combining capital, innovation, and engineering to drive long-term value. From operating complex businesses to backing breakthrough technologies, our mission is to build enduring businesses. Our team combines intelligent risk-taking, operational excellence, exceptional talent, and world-class computing capacity to create shareholder value.Our company offers a dynamic environment where individuals have the freedom to lead companies toward bold achievements by embracing innovation, leveraging technology, and fostering differentiated business strategies. Our values are Integrity, Ability, and Energy, and the company aims to hire individuals who possess those qualities.At NorthMark Strategies, we believe the future isn’t something to hope for, it’s something to build. We don’t just invest, we create. Bringing together strategic insight and technical horsepower to deliver outcomes that endure.
#J-18808-Ljbffr
-
HPC Kubernetes Engineering Manager
2 weeks ago
Dallas, TX, United States NorthMark Strategies Full timeThe Company NorthMark Compute & Cloud (NMC²) is backed by dedicated leadership and investment, with a clear mission as it operates at the bleeding edge of technology. Its goal is to scale and enhance the high-performance computing (HPC) and cloud infrastructure that supports its clients' research, production, and delivery, enabling breakthroughs that shape...
-
Engineering Manager, HPC Kubernetes Platform
2 weeks ago
Dallas, United States NorthMark Compute & Cloud Full timeNorthMark Compute & Cloud (NMC²) is backed by dedicated leadership and investment, with a clear mission as it operates at the bleeding edge of technology. Its goal is to scale and enhance the high-performance computing (HPC) and cloud infrastructure that supports its clients' research, production, and delivery, enabling breakthroughs that shape the...
-
HPC-Kubernetes Solutions Architect
5 days ago
Dallas, Texas, United States INSPYR Solutions Full time $120,000 - $350,000 per yearTitle:HPC Kubernetes Solutions ArchitectLocation:Dallas, TXDuration:Permanent PositionCompensation: $200,000 - $350,000/yearWork Requirements:US Citizen, GC Holders or Authorized to Work in the U.S.HPC Kubernetes Solutions ArchitectAs an HPC Kubernetes Solutions Architect, you will act as a trusted advisor to customers, guiding them through the design,...
-
Kubernetes Engineer
3 days ago
Dallas, Texas, United States Broward Sheriff County Full timePosition: Kubernetes EngineerDuration: 12 Months plusLocation: Hybrid - Dallas, TXJob Description:In this role, you will design, implement, and optimise GPU-accelerated container platforms at scale, enabling high-performance workloads (AI/ML, HPC, LLM training) across hybrid or on-prem environments.You will have deep expertise with both NVIDIA and Kubernetes...
-
HPC Engineer
3 weeks ago
Dallas, United States AMERICAN SYSTEMS Full timeJoin to apply for the HPC Engineer role at AMERICAN SYSTEMSOverviewAMERICAN SYSTEMS is an employee-owned federal government contractor supporting national priority programs through our strategic solutions in the areas of Information Technology, Test & Evaluation, Program Mission Support, Engineering & Analysis, and Training.ResponsibilitiesUtilize a wide...
-
Kubernetes Engineer
3 weeks ago
Dallas, United States G-Research Full timeSenior Kubernetes EngineerDo you want to tackle the biggest questions in finance with near infinite compute power at your fingertips?G-Research is a leading quantitative research and technology firm, with offices in London and Dallas.We are proud to employ some of the best people in their field and to nurture their talent in a dynamic, flexible and highly...
-
HPC Solutions Architect Manager
6 days ago
Dallas, United States Glocomms Full timeHPC Solutions Architect Manager Dallas - Hybrid (3-days onsite) Role Overview:We are seeking an experienced technical leader to manage a team of architects focused on high-performance computing (HPC) solutions. This position involves guiding customers through the full solution lifecycle, from initial requirements to deployment and optimization, while...
-
HPC Solutions Architect Manager
6 days ago
Dallas, United States Glocomms Full timeHPC Solutions Architect Manager Dallas - Hybrid (3-days onsite) Role Overview:We are seeking an experienced technical leader to manage a team of architects focused on high-performance computing (HPC) solutions. This position involves guiding customers through the full solution lifecycle, from initial requirements to deployment and optimization, while...
-
HPC Engineer
2 weeks ago
Dallas, TX, United States Sabre Systems Full timeResponsibilitiesJob title: HPC Engineer Sabre is seeking an HPC Data Storage Engineer to support a mission-critical Department of Defense (DoD) program dedicated to high-performance computing operations. As an HPC Engineer, you will design, optimize, and maintain advanced high-performance computing environments that power large-scale data processing,...
-
HPC Solutions Architect Manager
4 days ago
Dallas, United States Glocomms Full timeHPC Solutions Architect Manager Dallas - Hybrid (3-days onsite) Role Overview:We are seeking an experienced technical leader to manage a team of architects focused on high-performance computing (HPC) solutions. This position involves guiding customers through the full solution lifecycle, from initial requirements to deployment and optimization, while...