Current jobs related to Hardware Engineer, GPU Infrastructure - Roseland - CoreWeave


  • Roseland, Nebraska, United States CoreWeave Full time

    Job DescriptionAt CoreWeave, we're seeking a highly skilled and motivated Infrastructure/Hardware Engineer to join our Hardware Provisioning team. As a key member of our team, you will play a crucial part in the design, development, and optimization of our server hardware infrastructure.Responsibilities:Develop and maintain hardware/firmware management...


  • Roseland, Nebraska, United States CoreWeave Full time

    About the RoleWe are seeking a talented and experienced Senior Software Engineer to join our Network Datapath Team. As a Senior Software Engineer, you will play a critical role in designing, developing, and maintaining the networking software/hardware that underpins our GPU cloud services. You will collaborate closely with cross-functional teams to optimize...


  • Roseland, Florida, United States CoreWeave Full time

    At CoreWeave, we're seeking a highly skilled and motivated Infrastructure/Hardware Engineer to join our Hardware Provisioning team.This role will play a crucial part in the design, development, and optimization of our server hardware infrastructure.You will collaborate closely with cross-functional teams, external vendors, and stakeholders to ensure the...


  • Roseland, Nebraska, United States CoreWeave Full time

    About CoreWeaveCoreWeave is the AI Hyperscaler, delivering a cloud platform of cutting-edge services powering the next wave of AI. The company's technology provides enterprises and leading AI labs with the most performant, efficient, and resilient solutions for accelerated computing.With a growing footprint of data centers covering every region of the US and...


  • Roseland, Nebraska, United States CoreWeave Full time

    Job DescriptionCoreWeave is a cloud provider delivering massive scale GPU compute resources on top of the industry's fastest and most flexible infrastructure. We're seeking a highly skilled and motivated Senior Systems Engineer to join our Kernel HAVOCK Team.ResponsibilitiesDevelop and maintain tooling to build custom Linux kernels and stateless OS...


  • Roseland, Florida, United States CoreWeave Full time

    About the Role:As a Solutions Architect at CoreWeave, you will play a vital and dynamic role in shaping the future of cloud computing. You will have the opportunity to demonstrate thought leadership and engage hands-on throughout our customers' entire lifecycle. From establishing their Kubernetes environment to developing proofs of concept, onboarding, and...


  • Roseland, Nebraska, United States SourcePro Search, LLC Full time

    Fantastic opportunity for a skilled professional to join our team as an Infrastructure Systems Specialist at our Roseland, NJ office.The ideal candidate will be responsible for the day-to-day administration and configuration of our IT infrastructure, including servers, cloud services, and network connectivity.Responsibilities include:Maintaining the...


  • Roseland, Nebraska, United States TEKsystems Full time

    Job OverviewTEKsystems is seeking a highly skilled Cloud Infrastructure Engineer to join our team.This is a 6-month contract to hire opportunity.The ideal candidate will have 2+ years of experience with Microsoft Azure administration, 5+ years of experience with Windows Server administration, and 3 years of experience with PowerShell scripting.Additionally,...


  • Roseland, New Jersey, United States CoreWeave Full time

    Job SummaryThe VP Product Management - Strategic Planning and Partnerships will be responsible for helping us evolve and execute our product strategy, fostering key industry partnerships, and aligning the company's product portfolio with long-term business objectives. You will collaborate with internal stakeholders, including product, engineering, and sales...


  • Roseland, Nebraska, United States CoreWeave Full time

    Job SummaryCoreWeave is a leading AI Hyperscaler, delivering a cloud platform of cutting-edge services powering the next wave of AI. We're seeking a skilled Senior Security Engineer to join our Infrastructure Security team, responsible for overseeing the security posture and security tooling around our core Kubernetes infrastructure.This is an exciting...


  • Roseland, Nebraska, United States CoreWeave Full time

    Job DescriptionCoreWeave is a pioneering cloud provider, delivering unparalleled GPU compute resources on top of the industry's fastest and most flexible infrastructure. Our cloud solutions for compute-intensive use cases — VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming — are up to 35 times faster and 80% less expensive...


  • Roseland, Nebraska, United States CoreWeave Full time

    Job Title: Cloud Security Solutions ArchitectAt CoreWeave, we're revolutionizing the cloud computing industry by putting bleeding-edge GPU technology on top of the industry's fastest and most adaptable infrastructure. We're seeking a talented Cloud Security Solutions Architect to join our team and help shape the future of cloud security.Key...


  • Roseland, New Jersey, United States CoreWeave Full time

    Job Title: Vice President of Capacity PlanningCoreWeave is a leading cloud provider, delivering massive scale GPU compute resources on top of the industry's fastest and most flexible infrastructure.We are seeking an experienced leader to join our team as the Vice President of Capacity Planning.Key Responsibilities:Develop robust statistical demand forecasts...


  • Roseland, New Jersey, United States CoreWeave Full time

    Job Title: Vice President, Capacity PlanningCoreWeave is a leading cloud provider, delivering high-performance GPU compute resources on a scalable infrastructure. We're seeking an experienced leader to join our team as the Vice President of Capacity Planning, responsible for demand forecasting, supply planning, and portfolio management of our data center and...


  • Roseland, Nebraska, United States Maintec Technologies Full time

    Job Title: Cloud Architect RoleJob Description:We are seeking a highly skilled Cloud Architect with experience in building infrastructure in Microsoft Azure. The ideal candidate will have strong knowledge of cloud computing and be able to design and implement scalable and secure cloud solutions. Key responsibilities include: Designing and implementing cloud...


  • Roseland, New Jersey, United States Saxon Global Full time

    Job Title: Cloud Engineer for ADP's AI/ML PlatformJob Description:We are seeking a highly skilled Cloud Engineer to join our team at Saxon Global. As a Cloud Engineer, you will be responsible for supporting applications within ADP's AI/ML platform, which is currently transitioning from on-premise to Azure cloud.Key Responsibilities:Design and implement Azure...


  • Roseland, Florida, United States TEKsystems Full time

    Job DescriptionTEKsystems is seeking a skilled Cloud Infrastructure Specialist to join our team. The ideal candidate will have extensive experience in Microsoft Azure administration, Intune, and Windows Server administration. Additionally, proficiency in PowerShell scripting is required. The successful candidate will possess a strong background in cloud...


  • Roseland, New Jersey, United States AGM Tech Solutions Full time

    AGM Tech Solutions is seeking a highly skilled Data Center Engineer to join our team. As a key member of our team, you will be responsible for configuring and maintaining our data center infrastructure, with a focus on Nexus and Arista suite expertise. Your expertise in routing and switching will be crucial in ensuring the smooth operation of our data...


  • Roseland, Nebraska, United States CoreWeave Full time

    About the RoleAs the VP of Product Management - AI Services, you will lead CoreWeave's strategic efforts in shaping and developing cutting-edge AI services that drive innovation and business outcomes for our customers.You will lead a team of Product Managers and collaborate closely with engineers and designers to create world-class products that empower...


  • Roseland, Nebraska, United States TEKsystems Full time

    Job Title: Systems AdministratorTEKsystems is seeking a highly skilled Systems Administrator to join our team.Key Responsibilities:1. Administer Microsoft Azure infrastructure, with a focus on Intune and Windows Server administration.2. Develop and implement PowerShell scripts to automate tasks and improve system efficiency.3. Collaborate with...

Hardware Engineer, GPU Infrastructure

5 months ago


Roseland, United States CoreWeave Full time
Job DescriptionJob Description

CoreWeave is a specialized cloud provider, delivering a massive scale of GPU compute resources on top of the industry's fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute intensive use cases — VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming — that are up to 35 times faster and 80% less expensive than the large, generalized public clouds. Learn more at www.coreweave.com.

CoreWeave is seeking a highly skilled and motivated Infrastructure/Hardware Engineer, focusing on GPU and PCIe troubleshooting, to join our Hardware Engineering team, reporting to the Director of Compute Architecture. In this role, you will play a crucial part in the design, development, troubleshooting, and optimization of our server hardware infrastructure. You will collaborate closely with cross-functional teams, external vendors, and stakeholders to ensure the successful delivery of highly performant and reliable hardware solutions.

Responsibilities:

  • Troubleshoot complex GPU and PCIe related failures
  • Partner with external vendors on failure analysis
  • Track component RMAs
  • Develop and maintain hardware/firmware management services.
  • Automate all aspects of the server hardware lifecycle.
  • Serve as the senior point of contact for hardware escalation and troubleshooting.
  • Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture.
  • Create and maintain accurate documentation of hardware designs, specifications, test procedures, and results.
  • Analyze and optimize the performance of hardware systems, identify bottlenecks, and propose improvements for enhanced efficiency.
  • Establish processes for internal hardware testing, deployment, and performance optimization.

The ideal candidate will have at least 2 years professional experience with the following:

  • Prior experience supporting and troubleshooting data center class GPUs (preferably A100 or newer)
  • Proficiency in ansible/python and experience with programmatically interacting with server BMCs, using IPMI or Redfish (preferably Redfish).
  • Experience using, integrating and automating data center class GPU diagnostics and troubleshooting tools
  • In-depth knowledge of server hardware, components, and management technologies, particularly GPUs and PCIe devices.
  • Proven ability to stay updated with the latest industry technologies and trends.
  • Previous experience collaborating with hardware vendors.
  • Strong passion for automation, with a commitment to automating processes comprehensively.
  • Excellent documentation skills and attention to detail.
  • Strong analytical and problem-solving abilities.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $160,000/year in our lowest geographic market up to $210,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

Hybrid Workplace

Successful candidates will be expected to attend onboarding training at our NJ Headquarters within their first several weeks of employment, with subsequent quarterly travel requirements of 1 week duration.

If you reside within a 30-mile radius of our New Jersey, New York, or Philadelphia offices, we're excited for you to join us at the office at least three times a week, recognizing the significance we place on fostering connections, collaboration, and creativity within our office culture. Our commitment to operating as a hybrid workplace underscores our dedication to enabling our employees to tailor their work-life balance to their individual preferences.

Why CoreWeave?

At CoreWeave, we work hard, have fun, and move fast We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:

  • Be Curious at your Core
  • Act like an Owner
  • Empower Employees
  • Deliver Best In-Class Client Experience
  • Achieve More Together

We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us

Benefits

We offer a competitive salary and benefits, including:

  • Medical, dental and vision insurance - 100% paid for the employee
  • Company paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Tuition Reimbursement
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our offices
  • Weekly massages in NJ office
  • A casual work environment
  • Work culture focused on innovative disruption

California Consumer Privacy Act - California applicants only

CoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.