Deep Learning Infrastructure Engineer

1 week ago


Austin, United States Sustainable Talent Full time
Job DescriptionJob Description

Are you ready to make your mark in the forefront of technological innovation? As an HPC Cluster Engineer, you'll play a pivotal role in shaping the future of AI, deep learning, and machine learning initiatives. Join us and leverage Nvidia's cutting-edge GPU technology to drive groundbreaking discoveries and revolutionize industries.

Sustainable Talent is thrilled to partner with Nvidia, a global powerhouse with over 25 years of trailblazing advancements in computer graphics, gaming, and accelerated computing.

This is a W-2 full-time contract based in Santa Clara, CA - Hybrid work option. We offer competitive pay based on factors like experience, education, location, etc. and provide full benefits, PTO, and amazing company culture

Additional locations: MA, Westford; US, NC, Durham; US, TX, Austin.

What you'll be doing:

  • You'll lead the charge in optimizing our Infiniband network and managing Lustre and GPFS storage solutions, ensuring seamless performance for our cutting-edge initiatives.
  • Your expertise in the SLURM job scheduler will be instrumental in orchestrating the smooth operation of our clusters, from scheduling tasks to managing resources efficiently.
  • As a Linux sysadmin guru, you'll be responsible for maintaining the stability and security of our systems, leveraging your deep understanding of Linux environments.
  • Harnessing the power of Ansible, you'll automate routine tasks and streamline operations, freeing up time for innovation and optimization.
  • Advanced python and bash scripting will drive automation efforts and enable dynamic solutions to complex challenges.

What We Need to See:

  • Demonstrated experience with SLURM, coupled with a solid understanding of Infiniband networks and Lustre/GPFS storage systems, is essential.
  • A proven track record in Linux system administration, ensuring robustness and security in our computing environment.
  • Proficiency in Ansible is a must-have, enabling you to automate tasks and workflows efficiently.
  • Strong scripting abilities in Python and bash are critical for developing custom solutions and optimizing cluster performance.

Ways to Stand Out From the Crowd:

  • Showcase your knowledge of best practices in HPC cluster operations, automation, and upgrades, setting you apart as a seasoned professional in the field.

Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.



  • Austin, United States Targeted Talent Full time

    Job DescriptionJob DescriptionWe're seeking top-notch engineers to join our team. As part of our group, you'll collaborate with hardware and software engineers to design, develop, and optimize software for our chip, making AI inference accessible to everyone. You'll excel in identifying and resolving functional/performance bottlenecks in complex...


  • Austin, United States Targeted Talent Full time

    Job DescriptionJob DescriptionWe're seeking top-notch engineers to join our team. As part of our group, you'll collaborate with hardware and software engineers to design, develop, and optimize software for our chip, making AI inference accessible to everyone. You'll excel in identifying and resolving functional/performance bottlenecks in complex...

  • EDA Infrastructure Engineer

    Found in: Talent US C2 - 2 weeks ago


    Austin, United States NVIDIA Full time

    NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by...

  • Machine Learning Engineer

    Found in: Careerbuilder One Red US C2 - 20 hours ago


    Austin, TX, US Atlassian Full time

    Working at AtlassianAtlassians have flexibility in where they work whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, personal goals, and other priorities. We can hire people in any country where we have a legal entity. Interviews and onboarding are conducted virtually, a part...

  • Data Infrastructure Engineer

    Found in: Resume Library US A2 - 7 days ago


    Austin, Texas, United States Vericast Full time

    Job Description The Data Infrastructure Engineer is a critical contributor to developing and growing a modern data mesh architecture for scalable analytics and innovation within a data lakehouse framework. This new architecture enables organizations to build valuable data products within a self-service infrastructure, in turn making them discoverable,...


  • Austin, United States IDR (Internal Data Resources) Full time

    Immediate and Permanent opportunity for a Senior Infrastructure Cloud Engineer to join industry leading company in the Healthcare sector local to Austin, TX. Overview: This individual will work with the Infrastructure Team to develop, implement, optimize, and maintain cloud-based solutions. You will be responsible for deploying and migrating services to the...


  • Austin, United States Mastech Digital Full time

    Responsibilities: Architect, implement, and manage enterprise network wireless and switching environment(s) and services used throughout the company, in subsidiaries, production and warehouse locations. Administer and maintain business critical enterprise wireless and switching environment(s) and services used throughout the company in subsidiaries,...

  • Senior Engineer, Data Infrastructure

    Found in: Resume Library US A2 - 6 days ago


    Austin, Texas, United States Procore Technologies Full time

    Job Description What if you could use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world, and yet it’s also one of the world’s least digitized industries. That’s why we’re looking for...

  • Senior Cloud Infrastructure Systems Engineer

    Found in: Jooble US O C2 - 1 day ago


    Austin, TX, United States IDR (Internal Data Resources) Full time

    Immediate and Permanent opportunity for a Senior Infrastructure Cloud Engineer to join industry leading company in the Healthcare sector local to Austin, TX. This individual will work with the Infrastructure Team to develop, implement, optimize, and maintain cloud-based solutions. You will be responsible for deploying and migrating services to the cloud,...

  • Senior Engineer, Data Infrastructure

    Found in: Resume Library US A2 - 1 week ago


    Austin, Texas, United States Procore Technologies Full time

    Job Description What if you could use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world, and yet it’s also one of the world’s least digitized industries, not to mention one of the most...

  • Senior Cloud Infrastructure Systems Engineer

    Found in: Jooble US O C2 - 1 day ago


    Austin, TX, United States IDR (Internal Data Resources) Full time

    Immediate and Permanent opportunity for a Senior Infrastructure Cloud Engineer to join industry leading company in the Healthcare sector local to Austin, TX. Overview: This individual will work with the Infrastructure Team to develop, implement, optimize, and maintain cloud-based solutions. You will be responsible for deploying and migrating services to the...


  • Austin, United States Tenstorrent Inc. Full time

    Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high...


  • Austin, United States Tenstorrent Inc. Full time

    Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high...


  • Austin, United States Algo Capital Group Full time

    Senior Database Engineer Join a dynamic Research & Development team in a global financial technology firm, where your contributions will directly impact live trading and research operations. We are seeking talented Database Engineers to play a key role in maintaining and enhancing our cutting-edge infrastructure. Responsibilities: Develop and maintain...


  • Austin, United States Algo Capital Group Full time

    Senior Database Engineer Join a dynamic Research & Development team in a global financial technology firm, where your contributions will directly impact live trading and research operations. We are seeking talented Database Engineers to play a key role in maintaining and enhancing our cutting-edge infrastructure. Responsibilities: Develop and maintain...


  • Austin, United States CareerBuilder Full time

    Nomi Health was founded in 2019 as a direct healthcare company with a simple yet bold mission: rebuild the healthcare system so it is accessible and affordable for everyone. We are rebuilding the healthcare system by cutting costs, confusion, and complexity through direct contracts and payment with providers, deep data dives, and convenient patient care. We...


  • Austin, United States Algo Capital Group Full time

    Senior Database Engineer Join a dynamic Research & Development team in a global financial technology firm, where your contributions will directly impact live trading and research operations. We are seeking talented Database Engineers to play a key role in maintaining and enhancing our cutting-edge infrastructure.Responsibilities:Develop and maintain...

  • Senior Database Engineer

    Found in: Appcast Linkedin GBL C2 - 2 weeks ago


    Austin, United States Algo Capital Group Full time

    Senior Database Engineer Join a dynamic Research & Development team in a global financial technology firm, where your contributions will directly impact live trading and research operations. We are seeking talented Database Engineers to play a key role in maintaining and enhancing our cutting-edge infrastructure.Responsibilities:Develop and maintain...

  • Senior Database Engineer

    Found in: Appcast US C2 - 2 weeks ago


    Austin, United States Algo Capital Group Full time

    Senior Database Engineer Join a dynamic Research & Development team in a global financial technology firm, where your contributions will directly impact live trading and research operations. We are seeking talented Database Engineers to play a key role in maintaining and enhancing our cutting-edge infrastructure.Responsibilities:Develop and maintain...


  • Austin, United States Diverse Lynx Full time

    Responsibilities : We are looking for a Senior Data Engineer to join our team and help us build and maintain our data infrastructure and pipelines. As a Senior Data Engineer, you will be responsible for designing, developing, and implementing scalable and reliable data solutions to support our business needs. You will also work closely with other engineers...