GPU Architect

1 month ago


Los Angeles, California, United States Vast Full time
About Vast:

Vast is a cloud computing platform that empowers businesses to reduce the cost and friction of compute-intensive workloads. Our mission is to democratize access to large-scale GPU liquidity, enabling organizations to leverage cutting-edge AI technologies.

We are a market leader in low-cost GPU rentals, powering AI projects and businesses worldwide. Our innovative approach has made us a go-to solution for companies seeking to harness the power of AI.

As a fast-growing startup, we offer a dynamic and ambitious work environment. Our team is passionate about pushing the boundaries of AI and machine learning.

About the Role:

We are seeking a skilled Systems/GPU Programmer to join our team. As a key member of our engineering team, you will play a crucial role in developing high-performance matrix multiplication kernels and algorithms that enhance AI model inference. Your expertise in C++, CUDA, and GPGPU will be instrumental in designing and deploying new tensor libraries and auto-optimization tools.

You will work closely with our technical CEO and founder to co-design GPU kernels and model architecture, ensuring optimal performance and efficiency. Your ability to research and stay up-to-date with the latest advancements in AI model inference and GPU programming techniques will be vital in driving innovation.

Key Responsibilities:
  • Design and develop high-performance matrix multiplication kernels and algorithms
  • Develop new tensor libraries and auto-optimization tools
  • Collaborate with the technical team to co-design GPU kernels and model architecture
  • Research and stay current with the latest advancements in AI model inference and GPU programming techniques
Requirements:
  • Deep expertise in C++, CUDA, and GPGPU
  • Strong understanding of parallel computing and high-performance computing
  • Excellent problem-solving skills and ability to work independently
  • Passion for AI and machine learning
What We Offer:
  • Full-time, Monday-Friday schedule
  • Onsite at our HQ in Westwood, Los Angeles
  • Comprehensive health, dental, and vision insurance
  • Matching 401K plan
  • Generous equity options
  • Opportunity to grow into technical or management roles

  • Senior Architect

    2 weeks ago


    Los Angeles, California, United States Zynga, Inc. Full time

    Senior ArchitectWe are seeking an experienced Senior Architect to join our team at Zynga, Inc. to develop our next project using the Unreal Engine on mobile devices and other platforms.The ideal candidate will have a strong background in game architecture, with experience in designing and implementing large-scale game systems, as well as a deep understanding...


  • Los Angeles, California, United States Insight Global Full time

    Job Title: Digital Test Solutions ArchitectInsight Global is seeking a highly skilled Digital Test Solutions Architect to join our team. As a key member of our sales, application engineering, and product development teams, you will play a strategic leadership role in protecting and expanding our market-leading position within the Digital/HPC market...


  • Los Angeles, California, United States Intel Full time

    Job Title: AI and HPC Scale-out Systems ArchitectJoin Intel's Data Center and Artificial Intelligence Group as a key architect in shaping the future of AI and HPC systems. As a member of our team, you will be responsible for designing and architecting large-scale systems that support breakthrough performance on HPC and AI workloads.Key...


  • Los Angeles, California, United States That's No Moon Entertainment Full time

    Expert Systems EngineerThat's No Moon Entertainment is seeking a highly skilled Expert Systems Engineer to join our team. As a key member of our development team, you will be responsible for ensuring the performance and stability of our AAA title across multiple platforms.Responsibilities:Collaborate with engineers and other disciplines to provide holistic...


  • Los Angeles, California, United States Nvidia Full time

    Job Title: Senior Deep Learning Software EngineerWe are seeking a highly skilled Senior Deep Learning Software Engineer to join our team at NVIDIA. As a key member of our model optimization group, you will play a pivotal role in designing and building our automated inference and deployment solution.Key Responsibilities:Architect and design a modular and...


  • Los Angeles, California, United States Freeform Full time

    Job OverviewFreeform is a pioneering company in the field of software-defined, autonomous metal 3D printing factories. We are seeking a highly skilled Senior Software Engineer to join our team and contribute to the development of our proprietary technology stack. The ideal candidate will have a strong foundation in computer science and experience in...


  • Los Angeles, California, United States FreeForm Full time

    Simulation Engineer Job DescriptionWe are seeking a highly skilled Simulation Engineer to join our team at Freeform. As a Simulation Engineer, you will play a critical role in developing physics-based models that enable the first production scale, high quality, and fully automated metal 3D printing factory capability.Responsibilities:Develop models...


  • Los Angeles, California, United States FreeForm Full time

    Job DescriptionWe are seeking a highly skilled Senior Software Engineer to join our team at Freeform. As a key member of our software development team, you will be responsible for designing and developing the print preparation software pipeline for our advanced production-scale metal 3D printing system.Your primary focus will be on automating CAD import,...

  • DevSecOps Engineer

    7 days ago


    Los Angeles, California, United States Kindo Full time

    Job Title: DevSecOps EngineerCompany OverviewKindo is a pioneering AI company that is revolutionizing the way enterprises adopt AI. Our mission is to provide a secure and controllable way for businesses to leverage AI technology. We are committed to giving our customers full control over their data and how AI interacts with it, which is core to our product...


  • Los Angeles, United States CyberCoders Full time

    Located in the South San Francisco Bay Area, we are a Semiconductor Startup with over $70M in funding that's on pace to secure our Series C by the end of 2023! We are building a universal processor that combines the functionality of a CPU, GPU, and TPU into one handling some of the largest AI and Machine Learning workloads around.We Are Seeking Senior Design...