Founding AI Frameworks Engineer

4 days ago


San Francisco CA United States TensorLake Inc. Full time

Tensorlake is building a distributed data processing platform for developers building Generative AI applications. Our product, Indexify( ), enables building continuously evolving knowledge bases and indexes for Large Language Model applications by allowing structured data or embedding extraction algorithms on any unstructured data.

We are building a server-less product on top of Indexify that allows users to build real-time extraction pipelines for unstructured data. The extracted data and indexes would be directly consumed by AI Applications and LLMs to power business and consumer applications.

As an AI Frameworks Engineer , you will be responsible for optimizing our AI infrastructure, developing high-performance inference engines, and maximizing GPU utilization. You’ll work on the critical backend architecture that powers our platform’s scalability and performance, collaborating with both researchers and product engineers to ensure Tensorlake’s models run efficiently on a variety of hardware configurations.

Responsibilities

As an AI Frameworks Engineer, your focus will be on optimizing and building high-performance AI systems. You will:

  • Design and build custom inference engines optimized for high throughput and low latency.
  • Optimize GPU usage across our platform, ensuring that deep learning models run efficiently at scale.
  • Write and optimize custom CUDA kernels and other low-level operations to accelerate deep learning workloads.
  • Develop and implement techniques for model compression , including quantization and pruning , to make models more efficient for real-world deployment.
  • Collaborate with research scientists and engineers to integrate new models into Tensorlake’s platform while ensuring peak performance.
  • Utilize cuDNN , cuBLAS , and other GPU-accelerated libraries to optimize computational workloads.
  • Troubleshoot and debug performance bottlenecks using tools like nvprof and Nsight , and implement fixes to improve throughput and memory usage.
  • Work on scaling AI models to multiple GPUs and nodes using NCCL and other parallel computing techniques.
Basic Qualifications
  • 5+ years of experience in building and optimizing AI models for performance at scale.
  • Strong knowledge of deep learning frameworks such as TensorFlow , PyTorch , or JAX , with experience optimizing them for hardware.
  • Proficiency in GPU programming with CUDA , OpenCL , or similar parallel computing frameworks.
  • Expertise in writing custom CUDA kernels to optimize deep learning operations.
  • Experience with inference engines such as TensorRT , and understanding of model deployment optimization.
  • Software engineering proficiency in C/C++ , Python , and low-level system components like memory management and concurrency.
  • Experience in using profiling tools like nvprof , Nsight , and other debugging tools for performance tuning.
Benefits

- Ability to save in 401(k) plans

- Comprehensive Healthcare and Dental Benefits

#J-18808-Ljbffr

  • San Francisco, CA, United States LlamaIndex Inc. Full time

    Help us build the future of AI! Join our team to pioneer the future of Large Language Model (LLM) applications. Values we are looking for: Integrity, alignment and passion Adaptability and the ability to work with limited resources Intellectual horsepower (smart x get things done) Level up the people around you Domain specific knowledge About the...


  • San Francisco, United States Scout AI Full time

    Intro Scout AIis a new hiring platform that connects software engineers to opportunities with world-class companies. On Scout, you get a more relevant and growthful interviewing experience, you receive feedback on your performance, and you also get end-to-end support to improve your chances of getting hired. If you perform well on the Scout interview, you...

  • AI Framework Engineer

    3 weeks ago


    San Francisco, United States Jobot Full time

    Job DescriptionJob DescriptionWell-Funded Seed Stage Startup / Generative AI / Remote FlexibilityThis Jobot Job is hosted by: Caitlyn HardyAre you a fit? Easy Apply now by clicking the "Apply Now" buttonand sending us your resume.Salary: $190,000 - $215,000 per yearA bit about us:We are a well-funded Seed-stage startup that has plans to double our team size...


  • San Francisco, CA, United States Athina AI Full time

    Skills: Node.js, Python, React, SQL Overview: Athina is building an IDE to enable the development of AI-powered products. We are on a mission to build the new stack for AI product teams. This means defining how teams will prototype, experiment, train, evaluate and monitor GenAI products. The next decade will enable AI-powered products and features that are...


  • San Francisco, California, United States Jobot Full time

    Company Overview:We are a well-funded startup with ambitious growth plans, focusing on developing innovative Generative AI solutions for developers.Salary: $200,000 - $225,000 per yearJob Description:We are seeking an experienced AI Framework Engineer to join our team. The ideal candidate will have a strong background in large language models and/or GenAI,...

  • Founding Engineer

    3 days ago


    San Francisco, CA, United States Hamming AI Full time

    We are a fast-growing voice AI testing company. We are winning (8Xed our revenue last month) and are hiring a founding engineer to help us win faster. Here's what you'll do: 0 to 1 Build new products extremely quickly that make our customer’s voice agents more reliable. Our customers want new features, and we don’t have enough time to satisfy current...


  • San Francisco, California, United States Jobot Full time

    Seeking a skilled AI Framework Engineer to join our team at Jobot, a well-funded Seed-stage startup.Job OverviewWe are developing a unique product for developers building Generative AI applications, and we need an expert in large language models and/or GenAI to help us grow.Salary and BenefitsThe salary range for this position is $190,000 - $215,000 per...


  • San Francisco, California, United States Naptha AI Full time

    About Naptha AIWe are seeking exceptional Software Engineering interns to join Naptha AI and contribute to building the future of AI agent infrastructure.This internship offers hands-on experience working with frontier AI technology, backed by industry veterans and technical leaders through NVIDIA Inception, Google for Startups, and Microsoft for Startups.As...


  • San Francisco, United States Athina Ai Full time

    Skills: Node.js, Python, React, SQL Overview: Athina is building an IDE to enable the development of AI-powered products. We are on a mission to build the new stack for AI product teams. This means defining how teams will prototype, experiment, train, evaluate and monitor GenAI products. The next decade will enable AI-powered products and features that are...


  • San Francisco, United States Athina AI Full time

    Skills: Node.js, Python, React, SQL Overview: Athina is building an IDE to enable the development of AI-powered products. We are on a mission to build the new stack for AI product teams. This means defining how teams will prototype, experiment, train, evaluate and monitor GenAI products. The next decade will enable AI-powered products and features that are...

  • Software Engineer

    2 weeks ago


    San Francisco, California, United States Stack AI Full time

    About Stack AIWe're a fast-growing startup on a mission to democratize access to Large Language Models. Our user-friendly and intuitive No-Code platform integrates the best AI models, common data sources, and SaaS tools.Our Traction is impressive: launched 8 months ago with over 65,000 users and 300+ paying customers, including public companies and...


  • San Francisco, United States Scout AI Full time

    About Scout AI Our company Scout AI is an AI tech startup company working at the intersection of hiring and upskilling. Our mission is to increase the average skill level of everyone on the planet by an order of magnitude, while bridging the disconnect between education, interviewing, and hiring. We are a small and mighty team led by Christian Arredondo and...


  • San Francisco, United States Athina AI Full time

    Skills: Node.js, Python, React, SQLOverview:Athina is building an IDE to enable the development of AI-powered products. We are on a mission to build the new stack for AI product teams. This means defining how teams will prototype, experiment, train, evaluate and monitor GenAI products.The next decade will enable AI-powered products and features that are...


  • San Francisco, United States Scout AI Full time

    Intro Scout AI is a new hiring platform that connects software engineers to opportunities with world-class companies. On Scout, you get a more relevant and growthful interviewing experience, you receive feedback on your performance, and you also get end-to-end support to improve your chances of getting hired. If you perform well on the Scout interview, you...

  • Founding AI Engineer

    3 weeks ago


    San Francisco, United States Artie, Inc. Full time

    We are an early stage, Y Combinator-backed health tech start-up that uses generative AI to help automate the calls that providers’ offices make to payors. We are looking for a Founding AI Software Engineer to join our small team. Join us as we help improve healthcare efficiency.You’ll work directly with the two founders to shape the direction of the...


  • San Francisco, United States Scout AI Full time

    Intro Scout AI is a new hiring platform that connects software engineers to opportunities with world-class companies. On Scout, you get a more relevant and growthful interviewing experience, you receive feedback on your performance, and you also get end-to-end support to improve your chances of getting hired. If you perform well on the Scout interview, you...


  • Palo Alto, CA, United States Ai Brainer Full time

    The company is committed to leveraging AI to develop innovative features and products that enhance user experience in their matchmaking services. The role involves conducting applied research in Generative AI, developing prototypes, and implementing AI-driven features in production. Collaboration with other engineers and product managers is essential to...


  • San Francisco, California, United States Stack AI Full time

    About Stack AIWe are a fast-growing startup revolutionizing access to Large Language Models, enabling anyone to build AI-powered applications with positive impact.Our No-Code platform seamlessly integrates top AI models, common data sources, and SaaS tools, making it easy for developers to focus on product and business growth.We value innovation, agility,...


  • San Francisco, CA, United States TensorLake Inc. Full time

    Founding Applied AI Research Scientist Tensorlake is building a distributed data processing platform for developers building Generative AI applications. Our product, Indexify( ), enables building continuously evolving knowledge bases and indexes for Large Language Model applications by allowing structured data or embedding extraction algorithms on any...


  • San Francisco, California, United States Jobot Full time

    Overview:We are a well-funded Seed-stage startup, looking for an exceptional AI Framework Engineer to join our team. Our product is public and already generating revenue, created for developers building Generative AI applications. This role is ideal for individuals with a strong background in large language models and/or GenAI.Compensation:The estimated...