Senior AI and Deep Learning Architect – Model Compression and Quantization

2 days ago


San Diego CA United States Kneron, Inc. Full time
Senior AI and Deep Learning Architect – Model Compression and Quantization

Research and develop state-of-the-art model compression techniques including QAT, model distillation, pruning, quantization, model binarization, and others for deep learning models.

Implement novel deep neural network architectures and develop advanced training algorithms to support model structure training, auto pruning, and low-bit quantization.

Apply and optimize model compression and quantization techniques to a variety of models in computer vision applications, audio applications, and others.

Research and optimize model compression and quantization techniques for Kneron AI accelerator and jointly optimize hardware architecture for compressed models.

Requirements
  • M.S./PhD in Computer Science, Machine Learning, Mathematics or similar field (Ph.D. is preferred)
  • 3+ years of industry/academia experience with deep learning algorithm development and optimization.
  • 3-5 years of software engineering experience in an academic or industrial setting.
  • Research experience on any model compression and model quantization technique including model distillation, pruning, post-train quantization, quantization aware retrain, model binarization, and NAS.
  • Experience on model accuracy loss analysis for model compression and quantization is a strong plus. Noise modeling and noise analysis are strong plus.
  • Strong experience in C/C++ programming is a plus.
  • Hands-on experience in computer vision and deep learning frameworks, e.g., OpenCV, TensorFlow, Keras, PyTorch, and Caffe.
  • Ability to quickly adapt to new situations, learn new technologies, and collaborate and communicate effectively.
  • Experience with parallel computing, GPU/CUDA, DSP, and OpenCL programming is a plus.
  • Top-tier conference publication records, including but not limited to CVPR, ICCV, ECCV, NIPS, ICML, are strong plus.
Location #J-18808-Ljbffr

  • San Diego, United States Kneron Full time

    Senior AI and Deep Learning Architect – Model Compression and Quantization Research and develop state-of-the-art model compression techniques including QAT, model distillation, pruning, quantization, model binarization, and others for deep learning models. Implement novel deep neural network architectures and develop advanced training algorithms to support...


  • San Diego, United States Kneron Full time

    Senior AI and Deep Learning Architect – Model Compression and Quantization Job description Research and develop state-of-the-art model compression techniques including QAT, model distillation, pruning, quantization, model binarization, and others for deep learning models.  Implementing novel deep neural network architectures and developing advanced...


  • San Diego, CA, United States Amazon Full time

    Job ID: 2800225 | Amazon Web Services, Inc. Are you looking to work at the forefront of Machine Learning and AI? Would you be excited to apply cutting edge Generative AI algorithms to solve real world problems with significant impact? Machine learning (ML) has been strategic to Amazon from the early years. We are pioneers in areas such as recommendation...

  • AI Systems Engineer

    1 week ago


    San Diego, California, United States Qualcomm Full time

    Job Title: AI Systems EngineerQualcomm is seeking a highly skilled AI Systems Engineer to join our team. As an AI Systems Engineer, you will be responsible for designing and developing state-of-the-art AI systems for advanced driver assistance systems (ADAS) and autonomous driving solutions.Key Responsibilities:Apply machine learning knowledge to extend...


  • San Jose, United States Advanced Micro Devices, Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHINGWe care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Francisco, United States Advanced Micro Devices , Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Jose, United States Advanced Micro Devices , Inc. Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • San Jose, United States Advanced Micro Devices , Inc. Full time

    Overview: WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • San Jose, United States Advanced Micro Devices , Inc. Full time

    Overview: WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • San Diego, United States Kneron Full time

    AI and Deep Learning Engineer Job description -Implementing novel deep neural network architectures and learning techniques to solve a variety of computer vision and audio related tasks and push the state of the art in performance.  Research and develop audio applications including speech recognition, voice recognition, voice wakeup using state of art deep...


  • San Francisco, CA, United States TensorLake Inc. Full time

    Tensorlake is building a distributed data processing platform for developers building Generative AI applications. Our product, Indexify( ), enables building continuously evolving knowledge bases and indexes for Large Language Model applications by allowing structured data or embedding extraction algorithms on any unstructured data. We are building a...


  • San Diego, United States Ernst and Young Full time

    At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better. Join us and build an exceptional experience for yourself, and a better working world for all.The...


  • San Francisco, California, United States NovumTech Partners Full time

    Job SummaryWe are seeking a highly skilled Machine Learning Model Architect to join our team at NovumTech Partners. As a key member of our research and development team, you will be responsible for designing and implementing AI models that drive business growth and innovation.About the RoleThe successful candidate will have a strong background in machine...


  • San Francisco, California, United States Scale AI Full time

    Research Role OverviewScale AI's Generative AI team is pushing the boundaries of artificial intelligence by developing innovative models, algorithms, and supervision techniques. As a Senior AI Research Scientist for Generative Models, you will play a critical role in advancing our research agenda and driving product development. Your expertise in Generative...


  • San Francisco, CA, United States Truva AI Full time

    Why Join Truva.ai Truva stands at the forefront of SaaS innovation, specializing in automating tasks, optimizing workflows, and delivering unparalleled operational efficiency with LLMs. Truva is backed by top VCs such as YCombinator and Fintech Collective and led by Gaurav - 2x founder and an alumnus of Stanford, and Anuja - an alumnus of Haas MBA from UC...


  • Indianapolis, IN, United States Ernst and Young Full time

    At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better. Join us and build an exceptional experience for yourself, and a better working world for all. The...


  • San Francisco, California, United States Perplexity AI Full time

    We are a fast-growing AI company looking for an expert machine learning engineer to join our team. Our current stack is Python, C++, TensorRT-LLM, and Kubernetes.You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference. The ideal candidate should have experience with ML systems and deep learning...


  • Baltimore, MD, United States Ernst and Young Full time

    At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better. Join us and build an exceptional experience for yourself, and a better working world for all. The...


  • Miami, FL, United States Ernst and Young Full time

    At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better. Join us and build an exceptional experience for yourself, and a better working world for all. The...


  • Detroit, MI, United States Ernst and Young Full time

    At EY, you’ll have the chance to build a career as unique as you are, with the global scale, support, inclusive culture and technology to become the best version of you. And we’re counting on your unique voice and perspective to help EY become even better. Join us and build an exceptional experience for yourself, and a better working world for all. The...