Realistic acceleration of neural networks with fine-grained tensor decomposition

计算机科学计算换位（逻辑）张量（固有定义）张量分解循环神经网络人工神经网络推论算法压缩（物理）可扩展性分解加速度数据压缩人工智能数学生物物理生态学复合材料数据库材料科学纯数学经典力学

作者

Rui Lv,Dingheng Wang,Jiangbin Zheng,Yefan Xie,Zhao-Xu Yang

出处

期刊：Neurocomputing [Elsevier]
日期：2022-11-01 卷期号：512: 52-68 被引量：1

标识

DOI：10.1016/j.neucom.2022.09.057

摘要

As the modern deep neural networks (DNNs) have become more and more large-scale and expensive, the topic of DNN compression grows into a hot direction nowadays. Among variant compression methods, tensor decomposition seems to be the most promising and low-cost one because of its solid mathematical foundations and regular data structure. However, most of the existing tensor decompositions are not very good at accelerating DNNs, because there are always necessary transpositions on tensor modes to make the input data calculate with the decomposed factor tensors correctly, and transposition will bring extra memory and time cost for the realistic system without doubt. In this paper, we select a relatively novel Kronecker CANDECOMP/PARAFAC (KCP) tensor decomposition which has fine-grained factor tensors, and propose the transposition-free algorithm to calculate the contractions between the input data and the neural weight in KCP format. The theoretically analysis of computation complexity indicates that the proposed method is much more efficient than the existing algorithms. We further prove that the training complexity of KCP-DNN based on the proposed transposition-free algorithm can also be faster than the traditional ones, and make a comprehensive comparison of space and computation complexity including training and inference stages to show the superiority of our method. As a series of related works pay more attention to the recurrent neural networks (RNNs), we follow these existing practices and focus on the KCP-RNN to make a comprehensive comparison with them, and the experimental results show our KCP-RNN with transposition-free algorithm has systematically advantages including accuracy, space complexity, computation complexity, and realistic running time. Besides, some advanced characteristics of KCP-DNN such as collocation of ranks, have also been discussed.

求助该文献

最长约 10秒，即可获得该文献文件

Realistic acceleration of neural networks with fine-grained tensor decomposition

今日热心研友