计算机科学
推论
水准点(测量)
GSM演进的增强数据速率
架空(工程)
计算
边缘设备
人工智能
频道(广播)
深层神经网络
人工神经网络
计算机工程
卷积神经网络
深度学习
修剪
机器学习
算法
操作系统
生物
云计算
地理
计算机网络
大地测量学
农学
作者
Mengran Liu,Weiwei Fang,Xiaodong Ma,Wenyuan Xu,Naixue Xiong,Yi Ding
标识
DOI:10.1016/j.asoc.2021.107636
摘要
Deep Neural Networks (DNNs) have achieved remarkable success in many computer vision tasks recently, but the huge number of parameters and the high computation overhead hinder their deployments on resource-constrained edge devices. It is worth noting that channel pruning is an effective approach for compressing DNN models. A critical challenge is to determine which channels are to be removed, so that the model accuracy will not be negatively affected. In this paper, we first propose Spatial and Channel Attention (SCA), a new attention module combining both spatial and channel attention that respectively focuses on “where” and “what” are the most informative parts. Guided by the scale values generated by SCA for measuring channel importance, we further propose a new channel pruning approach called Channel Pruning guided by Spatial and Channel Attention (CPSCA). Experimental results indicate that SCA achieves the best inference accuracy, while incurring negligibly extra resource consumption, compared to other state-of-the-art attention modules. Our evaluation on two benchmark datasets shows that, with the guidance of SCA, our CPSCA approach achieves higher inference accuracy than other state-of-the-art pruning methods under the same pruning ratios.
科研通智能强力驱动
Strongly Powered by AbleSci AI