计算机科学
现场可编程门阵列
卷积神经网络
交错
延迟(音频)
预处理器
能源消耗
计算机工程
传输(电信)
协处理器
人工神经网络
嵌入式系统
人工智能
计算机硬件
电信
生态学
生物
操作系统
作者
Zhengjie Zhou,Yumei Liu,Yidong Xu
出处
期刊:Proceedings of the 2020 4th International Conference on Electronic Information Technology and Computer Engineering
日期:2020-11-06
被引量:4
标识
DOI:10.1145/3443467.3443911
摘要
Convolutional Neural Network (CNN) has been widely used in computer vision fields such as image recognition and target detection. However, in the forward reasoning stage, many practical applications often require features of low latency and low power consumption. In order to solve this problem, optimization methods such as channel interleaving, multi-channel transmission, and multi-level union are adopted to design and implement a convolutional neural network system based on the FPGA. After analyzing the performance and resource consumption of the accelerator, the actual transmission delay was also considered to reduce the delay error; input and output modules were added to reduce the time for image preprocessing and postprocessing. In this work, the YOLOv3-Tiny model algorithm was implemented on the Xilinx PYNQ-Z2 (ARM+FPGA) platform. Experimental results show that, compared with the CPU, it is greatly optimized in terms of energy efficiency and time, and it has been improved from some previous works.
科研通智能强力驱动
Strongly Powered by AbleSci AI