管道(软件)
计算机科学
现场可编程门阵列
吞吐量
可重组计算
目标检测
计算机硬件
能源消耗
带宽(计算)
嵌入式系统
硬件加速
计算机体系结构
人工智能
操作系统
无线
工程类
计算机网络
模式识别(心理学)
电气工程
出处
期刊:International Conference on Information Science and Technology
日期:2021-05-21
卷期号:: 571-577
被引量:36
标识
DOI:10.1109/icist52614.2021.9440554
摘要
The emergence of YOLOv3 makes it possible to detect small targets. Due to the characteristics of the YOLO network itself, the YOLOv3 network has exceptionally high requirements for computing power and memory bandwidth and it usually needs to be deployed on a dedicated hardware acceleration platform. FPGAs is a logically reconfigurable hardware chip with substantial advantages in terms of performance and power consumption, so it is a good choice to deploy a deep convolutional network. In the research of this paper, we proposed a reconfigurable YOLOv3 FPGA hardware accelerator based on the AXI bus ARM+FPGA architecture. The YOLOv3 network quantifies through Vitis AI, and a series of operations such as model compression and data pre-processing can save accelerator chips and the access time of external storage. Pipeline operation enables FPGAs to achieve higher throughput. Compared with the GPU implementation of the YOLOv3 model, it is found that the hardware implementation of the FPGA-based YOLOv3 accelerator has lower energy consumption and can achieve higher throughput.
科研通智能强力驱动
Strongly Powered by AbleSci AI