可重构性
现场可编程门阵列
计算机科学
卷积神经网络
计算机体系结构
Boosting(机器学习)
人工智能
硬件加速
人工神经网络
高效能源利用
计算机工程
深度学习
计算科学与工程
建筑
嵌入式系统
机器学习
操作系统
电气工程
工程类
艺术
视觉艺术
标识
DOI:10.1007/s00521-018-3761-1
摘要
Deep convolutional neural networks (CNNs) have recently shown very high accuracy in a wide range of cognitive tasks, and due to this, they have received significant interest from the researchers. Given the high computational demands of CNNs, custom hardware accelerators are vital for boosting their performance. The high energy efficiency, computing capabilities and reconfigurability of FPGA make it a promising platform for hardware acceleration of CNNs. In this paper, we present a survey of techniques for implementing and optimizing CNN algorithms on FPGA. We organize the works in several categories to bring out their similarities and differences. This paper is expected to be useful for researchers in the area of artificial intelligence, hardware architecture and system design.
科研通智能强力驱动
Strongly Powered by AbleSci AI