Computer science
Field-programmable gate array
Inference
Artificial neural network
Computation
Software
Computer engineering
Artificial intelligence
Embedded system
Computer hardware
Machine learning
Computer architecture
Hardware acceleration
Algorithm
Programming language
Authors
Kaiyuan Guo, Shulin Zeng, Jincheng Yu, Yu Wang, Huazhong Yang
Source
Journal: ACM Transactions on Reconfigurable Technology and Systems
Publisher: Association for Computing Machinery
Date: 2019-03-28
Volume/Issue: 12 (1): 1-26
Citations: 200
Abstract
Recent research on neural networks has shown a significant advantage of machine learning over traditional algorithms based on handcrafted features and models. Neural networks are now widely adopted in areas such as image, speech, and video recognition. However, the high computation and storage complexity of neural network inference poses great difficulties for its application. CPU platforms struggle to offer enough computation capacity. GPU platforms are the first choice for neural network processing because of their high computation capacity and easy-to-use development frameworks. Meanwhile, FPGA-based neural network inference accelerators have become an active research topic: with specifically designed hardware, FPGAs are a promising candidate to surpass GPUs in both speed and energy efficiency. Various FPGA-based accelerator designs have been proposed, using software and hardware optimization techniques to achieve high speed and energy efficiency. In this article, we give an overview of previous work on FPGA-based neural network inference accelerators and summarize the main techniques used. An investigation from software to hardware, and from the circuit level to the system level, is carried out to provide a complete analysis of FPGA-based neural network inference accelerator design and to serve as a guide for future work.
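As a concrete illustration of the kind of software-level optimization surveyed, consider uniform linear quantization of weights to low-bit fixed-point values, which reduces the storage footprint and multiplier cost of inference on an FPGA. The sketch below is not code from the paper; the function names, per-tensor scaling scheme, and 8-bit setting are all illustrative assumptions.

```python
# Minimal sketch of uniform weight quantization (an assumed example, not the
# paper's implementation): map float weights to signed n-bit integers with a
# single per-tensor scale, the form of fixed-point conversion commonly used
# before deploying a network to FPGA fabric.
import numpy as np

def quantize_uniform(w: np.ndarray, n_bits: int = 8):
    """Quantize float weights to signed n-bit integers with a per-tensor scale."""
    qmax = 2 ** (n_bits - 1) - 1             # e.g. 127 for 8-bit weights
    scale = np.max(np.abs(w)) / qmax         # map the largest magnitude to qmax
    q = np.clip(np.round(w / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float weights from the integer codes."""
    return q.astype(np.float32) * scale

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w = rng.standard_normal((64, 64)).astype(np.float32)
    q, scale = quantize_uniform(w, n_bits=8)
    err = np.abs(w - dequantize(q, scale)).max()
    print(f"scale={scale:.6f}, max abs error={err:.6f}")
```

In this scheme the 4x storage reduction (32-bit float to 8-bit integer) comes at the cost of a bounded rounding error of at most half a quantization step, which is the basic speed/accuracy trade-off the surveyed designs tune per network.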