现场可编程门阵列
计算机科学
计算
内存占用
量化(信号处理)
嵌入式系统
人工神经网络
深度学习
计算机硬件
计算机体系结构
人工智能
算法
操作系统
作者
Eric Rex,Xiaofang Wang
标识
DOI:10.1109/uemcon59035.2023.10316010
摘要
As Deep Neural Networks (DNNs) have been growing to more and more complex models, their computation and storage requirements have increased dramatically. Recent research works on quantized neural networks (QNNs) have shown their great advantages for real-time machine learning on resource-limited embedded devices. FPGAs have emerged as a popular and efficient hardware platform to accelerate QNNs and DNNs when considering both performance and energy efficiencies. On the other hand, with the increasing complexity of hardware design, high-level synthesis tools are becoming popular to facilitate fast implementation. In this paper, we present our quantization and hardware implementation of the AlexNet using the Xilinx FINN framework. Extensive experiments with QNNs of various bits on a low-cost and resource-limited FPGA board demonstrate their good performance with much smaller memory footprint and lower computation complexity.
科研通智能强力驱动
Strongly Powered by AbleSci AI