X3D型
计算机科学
现场可编程门阵列
水准点(测量)
卷积神经网络
任务(项目管理)
动作识别
计算机体系结构
地理空间分析
人工智能
动作(物理)
嵌入式系统
人机交互
机器学习
实时计算
虚拟现实
系统工程
工程类
物理
VRML
地理
班级(哲学)
量子力学
地图学
大地测量学
作者
Petros Toupas,Christos-Savvas Bouganis,Dimitrios Tzovaras
标识
DOI:10.1109/asap57973.2023.00030
摘要
3D Convolutional Neural Networks are gaining increasing attention from researchers and practitioners and have found applications in many domains, such as surveillance systems, autonomous vehicles, human monitoring systems, and video retrieval. However, their widespread adoption is hindered by their high computational and memory requirements, especially when resource-constrained systems are targeted. This paper addresses the problem of mapping X3D, a state-of-the-art model in Human Action Recognition that achieves accuracy of 95.5% in the UCF101 benchmark, onto any FPGA device. The proposed toolflow generates an optimised stream-based hardware system, taking into account the available resources and off-chip memory characteristics of the FPGA device. The generated designs push further the current performance-accuracy pareto front, and enable for the first time the targeting of such complex model architectures for the Human Action Recognition task.
科研通智能强力驱动
Strongly Powered by AbleSci AI