快速傅里叶变换
计算机科学
预处理器
动作识别
动作(物理)
人工智能
领域
语音识别
模式识别(心理学)
机器学习
算法
物理
量子力学
政治学
法学
班级(哲学)
作者
Nguyen Nang Hung Van,Phuc Hao,Van Nam Hoang,A. Borodko,Tran Duc Le
标识
DOI:10.1145/3628797.3628827
摘要
Human Action Recognition (HAR) has emerged as a pivotal challenge in the rapidly evolving realm of artificial intelligence and computer vision. This research delves into enhancing action recognition in video sequences by synergizing the Fast Fourier Transform (FFT) capabilities and the EfficientNet model. We employed the UCF101 dataset, encompassing a diverse range of action categories, to validate our approach empirically. The results underscored that incorporating FFT in the preprocessing stage amplifies the recognition efficacy of the EfficientNet model. Comparative analyses further substantiated the superiority of this combined approach over conventional methods, marking a significant contribution in the domain of action video recognition.
科研通智能强力驱动
Strongly Powered by AbleSci AI