RGB颜色模型
计算机科学
人工智能
计算机视觉
运动(物理)
网(多面体)
动作(物理)
动作识别
物理
数学
班级(哲学)
几何学
量子力学
作者
Yutong Li,Miao Ma,Jie Wu,Kaifang Yang,Zhao Pei,Jie Ren
出处
期刊:IEEE Sensors Journal
[Institute of Electrical and Electronics Engineers]
日期:2024-02-12
卷期号:24 (7): 11770-11782
被引量:1
标识
DOI:10.1109/jsen.2024.3363042
摘要
In recent years, action recognition has received widespread attention, which classifies actions by extracting features from kinds of sensor data. However, with the growing difficulty of identifying fine-grained actions, certain methods cannot learn sufficient motion and temporal information. Therefore, an effective information enhancement method is required to reason motion clues in video sequences. This article proposes an end-to-end video action recognition framework called the motion information enhancement network (MIE-Net), which consists of two innovative components. The first component, the adaptive fusion module (AFM), selectively extracts the relationships between original and motion-enhanced features to enhance the interaction among different feature information. The second component, a double pooling temporal attention module (DPTAM), implements temporal modeling to enhance subtle information during feature extraction. Finally, a standing long jump dataset (SLJD) containing over 1000 videos from 116 participants is collected by sensor camera, which differs from existing datasets in terms of strong background unbiasedness, to evaluate the effectiveness of our model robustly. Experimental results on SLJD, Something-Something v2, and Diving48 datasets demonstrate that the proposed MIE-Net outperforms most state-of-the-art methods. Our code is released at https://github.com/li-stu-998/MIE-Net .
科研通智能强力驱动
Strongly Powered by AbleSci AI