RGB颜色模型
计算机科学
人工智能
图形
模式识别(心理学)
特征学习
帧速率
代表(政治)
水准点(测量)
理论计算机科学
大地测量学
政治
政治学
法学
地理
作者
Tianming Zhuang,Zhen Qin,Yi Ding,Fuhu Deng,LeDuo Chen,Zhiguang Qin,Kim‐Kwang Raymond Choo
出处
期刊:IEEE transactions on artificial intelligence
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:: 1-14
标识
DOI:10.1109/tai.2023.3329799
摘要
Human skeleton data, which has served in the aspect of human activity recognition, ought to be the most representative biometric characteristics due to its intuitivity and visuality. The state-of-the-art approaches mainly focus on improving modeling spatial correlations within graph topologies. However, the interframes motional representations are also of vital importance, and we argue that they are worth paying attention to and exploring. Therefore, a temporal refinement module with contrastive learning mechanism is proposed, fusing as a complementary to the conventional spatial graph convolution layer. In addition, in order to further exploiting the inter-frame variances and generalizing GCN operation to temporal dimension, a temporal-correlation matrix is introduced to effectively capture dynamic dependencies within frame-pairs, enhancing semantic feature representation. Moreover, since GCN is a typical local operator which lacks of capability to fully model the long-term relations along spatial and temporal variation, to move beyond the limitation, a spatialtemporal cascaded aggregation module is designed to enlarge the receptive filter scale. The overall recognition framework consists of three above novelties, which is capable of achieving remarkable performance by evaluating on benchmark datasets(i.e., NTU RGB+D 60, NTU RGB+D 120, PKU-MMD and Kinetics Skeleton 400). Extensive experiments demonstrate the effectiveness of the proposed framework, e.g., performing recognition accuracy rate of 90.9% and 96.8% on NTU RGB+D 60, 87.9% and 88.9% on NTU RGB+D 120.
科研通智能强力驱动
Strongly Powered by AbleSci AI