计算机科学
动作识别
RGB颜色模型
核(代数)
图形
稳健性(进化)
卷积(计算机科学)
人工智能
模式识别(心理学)
理论计算机科学
算法
数学
人工神经网络
生物化学
化学
组合数学
基因
班级(哲学)
作者
Wenhua Li,Enzeng Dong,Jigang Tong,Sen Yang,Zufeng Zhang,Wenyu Li
标识
DOI:10.1109/icma57826.2023.10215860
摘要
Skeleton-based human action recognition has become a popular topic among researchers. This is because using skeletal data provides a robust solution to problems encountered in complex environments, such as changes in perspective and background interference. The robustness of skeletal data enables recognition methods to focus on more specific and relevant features. We propose a model called multilevel decomposition time aggregation graph convolution network (MDT-GCN), which utilizes a multilevel graph convolution kernel to capture higher-order spatial dependence relationships between joints. This is achieved by decomposing a human topology graph into smaller graphs, each of which has its own graph convolution kernel. To further enhance the performance of our model, we employ a two-flow framework and channel topology refinement strategy. Our experiments on the NTU-RGB+D60 and NTU-RGB+D120 datasets demonstrate that our MDT-GCN network outperforms the previous algorithm and significantly improves the accuracy of action recognition.
科研通智能强力驱动
Strongly Powered by AbleSci AI