计算机科学
骨架(计算机编程)
动作识别
人工智能
人体骨骼
卷积神经网络
图形
RGB颜色模型
模式识别(心理学)
噪音(视频)
计算机视觉
理论计算机科学
图像(数学)
班级(哲学)
程序设计语言
作者
Yongsang Yoon,Jongmin Yu,Moongu Jeon
标识
DOI:10.1007/s10489-021-02487-z
摘要
In skeleton-based action recognition, graph convolutional networks (GCNs), which model human body skeletons using graphical components such as nodes and connections, have recently achieved remarkable performance. While the current state-of-the-art methods for skeleton-based action recognition usually assume that completely observed skeletons will be provided, it is problematic to realize this assumption in real-world scenarios since the captured skeletons may be incomplete or noisy. In this work, we propose a skeleton-based action recognition method that is robust to noise interference for the given skeleton features. The key insight of our approach is to train a model by maximizing the mutual information between normal and noisy skeletons using predictive coding in the latent space. We conducted comprehensive skeleton-based action recognition experiments with defective skeletons using the NTU-RGB+D and Kinetics-Skeleton datasets. The experimental results demonstrate that when the skeleton samples are noisy, our approach achieves outstanding performances compared with the existing state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI