计算机科学
卷积神经网络
人工智能
骨架(计算机编程)
模式识别(心理学)
动作识别
循环神经网络
边距(机器学习)
RGB颜色模型
特征提取
深度学习
人工神经网络
计算机视觉
机器学习
程序设计语言
班级(哲学)
作者
Chao Li,Qiaoyong Zhong,Dong Xie,Shiliang Pu
标识
DOI:10.1109/icmew.2017.8026285
摘要
Current state-of-the-art approaches to skeleton-based action recognition are mostly based on recurrent neural networks (RNN). In this paper, we propose a novel convolutional neural networks (CNN) based framework for both action classification and detection. Raw skeleton coordinates as well as skeleton motion are fed directly into CNN for label prediction. A novel skeleton transformer module is designed to rearrange and select important skeleton joints automatically. With a simple 7-layer network, we obtain 89.3% accuracy on validation set of the NTU RGB+D dataset. For action detection in untrimmed videos, we develop a window proposal network to extract temporal segment proposals, which are further classified within the same network. On the recent PKU-MMD dataset, we achieve 93.7% mAP, surpassing the baseline by a large margin.
科研通智能强力驱动
Strongly Powered by AbleSci AI