计算机科学
人工智能
卷积神经网络
卷积(计算机科学)
图形
模式识别(心理学)
失败
人工神经网络
骨架(计算机编程)
理论计算机科学
并行计算
程序设计语言
作者
Cheng Qin,Ziliang Ren,Jun Cheng,Qieshi Zhang,Hao Yan,Jianming Liu
出处
期刊:IEEE International Conference on Real-time Computing and Robotics
日期:2021-07-15
被引量:1
标识
DOI:10.1109/rcar52367.2021.9517665
摘要
The skeleton data convey significant information for human action recognition since they can robustly accommodate cluttered background and illumination variation. Early convolutional neural networks (CNN) based method mainly structure the skeleton sequence into pseudo-image and feed it into image classification neural network such as Resnet, which can not capture comprehensive spatial-temporal feature. Recently, graph convolutional networks (GCNs) have obtained superior performance. However, the computational complexity of GCN-based methods is quite high, some works even reach 100 GFLOPs for one action sample. This is contrary to the highly condensed attributes of skeleton data. In this paper, a Multi-scale Spatial-temporal Convolution Neural Network (MSST-Net) is proposed for skeleton-based action recognition. Our MSST-Net abandons complex graph convolutions and takes the implicit complementary advantages across different scales of spatial-temporal representations, which are often ignored in the previous work. On two datasets for action recognition, MSST-Net achieves impressive recognition accuracy with a small amount of calculation.
科研通智能强力驱动
Strongly Powered by AbleSci AI