串联(数学)
计算机科学
人工智能
特征(语言学)
帧(网络)
RGB颜色模型
骨架(计算机编程)
构造(python库)
模式识别(心理学)
过程(计算)
运动(物理)
邻接矩阵
转化(遗传学)
变换矩阵
接头(建筑物)
算法
理论计算机科学
数学
经典力学
运动学
程序设计语言
化学
语言学
建筑工程
生物化学
基因
电信
哲学
图形
工程类
物理
组合数学
操作系统
作者
Xinpeng Yin,Jianqi Zhong,Deliang Lian,Wenming Cao
标识
DOI:10.1016/j.patcog.2024.110262
摘要
Previous works have realized that spatio-temporal entanglement features can not be ignored in skeleton-based motion recognition tasks, then they have not broken away from the barriers of traditional GCN (The entanglement feature is still modeled by the extended single-frame adjacency matrix). We introduce a new joint-correlations determination mechanism that uses a non-linear transformation of the distance between joints in multiple frames to construct the connection relationship. The proposed method results in improved accuracy while significantly reducing the number of parameters. Meanwhile, recent works have alleviated the problem of most actions being only related to the dynamic characteristics of local joints by aggregating features of different parts of the human body in parallel, while interacting with different features still remains at a lower level of concatenation or addition. We propose a progressive inward-outward structure (PIS) that allows joint features corresponding to the action to be extracted while taking into account the lightweight link between this part of the joints and the rest. Integrating the above two designs, we propose a Spatiotemporal Progressive Inward-Outward Aggregation Network (SPIANet) to model the complex spatiotemporal entanglement between joints in the process of human motion, which is validated on three public datasets (NTU-RGB+D60, NTU-RGB+D120, and UESTC varying-view) and outperforms state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI