动态时间归整
计算机科学
人工智能
公制(单位)
匹配(统计)
图像扭曲
集合(抽象数据类型)
工具箱
特征学习
代表(政治)
光学(聚焦)
RGB颜色模型
机器学习
模式识别(心理学)
政治学
经济
物理
光学
统计
政治
程序设计语言
法学
数学
运营管理
作者
Leiyu Xie,Yuxing Yang,Zeyu Fu,Syed Mohsen Naqvi
标识
DOI:10.1109/icassp49357.2023.10097186
摘要
In this paper, we address the classification of medical actions with only one single sample by developing a novel one-shot learning framework which contains both cross-attention and dynamic time warping (DTW) modules. To be concrete, we firstly transform the raw skeleton sequence into the signal-level image representation. We exploit a metric learning approach, which is the prototypical network for the proposed one-shot learning framework and choose the residual network (ResNet18) as the backbone which is widely used in recent years. Cross-attention is applied for guiding the network to focus on the more important joints from each specific action. The cross-attention mechanism that applies between the support and query set will be adapted for mining and matching the relationships with the human body. Furthermore, a DTW module is introduced to mitigate the temporal information mismatching issue between the actions from the support and query sets. The experimental results on the NTU RGB+D 120 dataset demonstrate the effectiveness of our proposed approach and the improved performance compared to the baseline approach. The code of this work is available at 1 .
科研通智能强力驱动
Strongly Powered by AbleSci AI