计算机科学
手势
人工智能
手势识别
分割
保险丝(电气)
过程(计算)
推论
计算机视觉
特征(语言学)
特征提取
模式识别(心理学)
余弦相似度
语言学
哲学
电气工程
工程类
操作系统
作者
Guoyu Zhou,Zhenchao Cui,Jing Qi
出处
期刊:IEEE robotics and automation letters
日期:2024-02-05
卷期号:9 (4): 3076-3083
被引量:3
标识
DOI:10.1109/lra.2024.3362144
摘要
Computer vision-based gesture recognition methods play a significant role in robot visual gesture interaction. since of low accuracy leading by insuffcient feature representation and fusion, the existing gesture segmentation and recognition methods fail to meet the requirements of practical applications. To address these issues, a lightweight two-stage end-to-end gesture recognition network called Fusing Gate Dual Stages Network (FGDSNet) is proposed. This network adopts a dual-branch network structure in the segmentation stage. Existing dual-branch network models often directly fuse detailed features and semantic features, which leads to detailed information being obscured by blurry semantic information. Additionally, there are redundant issues in the feature maps at different levels during the network inference process. Therefore, we embed Cosine Similarity-KL Divergence Attention Module (CoSKLAM) and Gate Filtering Module (GFM) between the local detail branch and the contextual semantic branch. The role of these two modules is to facilitate the fusion of local and global features during the feature extraction process and filter out redundant information. Finally, the segmentation result and original gesture image are used as inputs for the recognition network to predict gesture categories. The relevant experiments show that the proposed network performs well in both gesture segmentation and gesture recognition, while also having real-time inference speed and a smaller parameter size.
科研通智能强力驱动
Strongly Powered by AbleSci AI