计算机科学
融合
估计
姿势
人工智能
计算机视觉
模式识别(心理学)
工程类
系统工程
语言学
哲学
作者
Qian Zheng,Hualing Guo,Yunhua Yin,Bin Zheng,Hongxu Jiang
标识
DOI:10.1016/j.jvcir.2024.104093
摘要
To address the limitations of existing 2D human pose estimation methods in terms of speed and lightweight, we propose a method called Lightweight Fusion SimCC (LFSimCC). LFSimCC incorporates two modules: LiteFNet, which enhances multi-scale spatial information fusion, and LKC-GAU, which improves the modeling capability of spatial information. Specifically, LiteFNet utilizes a combination of self-attention mechanism and novel spatial convolution to enable feature maps to capture richer multi-level global feature representations within the network. On the other hand, LKC-GAU enhances SimCC’s ability to capture spatial relationships between joints by incorporating a large kernel of convolution and a self-attention mechanism. Furthermore, we design a keypoint information fusion loss (IFL) that enhances the model’s sensitivity to information between keypoints in the human body. Experimental results demonstrate that our method is capable of extracting more decisive information and suppressing redundant feature representations, leading to high recognition accuracy and low inference latency.
科研通智能强力驱动
Strongly Powered by AbleSci AI