计算机科学
姿势
变压器
解码方法
人工智能
光学(聚焦)
数据挖掘
机器学习
算法
工程类
电压
电气工程
光学
物理
作者
Liang Hong,Cuiping Wang,Mingwen Shao,Qian Zhang
出处
期刊:Research Square - Research Square
日期:2024-07-24
标识
DOI:10.21203/rs.3.rs-4648561/v1
摘要
Abstract Researchers are rapidly turning their focus to human pose estimation as a crucial area of computer vision. In light of the shortcomings of existing Transformer-based pose estimate methods when handling localized features, this work presents MAQT, an enhanced end-to-end method aimed at precise multi-human body pose estimation.To improve the localization of keypoints that are sensitive to scale changes, MAQT offers a Asym-Fusion block. Additionally, we design a new query strategy to optimize the initial selection of queries with Uncertainty-minimal Query Selection. This study combines two self-attention mechanisms in the decoding phase to more correctly understand and record the intricate relationships among keypoints. Based on experimental results on MS COCO using the CrowdPose dataset, MAQT performs better than current contemporary methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI