稳健性(进化)
计算机科学
人工智能
棱锥(几何)
姿势
模式识别(心理学)
推论
比例(比率)
自上而下和自下而上的设计
特征(语言学)
卷积(计算机科学)
特征提取
计算机视觉
变化(天文学)
机器学习
人工神经网络
数学
地理
基因
软件工程
物理
地图学
哲学
生物化学
天体物理学
化学
语言学
几何学
作者
Bowen Cheng,Bin Xiao,Jingdong Wang,Humphrey Shi,Thomas S. Huang,Lei Zhang
标识
DOI:10.1109/cvpr42600.2020.00543
摘要
Bottom-up human pose estimation methods have difficulties in predicting the correct pose for small persons due to challenges in scale variation. In this paper, we present HigherHRNet: a novel bottom-up human pose estimation method for learning scale-aware representations using high-resolution feature pyramids. Equipped with multi-resolution supervision for training and multi-resolution aggregation for inference, the proposed approach is able to solve the scale variation challenge in bottom-up multi-person pose estimation and localize keypoints more precisely, especially for small person. The feature pyramid in HigherHRNet consists of feature map outputs from HRNet and upsampled higher-resolution outputs through a transposed convolution. HigherHRNet outperforms the previous best bottom-up method by 2.5% AP for medium person on COCO test-dev, showing its effectiveness in handling scale variation. Furthermore, HigherHRNet achieves new state-of-the-art result on COCO test-dev (70.5% AP) without using refinement or other post-processing techniques, surpassing all existing bottom-up methods. HigherHRNet even surpasses all top-down methods on CrowdPose test (67.6% AP), suggesting its robustness in crowded scene.
科研通智能强力驱动
Strongly Powered by AbleSci AI