子网
计算机科学
人工智能
水准点(测量)
姿势
编码(集合论)
分辨率(逻辑)
光学(聚焦)
高分辨率
深度学习
代表(政治)
机器学习
过程(计算)
模式识别(心理学)
遥感
地理
法学
程序设计语言
地质学
集合(抽象数据类型)
大地测量学
物理
光学
操作系统
政治
计算机安全
政治学
作者
Ke Sun,Bin Xiao,Dong Liu,Jingdong Wang
出处
期刊:Cornell University - arXiv
日期:2019-06-01
被引量:3782
标识
DOI:10.1109/cvpr.2019.00584
摘要
In this paper, we are interested in the human pose estimation problem with a focus on learning reliable high-resolution representations. Most existing methods recover high-resolution representations from low-resolution representations produced by a high-to-low resolution network. Instead, our proposed network maintains high-resolution representations through the whole process. We start from a high-resolution subnetwork as the first stage, gradually add high-to-low resolution subnetworks one by one to form more stages, and connect the mutli-resolution subnetworks in parallel. We conduct repeated multi-scale fusions such that each of the high-to-low resolution representations receives information from other parallel representations over and over, leading to rich high-resolution representations. As a result, the predicted keypoint heatmap is potentially more accurate and spatially more precise. We empirically demonstrate the effectiveness of our network through the superior pose estimation results over two benchmark datasets: the COCO keypoint detection dataset and the MPII Human Pose dataset. In addition, we show the superiority of our network in pose tracking on the PoseTrack dataset. The code and models have been publicly available at https://github.com/leoxiaobin/deep-high-resolution-net.pytorch.
科研通智能强力驱动
Strongly Powered by AbleSci AI