姿势
计算机科学
推论
人工智能
任务(项目管理)
机器学习
航程(航空)
图形模型
图像(数学)
功能(生物学)
卷积(计算机科学)
卷积神经网络
模式识别(心理学)
人工神经网络
进化生物学
生物
复合材料
经济
管理
材料科学
作者
Shih-En Wei,Varun Ramakrishna,Takeo Kanade,Yaser Sheikh
标识
DOI:10.1109/cvpr.2016.511
摘要
Pose Machines provide a sequential prediction framework for learning rich implicit spatial models. In this work we show a systematic design for how convolutional networks can be incorporated into the pose machine framework for learning image features and image-dependent spatial models for the task of pose estimation. The contribution of this paper is to implicitly model long-range dependencies between variables in structured prediction tasks such as articulated pose estimation. We achieve this by designing a sequential architecture composed of convolutional networks that directly operate on belief maps from previous stages, producing increasingly refined estimates for part locations, without the need for explicit graphical model-style inference. Our approach addresses the characteristic difficulty of vanishing gradients during training by providing a natural learning objective function that enforces intermediate supervision, thereby replenishing back-propagated gradients and conditioning the learning procedure. We demonstrate state-of-the-art performance and outperform competing methods on standard benchmarks including the MPII, LSP, and FLIC datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI