计算机科学
人工智能
推论
姿势
组分(热力学)
探测器
代表(政治)
计算机视觉
钥匙(锁)
图像(数学)
非参数统计
模式识别(心理学)
数学
政治
热力学
统计
物理
电信
计算机安全
法学
政治学
作者
Zhe Cao,Gines Hidalgo,Tomas Simon,Shih-En Wei,Yaser Sheikh
标识
DOI:10.1109/tpami.2019.2929257
摘要
Realtime multi-person 2D pose estimation is a key component in enabling machines to have an understanding of people in images and videos. In this work, we present a realtime approach to detect the 2D pose of multiple people in an image. The proposed method uses a nonparametric representation, which we refer to as Part Affinity Fields (PAFs), to learn to associate body parts with individuals in the image. This bottom-up system achieves high accuracy and realtime performance, regardless of the number of people in the image. In previous work, PAFs and body part location estimation were refined simultaneously across training stages. We demonstrate that a PAF-only refinement rather than both PAF and body part location refinement results in a substantial increase in both runtime performance and accuracy. We also present the first combined body and foot keypoint detector, based on an internal annotated foot dataset that we have publicly released. We show that the combined detector not only reduces the inference time compared to running them sequentially, but also maintains the accuracy of each component individually. This work has culminated in the release of OpenPose, the first open-source realtime system for multi-person 2D pose detection, including body, foot, hand, and facial keypoints.
科研通智能强力驱动
Strongly Powered by AbleSci AI