基本事实
计算机科学
合成数据
人工智能
解析
域适应
一般化
一致性(知识库)
桥(图论)
机器学习
任务(项目管理)
标记数据
领域(数学分析)
模式识别(心理学)
图像(数学)
计算机视觉
数学
数学分析
内科学
经济
分类器(UML)
医学
管理
作者
Jiteng Mu,Weihong Qiu,Gregory D. Hager,Alan Yuille
出处
期刊:Cornell University - arXiv
日期:2019-12-17
被引量:1
标识
DOI:10.48550/arxiv.1912.08265
摘要
Despite great success in human parsing, progress for parsing other deformable articulated objects, like animals, is still limited by the lack of labeled data. In this paper, we use synthetic images and ground truth generated from CAD animal models to address this challenge. To bridge the domain gap between real and synthetic images, we propose a novel consistency-constrained semi-supervised learning method (CC-SSL). Our method leverages both spatial and temporal consistencies, to bootstrap weak models trained on synthetic data with unlabeled real images. We demonstrate the effectiveness of our method on highly deformable animals, such as horses and tigers. Without using any real image label, our method allows for accurate keypoint prediction on real images. Moreover, we quantitatively show that models using synthetic data achieve better generalization performance than models trained on real images across different domains in the Visual Domain Adaptation Challenge dataset. Our synthetic dataset contains 10+ animals with diverse poses and rich ground truth, which enables us to use the multi-task learning strategy to further boost models' performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI