计算机科学
卷积神经网络
一般化
人工智能
水准点(测量)
领域(数学分析)
特征(语言学)
模式识别(心理学)
特征提取
机器学习
元学习(计算机科学)
数学
数学分析
语言学
哲学
管理
大地测量学
经济
任务(项目管理)
地理
作者
Chaoqun Wan,Xu Shen,Yonggang Zhang,Zhiheng Yin,Xinmei Tian,Feng Gao,Jianqiang Huang,Xian-Sheng Hua
标识
DOI:10.1109/cvpr52688.2022.00464
摘要
In single domain generalization, models trained with data from only one domain are required to perform well on many unseen domains. In this paper, we propose a new model, termed meta convolutional neural network, to solve the single domain generalization problem in image recognition. The key idea is to decompose the convolutional features of images into meta features. Acting as “visual words”, meta features are defined as universal and basic visual elements for image representations (like words for documents in language). Taking meta features as reference, we propose compositional operations to eliminate irrelevant features of local convolutional features by an addressing process and then to reformulate the convolutional feature maps as a composition of related meta features. In this way, images are universally coded without biased information from the unseen domain, which can be processed by following modules trained in the source domain. The compositional operations adopt a regression analysis technique to learn the meta features in an online batch learning manner. Extensive experiments on multiple benchmark datasets verify the superiority of the proposed model in improving single domain generalization ability.
科研通智能强力驱动
Strongly Powered by AbleSci AI