一般化
计算机科学
判别式
卷积神经网络
人工智能
领域(数学分析)
概化理论
集合(抽象数据类型)
特征(语言学)
素描
机器学习
混合(物理)
航程(航空)
模式识别(心理学)
强化学习
算法
数学
统计
物理
数学分析
哲学
复合材料
量子力学
语言学
材料科学
程序设计语言
作者
Kaiyang Zhou,Yongxin Yang,Yu Qiao,Tao Xiang
出处
期刊:Cornell University - arXiv
日期:2021-01-01
被引量:123
标识
DOI:10.48550/arxiv.2104.02008
摘要
Though convolutional neural networks (CNNs) have demonstrated remarkable ability in learning discriminative features, they often generalize poorly to unseen domains. Domain generalization aims to address this problem by learning from a set of source domains a model that is generalizable to any unseen domain. In this paper, a novel approach is proposed based on probabilistically mixing instance-level feature statistics of training samples across source domains. Our method, termed MixStyle, is motivated by the observation that visual domain is closely related to image style (e.g., photo vs.~sketch images). Such style information is captured by the bottom layers of a CNN where our proposed style-mixing takes place. Mixing styles of training instances results in novel domains being synthesized implicitly, which increase the domain diversity of the source domains, and hence the generalizability of the trained model. MixStyle fits into mini-batch training perfectly and is extremely easy to implement. The effectiveness of MixStyle is demonstrated on a wide range of tasks including category classification, instance retrieval and reinforcement learning.
科研通智能强力驱动
Strongly Powered by AbleSci AI