计算机科学
人工智能
规范化(社会学)
可转让性
人工神经网络
深度学习
卷积神经网络
机器学习
模式识别(心理学)
学习迁移
深层神经网络
过度拟合
训练集
作者
Ximei Wang,Ying Jin,Mingsheng Long,Jianmin Wang,Michael I. Jordan
出处
期刊:Neural Information Processing Systems
日期:2019-09-06
卷期号:32: 1951-1961
被引量:77
摘要
Deep neural networks (DNNs) excel at learning representations when trained on large-scale datasets. Pre-trained DNNs also show strong transferability when fine-tuned to other labeled datasets. However, such transferability becomes weak when the target dataset is fully unlabeled as in Unsupervised Domain Adaptation (UDA). We envision that the loss of transferability may stem from the intrinsic limitation of the architecture design of DNNs. In this paper, we delve into the components of DNN architectures and propose Transferable Normalization (TransNorm) in place of existing normalization techniques. TransNorm is an end-to-end trainable layer to make DNNs more transferable across domains. As a general method, TransNorm can be easily applied to various deep neural networks and domain adaption methods, without introducing any extra hyper-parameters or learnable parameters. Empirical results justify that TransNorm not only improves classification accuracies but also accelerates convergence for mainstream DNN-based domain adaptation methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI