计算机科学
人工智能
机器学习
深度学习
人工神经网络
桥接(联网)
监督学习
噪音(视频)
一般化
深层神经网络
随机梯度下降算法
编码(集合论)
数学分析
图像(数学)
数学
集合(抽象数据类型)
程序设计语言
计算机网络
作者
Lang Huang,Chao Zhang,Chao Zhang
标识
DOI:10.1109/tpami.2022.3217792
摘要
We propose self-adaptive training-a unified training algorithm that dynamically calibrates and enhances training processes by model predictions without incurring an extra computational cost-to advance both supervised and self-supervised learning of deep neural networks. We analyze the training dynamics of deep networks on training data that are corrupted by, e.g., random noise and adversarial examples. Our analysis shows that model predictions are able to magnify useful underlying information in data and this phenomenon occurs broadly even in the absence of any label information, highlighting that model predictions could substantially benefit the training processes: self-adaptive training improves the generalization of deep networks under noise and enhances the self-supervised representation learning. The analysis also sheds light on understanding deep learning, e.g., a potential explanation of the recently-discovered double-descent phenomenon in empirical risk minimization and the collapsing issue of the state-of-the-art self-supervised learning algorithms. Experiments on the CIFAR, STL, and ImageNet datasets verify the effectiveness of our approach in three applications: classification with label noise, selective classification, and linear evaluation. To facilitate future research, the code has been made publicly available at https://github.com/LayneH/self-adaptive-training.
科研通智能强力驱动
Strongly Powered by AbleSci AI