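Regularization (linguistics)
Computer science
Representation (politics)
Maximization
Artificial intelligence
Entropy (arrow of time)
Feature learning
Machine learning
Pattern recognition (psychology)
Mathematical optimization
Mathematics
Political science
Quantum mechanics
Politics
Physics
Law
Authors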
Aiden Durrant, Georgios Leontidis
Identifier
DOI:10.1016/j.imavis.2022.104494
Abstract
Bootstrap Your Own Latent (BYOL) introduced an approach to self-supervised learning avoiding the contrastive paradigm and subsequently removing the computational burden of negative sampling associated with such methods. However, we empirically find that the image representations produced under BYOL's self-distillation paradigm are poorly distributed in representation space compared to contrastive methods. This work empirically demonstrates that feature diversity enforced by contrastive losses is beneficial to image representation uniformity when employed in BYOL, and as such, provides greater inter-class representation separability. Additionally, we explore and advocate the use of regularization methods, specifically the layer-wise minimization of hyperspherical energy (i.e., maximization of entropy) of network weights to encourage representation uniformity. We show that directly optimizing a measure of uniformity alongside the standard loss, or regularizing the networks of the BYOL architecture to minimize the hyperspherical energy of neurons, can produce more uniformly distributed and therefore better performing representations for downstream tasks.
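The abstract names two quantities, a measure of representation uniformity and the hyperspherical energy of neurons, but gives no formulas for either. The following is a minimal sketch under stated assumptions, not the authors' implementation. It assumes the commonly used pairwise Gaussian-potential form for uniformity and a Riesz s-kernel form for hyperspherical energy. The function names and the defaults `t=2.0`, `s=1.0`, and `eps` are illustrative choices, not values taken from the paper.

```python
import numpy as np


def _normalize_rows(x, eps=1e-12):
    # Project each row onto the unit hypersphere.
    norms = np.linalg.norm(x, axis=1, keepdims=True)
    return x / np.maximum(norms, eps)


def _pairwise_sq_dists(x):
    # Squared Euclidean distances between all distinct pairs of rows
    # (each unordered pair counted once).
    diff = x[:, None, :] - x[None, :, :]
    sq = np.sum(diff ** 2, axis=-1)
    upper = np.triu_indices(x.shape[0], k=1)
    return sq[upper]


def uniformity_loss(features, t=2.0):
    # Assumed form: log of the mean Gaussian potential between normalized
    # feature pairs. Lower values mean the features are spread more evenly.
    z = _normalize_rows(np.asarray(features, dtype=float))
    return float(np.log(np.mean(np.exp(-t * _pairwise_sq_dists(z)))))


def hyperspherical_energy(weights, s=1.0, eps=1e-8):
    # Assumed form: Riesz s-energy, the sum of inverse pairwise distances
    # between normalized neuron weight vectors (one neuron per row).
    # Lower energy means the neurons point in more diverse directions.
    w = _normalize_rows(np.asarray(weights, dtype=float))
    dists = np.sqrt(_pairwise_sq_dists(w))
    return float(np.sum(1.0 / (dists + eps) ** s))


if __name__ == "__main__":
    # Four 2-D vectors: evenly spaced on the circle vs. bunched together.
    spread_angles = np.array([0.0, 0.5, 1.0, 1.5]) * np.pi
    bunched_angles = np.array([0.0, 0.1, 0.2, 0.3])
    spread = np.stack([np.cos(spread_angles), np.sin(spread_angles)], axis=1)
    bunched = np.stack([np.cos(bunched_angles), np.sin(bunched_angles)], axis=1)

    print("energy     spread / bunched:",
          hyperspherical_energy(spread), hyperspherical_energy(bunched))
    print("uniformity spread / bunched:",
          uniformity_loss(spread), uniformity_loss(bunched))
```

In a training setup along the lines the abstract describes, a term like `uniformity_loss` would be added to the standard loss on the batch of representations, while a term like `hyperspherical_energy` would be summed over layers as a weight regularizer. The relative weighting of either term is not stated in the abstract and would need to be tuned.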