聚类分析
计算机科学
相互信息
分拆(数论)
自编码
极小极大
人工智能
编码器
数据挖掘
约束聚类
一致性(知识库)
模糊聚类
模式识别(心理学)
机器学习
数学
CURE数据聚类算法
数学优化
人工神经网络
组合数学
操作系统
作者
Yiqiao Mao,Xiaoqiang Yan,Jiaming Liu,Yangdong Ye
标识
DOI:10.1109/tmm.2023.3330093
摘要
Aligning multiple heterogeneous modalities in a parameter-sharing encoder to mine consistent information is a core idea of multimodal learning. However, two drawbacks hinder the development of such methods for clustering tasks: (1) each modality contains a considerable amount of superfluous information that cannot be aligned, impeding the mining of consistent information; (2) one-to-one alignment is contradictory to the clustering principle of minimum intra-cluster distance, leading to suboptimal clustering results. In this paper, we propose a novel Consistency-Guided Multimodal Clustering method (ConGMC) to remove superfluous information within the modalities unsupervised through information theory while improving one-to-one alignment for the clustering task. ConGMC contains multiple unimodal encoders and a multimodal shared encoder, where the former learns unimodal representation while the latter aligns multiple modalities to learn the cluster partition. Specifically, we first construct a mutual information maximin function to distinguish consistent information from superfluous information, in which the consistent and superfluous information are maximally retained and removed, respectively. Then a Clustering-Friendly Alignment strategy (CF-Align) is designed to address the contradiction between the alignment and clustering tasks. CF-Align dynamically adjusts the set of negative samples according to the learned cluster partition to avoid increasing the intra-cluster distance. Finally, we consider the cluster partition as a consistent constraint to optimize the multimodal shared encoder, enabling consistent information to guide the training process iteratively. Moreover, a variational optimization algorithm is proposed to ensure that ConGMC converges to a local optimum. Numerous experimental results on twelve real-world datasets validate that the proposed ConGMC method outperforms the state-of-the-art multimodal clustering methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI