计算机科学
鉴别器
发电机(电路理论)
人工智能
情态动词
子空间拓扑
机器学习
特征学习
利用
相似性(几何)
功率(物理)
图像(数学)
高分子化学
化学
物理
探测器
电信
量子力学
计算机安全
作者
Zhuoyi Li,Hangjun Lu,Hao Fu,Zhongrui Wang,Guanghua Gu
标识
DOI:10.1016/j.engappai.2023.106439
摘要
There exists a heterogeneity gap between multi-modal data, hence it is difficult to directly measure the similarity between them. A common way to solve the problem is representation learning. Due to the unique adversarial optimization principle and the efficient cross-modal correlation learning ability, the cross-modal retrieval based on Generative Adversarial Network (GAN) has received significant attention recently. However, the potential semantic information is not fully explored in most GAN-based cross-modal learning approaches. In this paper, we propose a novel Adaptive Adversarial Learning (AAL) based cross-modal retrieval method. The generator of a specific modality projects heterogeneous data into the potential common subspace, while the discriminator is against the generator to maintain discriminability. In addition, three task-specific loss functions are designed in the generators to comprehensively exploit the semantic and label information. One problem is that directly optimizing the generator network will lead to ignoring the assessment of contribution to multi-loss functions. To overcome the above challenge, we present an adaptive balance strategy to match the appropriate contribution for each loss according to the degree of dispersion. Comprehensive experimental results on three widely-used databases show that the proposed method is effective and superior to the existing cross-modal retrieval methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI