水准点(测量)
计算机科学
匹配(统计)
情态动词
整改
样品(材料)
噪音(视频)
模式识别(心理学)
人工智能
k-最近邻算法
图像(数学)
零(语言学)
算法
数学
统计
工程类
高分子化学
语言学
化学
哲学
大地测量学
色谱法
电压
电气工程
地理
作者
Yuqing Li,Haoming Huang,Jian Xu,Shao‐Lun Huang
标识
DOI:10.1109/icassp48485.2024.10448059
摘要
The presence of noisy correspondence within cross-modal matching has significantly undermined the performance of existing matching methods. In this paper, we introduce a robust framework named Neighbor Auxiliary Corrector (NAC) for alleviating noise by utilizing the neighbors, which are indicative of similar textual targets. NAC is inspired by an observation that similar texts tend to correspond to similar images. Leveraging the zero-shot capabilities of Pre-trained Language Models (PLMs), we identify the top-k nearest neighbors for each positive image-text pair. Subsequently, the side information provided by these neighbors is harnessed for both sample verification and sample rectification. Extensive experiments on benchmark datasets demonstrate that our framework can significantly boost the performance and is more robust to various levels of noisy correspondence.
科研通智能强力驱动
Strongly Powered by AbleSci AI