计算机科学
杠杆(统计)
情态动词
边距(机器学习)
人工智能
模态(人机交互)
模式
过程(计算)
骨料(复合)
对比度(视觉)
保险丝(电气)
多模式学习
机器学习
自然语言处理
工程类
社会学
复合材料
化学
高分子化学
材料科学
电气工程
操作系统
社会科学
作者
Longzheng Wang,Chuang Zhang,Hui Xu,Yongxiu Xu,Xiao‐Jun Xu,Siqi Wang
标识
DOI:10.1145/3581783.3613850
摘要
Automatic detection of multimodal fake news has gained a widespread attention recently. Many existing approaches seek to fuse unimodal features to produce multimodal news representations. However, the potential of powerful cross-modal contrastive learning methods for fake news detection has not been well exploited. Besides, how to aggregate features from different modalities to boost the performance of the decision-making process is still an open question. To address that, we propose COOLANT, a cross-modal contrastive learning framework for multimodal fake news detection, aiming to achieve more accurate image-text alignment. To further improve the alignment precision, we leverage an auxiliary task to soften the loss term of negative samples during the contrast process. A cross-modal fusion module is developed to learn the cross-modality correlations. An attention mechanism with an attention guidance module is implemented to help effectively and interpretably aggregate the aligned unimodal representations and the cross-modality correlations. Finally, we evaluate the COOLANT and conduct a comparative study on two widely used datasets, Twitter and Weibo. The experimental results demonstrate that our COOLANT outperforms previous approaches by a large margin and achieves new state-of-the-art results on the two datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI