模态(人机交互)
模式
计算机科学
人工智能
代表(政治)
情绪分析
融合
自然语言处理
模式识别(心理学)
语言学
政治学
社会科学
政治
哲学
社会学
法学
作者
Jian Huang,Yanli Ji,Zhen Qin,Yang Yang,Heng Tao Shen
标识
DOI:10.1109/tmm.2023.3344358
摘要
Multimodal sentiment analysis remains a big challenge due to the lack of effective fusion solutions. An effective fusion is expected to obtain the correct semantic representation for all modalities, and simultaneously thoroughly explore the contribution of each modality. In this paper, we propose a dominant SIngle-Modal SUpplementary Fusion (SIMSUF) approach to perform effective multimodal fusion for sentiment analysis. The SIMSUF is composed of three major components, a dominant modality supplementary module, a modality enhancement module, and a multimodal fusion module. The dominant modality supplementary module realizes dominant modality determination by estimating mutual dependence between every two modalities, and then the dominant modality is adopted to supplement other modalities for representative feature learning. To further explore the modality contribution, we propose a two-branch modality enhancement module, where one branch learns common representation distribution for multiple modalities, and simultaneously a specific modality enhancement branch is presented to perform semantic difference enhancement and distribution difference enhancement for each modality. Finally, a dominant modality leading fusion module is designed to fuse multimodal representations of two branches for sentiment analysis. Extensive experiments are evaluated on the CMU-MOSEI and CMU-MOSI datasets. Experiment results certify that our approach is superior to the state-of-the-art approaches. The source code of this work is available at https://github.com/HumanCenteredUndestanding/SIMSUF .
科研通智能强力驱动
Strongly Powered by AbleSci AI