计算机科学
缺少数据
稳健性(进化)
人工智能
变压器
模态(人机交互)
模式识别(心理学)
机器学习
特征(语言学)
数据挖掘
工程类
基因
电气工程
哲学
生物化学
电压
化学
语言学
作者
Ziqi Yuan,Wei Li,Hua Xu,Wenmeng Yu
标识
DOI:10.1145/3474085.3475585
摘要
Improving robustness against data missing has become one of the core challenges in Multimodal Sentiment Analysis (MSA), which aims to judge speaker sentiments from the language, visual, and acoustic signals. In the current research, translation-based methods and tensor regularization methods are proposed for MSA with incomplete modality features. However, both of them fail to cope with random modality feature missing in non-aligned sequences. In this paper, a transformer-based feature reconstruction network (TFR-Net) is proposed to improve the robustness of models for the random missing in non-aligned modality sequences. First, intra-modal and inter-modal attention-based extractors are adopted to learn robust representations for each element in modality sequences. Then, a reconstruction module is proposed to generate the missing modality features. With the supervision of SmoothL1Loss between generated and complete sequences, TFR-Net is expected to learn semantic-level features corresponding to missing features. Extensive experiments on two public benchmark datasets show that our model achieves good results against data missing across various missing modality combinations and various missing degrees.
科研通智能强力驱动
Strongly Powered by AbleSci AI