Computer science
Pattern
Discriminative
Artificial intelligence
Inference
Robustness (evolution)
Leverage (statistics)
Modality
Exploit
Constraint (computer-aided design)
Machine learning
Shared space
Space (punctuation)
Mechanical engineering
Chemistry
Polymer chemistry
Engineering
Social science
Biochemistry
Computer security
Sociology
Gene
Operating system
Authors
Chiqin Li, Liang-Liang Xie, Xingmao Shao, Hang Pan, Zhiliang Wang
Identifier
DOI: 10.1016/j.engappai.2024.108413
Abstract
Continuous emotion recognition has been a compelling topic in affective computing because it can interpret human emotions subtly and continuously. Existing studies have achieved advanced emotion recognition performance using multimodal knowledge. However, these studies generally ignore circumstances where particular modalities are missing in the inference phase and thus become sensitive to the absence of modalities. To resolve this issue, we propose a novel multimodal shared network with a cross-modal distribution constraint, i.e., the DS-Net, which aims to improve the robustness of the model to missing modalities. The training process of the proposed network comprises two components: multimodal shared space modeling and a cross-modal distribution matching constraint. The former utilizes the local and temporal information of multimodal signals for multimodal shared space modeling, while the latter further enhances the multimodal shared space via a loose constraint method. Coupled with the latter, the former can effectively exploit the complementarity between videos and peripheral physiological signals (PPSs), thus enhancing the discriminative capability of the shared space. Based on the shared space, the DS-Net works during the inference phase with only one modality as input and can leverage multimodal knowledge to improve emotion recognition accuracy. Comprehensive experiments were conducted on two public datasets. The results demonstrate that the proposed method is competitive with or superior to current state-of-the-art methods. Further experiments indicate that the proposed method can be extended to handle other modalities and to deal with partially missing modalities, demonstrating its potential in real-world applications.
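To make the two training components described in the abstract concrete, below is a minimal PyTorch sketch, not the authors' implementation: the per-modality GRU encoders, the choice of a linear-kernel MMD as the loose distribution-matching constraint, the regression head, and all dimensions and loss weights are illustrative assumptions, since the abstract does not specify them. The sketch shows the key property claimed in the abstract: after training, either encoder alone can drive the shared-space head, so inference tolerates a missing modality.

```python
# Minimal sketch of shared-space training with a cross-modal distribution
# constraint. NOT the paper's DS-Net: architectures, the MMD loss choice,
# dimensions, and the 0.1 trade-off weight are all illustrative assumptions.
import torch
import torch.nn as nn

class ModalityEncoder(nn.Module):
    """Maps one modality's sequence into the shared space.

    A single GRU stands in for the paper's 'local and temporal' modeling;
    the real architecture is not described in the abstract.
    """
    def __init__(self, in_dim: int, shared_dim: int):
        super().__init__()
        self.rnn = nn.GRU(in_dim, shared_dim, batch_first=True)

    def forward(self, x):                   # x: (batch, time, in_dim)
        out, _ = self.rnn(x)
        return out[:, -1]                   # last step as the shared embedding

def mmd_loss(a, b):
    """Linear-kernel MMD^2 between two embedding batches: the squared
    distance between batch means, used as a loose distribution constraint."""
    return (a.mean(0) - b.mean(0)).pow(2).sum()

video_enc = ModalityEncoder(in_dim=512, shared_dim=128)  # video features
pps_enc   = ModalityEncoder(in_dim=32,  shared_dim=128)  # peripheral signals
head      = nn.Linear(128, 2)               # valence/arousal regression head
opt = torch.optim.Adam(
    list(video_enc.parameters()) + list(pps_enc.parameters())
    + list(head.parameters()),
    lr=1e-3,
)

# Dummy batch: 8 clips, 50 time steps per modality, continuous labels.
video, pps = torch.randn(8, 50, 512), torch.randn(8, 50, 32)
labels = torch.randn(8, 2)

# Component 1: both modalities are embedded into one shared space and
# trained on the same task head.
z_v, z_p = video_enc(video), pps_enc(pps)
task = (nn.functional.mse_loss(head(z_v), labels)
        + nn.functional.mse_loss(head(z_p), labels))
# Component 2: pull the two modalities' embedding distributions together.
loss = task + 0.1 * mmd_loss(z_v, z_p)      # 0.1 is an assumed weight
opt.zero_grad(); loss.backward(); opt.step()

# Inference with a missing modality: only one encoder is needed.
with torch.no_grad():
    pred = head(pps_enc(pps))               # video absent at test time
```

A mean-matching MMD is deliberately weaker than forcing paired embeddings to coincide, which is one plausible reading of the abstract's "loose constraint method"; the aligned distributions are what let a single-modality input reuse the multimodal shared space at test time.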