计算机科学
模式
人工智能
特征(语言学)
模态(人机交互)
情绪分析
机器学习
特征提取
任务(项目管理)
模式识别(心理学)
社会科学
语言学
哲学
社会学
经济
管理
作者
Huifang Sun,Hongyi Wang,Jiaqing Liu,Yen‐Wei Chen,Lanfen Lin
标识
DOI:10.1145/3503161.3548025
摘要
Multimodal sentiment analysis and depression estimation are two important research topics that aim to predict human mental states using multimodal data. Previous research has focused on developing effective fusion strategies for exchanging and integrating mind-related information from different modalities. Some MLP-based techniques have recently achieved considerable success in a variety of computer vision tasks. Inspired by this, we explore multimodal approaches with a feature-mixing perspective in this study. To this end, we introduce CubeMLP, a multimodal feature processing framework based entirely on MLP. CubeMLP consists of three independent MLP units, each of which has two affine transformations. CubeMLP accepts all relevant modality features as input and mixes them across three axes. After extracting the characteristics using CubeMLP, the mixed multimodal features are flattened for task predictions. Our experiments are conducted on sentiment analysis datasets: CMU-MOSI and CMU-MOSEI, and depression estimation dataset: AVEC2019. The results show that CubeMLP can achieve state-of-the-art performance with a much lower computing cost.
科研通智能强力驱动
Strongly Powered by AbleSci AI