Multimodal Co-attention Transformer for Video-Based Personality Understanding

可解释性 模式 计算机科学 人工智能 人格 机器学习 可视化 多媒体 心理学 社会心理学 社会科学 社会学
作者
Mingwei Sun,Kunpeng Zhang
标识
DOI:10.1109/bigdata59044.2023.10386376
摘要

Video has emerged as a pervasive medium for communication, entertainment, and information sharing. With the consumption of video content continuing to increase rapidly, understanding the impact of visual narratives on personality has become a crucial area of research. While text-based personality understanding has been extensively studied in the literature, video-based personality prediction remains relatively under-explored. Existing approaches to video-based personality prediction can be broadly categorized into two directions: learning a joint representation of audio and visual information using fully-connected feed-forward networks, and separating a video into its individual modalities (text, image, and audio), training each modality independently, and then ensembling the results for subsequent personality prediction. However, both approaches have notable limitations: ignoring complex interactions between visual and audio components, or considering all three modalities but not in a joint manner. Furthermore, all methods require high computational costs as they require high-resolution images to train. In this paper, we propose a novel Multimodal Co-attention Transformer neural network for video-based affect prediction. Our approach simultaneously models audio, visual, and text representations, as well as their inter-relations, to achieve accurate and efficient predictions. We demonstrate the effectiveness of our method via extensive experiments on a real-world dataset: First Impressions. Our results show that the proposed model outperforms state-of-the-art approaches while maintaining high computational efficiency. In addition to our performance evaluation, we also conduct interpretability analyses to investigate the contribution across different levels. Our findings reveal valuable insights into personality predictions. The implementation is available at: https://github.com/nestor-sun/mcoattention.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
5秒前
wang发布了新的文献求助10
6秒前
6秒前
迅凡波发布了新的文献求助10
7秒前
9秒前
智商洼地发布了新的文献求助10
9秒前
9秒前
小小应助4652376采纳,获得30
9秒前
海带完成签到,获得积分10
9秒前
yiiinng完成签到,获得积分10
10秒前
hxw完成签到,获得积分10
10秒前
ssion完成签到 ,获得积分10
11秒前
xiaojie发布了新的文献求助10
11秒前
dew应助李李李采纳,获得80
11秒前
青柠发布了新的文献求助10
12秒前
嘻嘻哈哈完成签到,获得积分10
12秒前
12秒前
勤恳小夏完成签到,获得积分20
13秒前
HHHHTTTT完成签到,获得积分20
13秒前
山桐发布了新的文献求助10
13秒前
yy完成签到,获得积分10
14秒前
地球发布了新的文献求助10
15秒前
15秒前
糖醋鱼发布了新的文献求助10
15秒前
16秒前
17秒前
zwx0201完成签到,获得积分10
17秒前
111发布了新的文献求助10
17秒前
点点完成签到,获得积分10
18秒前
xokey发布了新的文献求助10
18秒前
Owen应助SinfulG采纳,获得30
18秒前
123456完成签到 ,获得积分10
18秒前
852应助li采纳,获得10
18秒前
搜集达人应助勤恳小夏采纳,获得10
19秒前
19秒前
充电宝应助危机的百褶裙采纳,获得10
20秒前
鸽子的迷信完成签到,获得积分10
20秒前
Jasper应助一只学医的小杨采纳,获得10
20秒前
21秒前
hxw发布了新的文献求助10
22秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Organometallic Chemistry of the Transition Metals 800
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6442190
求助须知:如何正确求助?哪些是违规求助? 8256014
关于积分的说明 17580099
捐赠科研通 5500765
什么是DOI,文献DOI怎么找? 2900436
邀请新用户注册赠送积分活动 1877361
关于科研通互助平台的介绍 1717191