计算机科学
预处理器
人工智能
数据预处理
分类器(UML)
变压器
协方差
安全性令牌
特征提取
空间分析
模式识别(心理学)
物理
遥感
数学
量子力学
电压
地质学
统计
计算机安全
作者
Yongfeng Tao,Minqiang Yang,Huiru Li,Yu-Shan Wu,Bin Hu
出处
期刊:IEEE Transactions on Knowledge and Data Engineering
[Institute of Electrical and Electronics Engineers]
日期:2024-01-05
卷期号:36 (7): 2956-2966
被引量:8
标识
DOI:10.1109/tkde.2024.3350071
摘要
Depression is one of the most common mental illnesses, but few of the currently proposed in-depth models based on social media data take into account both temporal and spatial information in the data for the detection of depression. In this paper, we present an efficient, low-covariance multimodal integrated spatio-temporal converter framework called DepMSTAT, which aims to detect depression using acoustic and visual features in social media data. The framework consists of four modules: a data preprocessing module, a token generation module, a Spatial-Temporal Attentional Transformer (STAT) module, and a depression classifier module. To efficiently capture spatial and temporal correlations in multimodal social media depression data, a plug-and-play STAT module is proposed. The module is capable of extracting unimodal spatio-temporal features and fusing unimodal information, playing a key role in the analysis of acoustic and visual features in social media data. Through extensive experiments on a depression database (D-Vlog), the method in this paper shows high accuracy (71.53%) in depression detection, achieving a performance that exceeds most models. This work provides a scaffold for studies based on multimodal data that assists in the detection of depression.
科研通智能强力驱动
Strongly Powered by AbleSci AI