计算机科学
深度学习
人工智能
情态动词
萧条(经济学)
背景(考古学)
地质学
材料科学
古生物学
宏观经济学
经济
高分子化学
作者
Genevieve Lam,Dongyan Huang,Weisi Lin
标识
DOI:10.1109/icassp.2019.8683027
摘要
In this study, we focus on automated approaches to detect depression from clinical interviews using multi-modal machine learning (ML). Our approach differentiates from other successful ML methods such as context-aware analysis through feature engineering and end-to-end deep neural networks for depression detection utilizing the Distress Analysis Interview Corpus. We propose a novel method that incorporates: (1) pre-trained Transformer combined with data augmentation based on topic modelling for textual data; and (2) deep 1D convolutional neural network (CNN) for acoustic feature modeling. The simulation results demonstrate the effectiveness of the proposed method for training multi-modal deep learning models. Our deep 1D CNN and Transformer models achieved state-of-the-art performance for audio and text modalities respectively. Combining them in a multi-modal framework also outperforms state-of-the-art for the combined setting. Code available at https://github.com/genandlam/multi-modal-depression-detection
科研通智能强力驱动
Strongly Powered by AbleSci AI