计算机科学
深度学习
特征工程
人工智能
卷积神经网络
情态动词
机器学习
特征学习
模式
人工神经网络
变压器
特征提取
特征(语言学)
背景(考古学)
数据建模
工程类
社会科学
数据库
语言学
高分子化学
化学
电压
古生物学
社会学
哲学
电气工程
生物
作者
Genevieve Lam,Dongyan Huang,Weisi Lin
标识
DOI:10.1109/icassp.2019.8683027
摘要
In this study, we focus on automated approaches to detect depression from clinical interviews using machine learning approached, which the models are trained on multi-modal data. Differentiating from successful machine learning approaches such as context-aware analysis through feature engineering and end-to-end deep neural networks to depression detection utilizing the Distress Analysis Interview Corpus, we propose a novel method that incorporates a data augmentation procedure based on topic modelling using transformer and deep 1D convolutional neural network (CNN) for acoustic feature modeling. The simulation results demonstrate the effectiveness of the proposed method for training multi-modal deep learning models. Our deep 1D CNN and transformer models achieve the state-of-the-art performance for the audio and text modalities respectively, while our multi-modal results are comparable with the state-of-the-art depression detection systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI