计算机科学
水准点(测量)
背景(考古学)
人工智能
任务(项目管理)
代表(政治)
帧(网络)
卷积神经网络
机器学习
卷积(计算机科学)
比例(比率)
人工神经网络
模式识别(心理学)
经济
政治学
地理
法学
管理
量子力学
电信
古生物学
大地测量学
物理
政治
生物
作者
Siyang Song,Shashank Jaiswal,Linlin Shen,Michel Valstar
出处
期刊:IEEE Transactions on Affective Computing
[Institute of Electrical and Electronics Engineers]
日期:2020-01-30
卷期号:13 (2): 829-844
被引量:87
标识
DOI:10.1109/taffc.2020.2970712
摘要
Depression is a serious mental disorder affecting millions of people all over the world. Traditional clinical diagnosis methods are subjective, complicated and require extensive participation of clinicians. Recent advances in automatic depression analysis systems promise a future where these shortcomings are addressed by objective, repeatable, and readily available diagnostic tools to aid health professionals in their work. Yet there remain a number of barriers to the development of such tools. One barrier is that existing automatic depression analysis algorithms base their predictions on very brief sequential segments, sometimes as little as one frame. Another barrier is that existing methods do not take into account what the context of the measured behaviour is. In this article, we extract multi-scale video-level features for video-based automatic depression analysis. We propose to use automatically detected human behaviour primitives as the low-dimensional descriptor for each frame. We also propose two novel spectral representations, i.e., spectral heatmaps and spectral vectors, to represent video-level multi-scale temporal dynamics of expressive behaviour. Constructed spectral representations are fed to Convolution Neural Networks (CNNs) and Artificial Neural Networks (ANNs) for depression analysis. We conducted experiments on the AVEC 2013 and AVEC 2014 benchmark datasets to investigate the influence of interview tasks on depression analysis. In addition to achieving state of the art accuracy in severity of depression estimation, we show that the task conducted by the user matters, that fusion of a combination of tasks reaches highest accuracy, and that longer tasks are more informative than shorter tasks, up to a point.
科研通智能强力驱动
Strongly Powered by AbleSci AI