Spectral Representation of Behaviour Primitives for Depression Analysis

计算机科学水准点（测量）背景（考古学）人工智能任务（项目管理）代表（政治）帧（网络）卷积神经网络机器学习卷积（计算机科学）比例（比率）人工神经网络模式识别（心理学）古生物学电信物理管理大地测量学量子力学政治政治学法学经济生物地理

作者

Siyang Song,Shashank Jaiswal,Linlin Shen,Michel Valstar

出处

期刊：IEEE Transactions on Affective Computing [Institute of Electrical and Electronics Engineers]
日期：2020-01-30 卷期号：13 (2): 829-844 被引量：87

链接

worktribe.com worktribe.comdoi.org

标识

DOI：10.1109/taffc.2020.2970712

摘要

Depression is a serious mental disorder affecting millions of people all over the world. Traditional clinical diagnosis methods are subjective, complicated and require extensive participation of clinicians. Recent advances in automatic depression analysis systems promise a future where these shortcomings are addressed by objective, repeatable, and readily available diagnostic tools to aid health professionals in their work. Yet there remain a number of barriers to the development of such tools. One barrier is that existing automatic depression analysis algorithms base their predictions on very brief sequential segments, sometimes as little as one frame. Another barrier is that existing methods do not take into account what the context of the measured behaviour is. In this article, we extract multi-scale video-level features for video-based automatic depression analysis. We propose to use automatically detected human behaviour primitives as the low-dimensional descriptor for each frame. We also propose two novel spectral representations, i.e., spectral heatmaps and spectral vectors, to represent video-level multi-scale temporal dynamics of expressive behaviour. Constructed spectral representations are fed to Convolution Neural Networks (CNNs) and Artificial Neural Networks (ANNs) for depression analysis. We conducted experiments on the AVEC 2013 and AVEC 2014 benchmark datasets to investigate the influence of interview tasks on depression analysis. In addition to achieving state of the art accuracy in severity of depression estimation, we show that the task conducted by the user matters, that fusion of a combination of tasks reaches highest accuracy, and that longer tasks are more informative than shorter tasks, up to a point.

求助该文献

最长约 10秒，即可获得该文献文件

Spectral Representation of Behaviour Primitives for Depression Analysis

今日热心研友