计算机科学
人工智能
面部表情
支持向量机
Mel倒谱
直方图
语音识别
特征提取
特征(语言学)
模式识别(心理学)
特征向量
手势
语言学
哲学
图像(数学)
作者
Asim Jan,Hongying Meng,Yona Falinie binti Abd Gaus,Fan Zhang
出处
期刊:IEEE Transactions on Cognitive and Developmental Systems
[Institute of Electrical and Electronics Engineers]
日期:2017-07-31
卷期号:10 (3): 668-680
被引量:166
标识
DOI:10.1109/tcds.2017.2721552
摘要
A human being's cognitive system can be simulated by artificial intelligent systems. Machines and robots equipped with cognitive capability can automatically recognize a humans mental state through their gestures and facial expressions. In this paper, an artificial intelligent system is proposed to monitor depression. It can predict the scales of Beck depression inventory II (BDI-II) from vocal and visual expressions. First, different visual features are extracted from facial expression images. Deep learning method is utilized to extract key visual features from the facial expression frames. Second, spectral low-level descriptors and mel-frequency cepstral coefficients features are extracted from short audio segments to capture the vocal expressions. Third, feature dynamic history histogram (FDHH) is proposed to capture the temporal movement on the feature space. Finally, these FDHH and audio features are fused using regression techniques for the prediction of the BDI-II scales. The proposed method has been tested on the public Audio/Visual Emotion Challenges 2014 dataset as it is tuned to be more focused on the study of depression. The results outperform all the other existing methods on the same dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI