计算机科学
语音识别
卷积神经网络
特征(语言学)
人工智能
情绪识别
模式识别(心理学)
特征选择
特征提取
架空(工程)
选择(遗传算法)
过程(计算)
人工神经网络
机器学习
哲学
语言学
操作系统
出处
期刊:Ubiquitous Intelligence and Computing
日期:2021-10-01
卷期号:: 154-159
被引量:7
标识
DOI:10.1109/swc50871.2021.00030
摘要
Speech emotion recognition is an essential part of enhancing human-computer interaction, and its recognition accuracy depends on the selection of speech signal features and the construction of classification models. In this paper, we propose a speech emotion recognition framework. It is based on the XGBoost algorithm for feature selection, and combines the ID convolutional neural networks and the BLSTM (bi- directional long short-term memory) with attention model for classification. By calculating the importance score of each feature, the appropriate number of effective features can be obtained to alleviate the over-fitting problem, reduce the training overhead, accelerate the modeling process and improve the recognition accuracy. The proposed framework is evaluated on EmoDB, CASIA, EMA corpora, and is shown to provide more accurate predictions compared to the baseline method by achieving the recognition accuracy of 86.87%, 74.17%, 98.04%, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI