计算机科学
语音识别
稳健性(进化)
情绪识别
人工智能
特征(语言学)
融合
模式识别(心理学)
自然语言处理
语言学
生物化学
基因
哲学
化学
作者
Chunyi Wang,Ying Ren,Na Zhang,Fuwei Cui,Shiying Luo
标识
DOI:10.1007/s11042-021-10553-4
摘要
A speech emotion recognition algorithm based on multi-feature and Multi-lingual fusion is proposed in order to resolve low recognition accuracy caused bylack of large speech dataset and low robustness of acoustic features in the recognition of speech emotion. First, handcrafted and deep automatic features are extractedfrom existing data in Chinese and English speech emotions. Then, the various features are fused respectively. Finally, the fused features of different languages are fused again and trained in a classification model. Distinguishing the fused features with the unfused ones, the results manifest that the fused features significantly enhance the accuracy of speech emotion recognition algorithm. The proposedsolution is evaluated on the two Chinese corpus and two English corpus, and isshown to provide more accurate predictions compared to original solution. As a result of this study, the multi-feature and Multi-lingual fusion algorithm can significantly improve the speech emotion recognition accuracy when the dataset is small.
科研通智能强力驱动
Strongly Powered by AbleSci AI