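Extreme learning machine
Computer science
Artificial intelligence
Weighting
Feature extraction
Pattern recognition (psychology)
Classifier (UML)
Artificial neural network
Speech recognition
Feature (linguistics)
Gabor filter
Filter (signal processing)
Machine learning
Computer vision
Philosophy
Radiology
Medicine
Linguistics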
Authors
Fatemeh Daneshfar, Seyed Jahanshah Kabudian
Identifier
DOI:10.1109/iccke54056.2021.9721524
Abstract
Research in affective computing has grown in importance with the rising popularity of intelligent systems and human-machine interfaces. This work proposes a speech emotion recognition (SER) system that introduces new techniques at several stages. In the feature extraction stage, the system extracts features from both speech and glottal signals, including spectro-temporal features obtained from a Gabor filter bank (GBFB) and a separate Gabor filter bank (SGBFB), which have not previously been used for SER. In the classification stage, a hierarchical adaptive weighted multi-layer extreme learning machine (H-AWELM) is employed. This hybrid classifier consists of two parts: the first performs sparse unsupervised feature learning using a multi-layer neural network (NN) with sparse extreme learning machine auto-encoder (ELM-AE) layers, and the second performs feature classification in the last layer using Tikhonov-regularized least squares (LS). A major problem in multi-class ELM training is how to handle data imbalance. This paper presents an adaptive weighting method to address this problem that can be more accurate than current weighting methods. Finally, the proposed system is evaluated on emotion recognition using the EMODB dataset.
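To make the classification stage more concrete, the following is a minimal sketch of a single-layer weighted ELM whose output weights are solved by Tikhonov-regularized least squares. It is not the paper's H-AWELM: it has no ELM-AE layers, and the per-sample weights use a fixed inverse-class-frequency rule as a stand-in, since the abstract does not specify the adaptive weighting scheme. The function names, the `tanh` activation, and the default values of `n_hidden` and `reg` are all assumptions made for illustration.

```python
import numpy as np

def train_weighted_elm(X, y, n_hidden=50, reg=1e-3, seed=0):
    """Fit a single-hidden-layer ELM with class-balancing sample weights.

    Assumption: weights are 1 / (class size), a common fixed scheme,
    not the adaptive weighting proposed in the paper.
    """
    rng = np.random.default_rng(seed)
    classes = np.unique(y)  # sorted unique labels

    # Random hidden layer: these parameters are drawn once and never trained.
    W = rng.standard_normal((X.shape[1], n_hidden))
    b = rng.standard_normal(n_hidden)
    H = np.tanh(X @ W + b)

    # Targets: +1 for the true class, -1 elsewhere.
    T = np.where(y[:, None] == classes[None, :], 1.0, -1.0)

    # Per-sample weight so that every class contributes equally to the loss.
    counts = np.array([(y == c).sum() for c in classes])
    w = 1.0 / counts[np.searchsorted(classes, y)]

    # Weighted Tikhonov-regularized least squares:
    #   (H^T D H + reg * I) beta = H^T D T,  with D = diag(w)
    HtD = H.T * w
    beta = np.linalg.solve(HtD @ H + reg * np.eye(n_hidden), HtD @ T)
    return W, b, beta, classes

def predict_elm(model, X):
    """Return the class whose output unit has the largest response."""
    W, b, beta, classes = model
    return classes[np.argmax(np.tanh(X @ W + b) @ beta, axis=1)]
```

Calling `train_weighted_elm(X, y)` with a feature matrix `X` of shape `(n_samples, n_features)` and a label vector `y` returns a model tuple that `predict_elm` accepts. Because the hidden layer is random and fixed, training reduces to one linear solve, which is the property that makes ELM training fast.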