过滤器组
Mel倒谱
计算机科学
特征提取
语音识别
滤波器(信号处理)
主成分分析
噪音(视频)
模式识别(心理学)
人工智能
特征(语言学)
快速傅里叶变换
倒谱
计算机视觉
算法
语言学
哲学
图像(数学)
作者
Shang-Ming Lee,Shi-Hau Fang,Jeih-weih Hung,Lin-Shan Lee
标识
DOI:10.1109/asru.2001.1034586
摘要
Although Mel-frequency cepstral coefficients (MFCC) have been proven to perform very well under most conditions, some limited efforts have been made in optimizing the shape of the filters in the filter-bank in the conventional MFCC approach. This paper presents a new feature extraction approach that designs the shapes of the filters in the filter-bank. In this new approach, the filter-bank coefficients are data-driven and obtained by applying principal component analysis (PCA) to the FFT spectrum of the training data. The experimental results show that this method is robust under noisy environment and is well additive with other noise-handling techniques.
科研通智能强力驱动
Strongly Powered by AbleSci AI