Mel倒谱
样本熵
语音识别
倒谱
支持向量机
熵(时间箭头)
模式识别(心理学)
非线性系统
数学
人工智能
计算机科学
特征提取
物理
量子力学
作者
Deying Gan,Weiping Hu,Bingxin Zhao
出处
期刊:PubMed
日期:2014-10-01
卷期号:31 (5): 1149-54
被引量:1
摘要
By analyzing the mechanism of pronunciation, traditional acoustic parameters, including fundamental frequency, Mel frequency cepstral coefficients (MFCC), linear prediction cepstrum coefficient (LPCC), frequency perturbation, amplitude perturbation, and nonlinear characteristic parameters, including entropy (sample entropy, fuzzy entropy, multi-scale entropy), box-counting dimension, intercept and Hurst, are extracted as feature vectors for identification of pathological voice. Seventy-eight normal voice samples and 73 pathological voice samples for /a/, and 78 normal samples and 80 pathological samples for /i/ are recognized based on support vector machine (SVM). The results showed that compared with traditional acoustic parameters, nonlinear characteristic parameters could be well used to distinguish between healthy and pathological voices, and the recognition rates for /a/ were all higher than those for /i/ except for multi-scale entropy. That is why the /a/ sound data is used widely in related research at home and abroad for obtaining better identification of pathological voices. Adopting multi-scale entropy for /i/ could obtain higher recognition rate than /a/ between healthy and pathological samples, which may provide some useful inspiration for evaluating vocal compensatory function.
科研通智能强力驱动
Strongly Powered by AbleSci AI