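Formant
Loudness
Autoregressive model
Vowel
Linear prediction
Speech recognition
Computer science
Mathematics
Spectral density
Statistics
Computer vision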
Source
Journal: Journal of the Acoustical Society of America
[Acoustical Society of America]
Date: 1990-04-01
Volume (issue): pages: 87 (4): 1738-1752
Citations: 2505
Abstract
A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, is presented and examined. This technique uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum: (1) the critical-band spectral resolution, (2) the equal-loudness curve, and (3) the intensity-loudness power law. The auditory spectrum is then approximated by an autoregressive all-pole model. A 5th-order all-pole model is effective in suppressing speaker-dependent details of the auditory spectrum. In comparison with conventional linear predictive (LP) analysis, PLP analysis is more consistent with human hearing. The effective second formant F2' and the 3.5-Bark spectral-peak integration theories of vowel perception are well accounted for. PLP analysis is computationally efficient and yields a low-dimensional representation of speech. These properties are found to be useful in speaker-independent automatic-speech recognition.
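The processing chain named in the abstract (critical-band resolution, equal-loudness weighting, intensity-loudness power law, then an all-pole fit) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract gives no formulas, so the Bark warping, the shape of the critical-band weighting, the equal-loudness approximation, the 0.33 exponent, the Hamming window, and the choice of 17 bands are all assumed here, and the function names are hypothetical.

```python
import numpy as np

def hz_to_bark(f_hz):
    # Assumed Hz-to-Bark warping; the abstract does not specify one.
    return 6.0 * np.arcsinh(f_hz / 600.0)

def critical_band_weights(n_bins, sample_rate, n_bands):
    # One asymmetric weighting curve per band, centred at equal Bark steps.
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    bark = hz_to_bark(freqs)
    centers = np.linspace(0.0, bark[-1], n_bands)
    d = bark[None, :] - centers[:, None]      # Bark distance to each centre
    w = np.zeros_like(d)
    rising = (d >= -1.3) & (d <= -0.5)
    flat = (d > -0.5) & (d < 0.5)
    falling = (d >= 0.5) & (d <= 2.5)
    w[rising] = 10.0 ** (2.5 * (d[rising] + 0.5))
    w[flat] = 1.0
    w[falling] = 10.0 ** (-1.0 * (d[falling] - 0.5))
    return w, centers

def equal_loudness(f_hz):
    # Assumed rational approximation of an equal-loudness curve.
    w2 = (2.0 * np.pi * f_hz) ** 2
    return ((w2 + 56.8e6) * w2 ** 2) / ((w2 + 6.3e6) ** 2 * (w2 + 0.38e9))

def levinson_durbin(r, order):
    # Solve the autocorrelation normal equations for an all-pole model.
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                        # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def plp_coefficients(frame, sample_rate, order=5, n_bands=17):
    # Short-term power spectrum of one windowed frame.
    power = np.abs(np.fft.rfft(frame * np.hamming(len(frame)))) ** 2
    # (1) Critical-band spectral resolution.
    weights, centers = critical_band_weights(len(power), sample_rate, n_bands)
    bands = weights @ power
    # (2) Equal-loudness weighting at the band centre frequencies.
    bands = bands * equal_loudness(600.0 * np.sinh(centers / 6.0))
    # End bands are poorly defined (the curve is zero at 0 Hz): copy neighbours.
    bands[0], bands[-1] = bands[1], bands[-2]
    # (3) Intensity-loudness power law (assumed exponent 0.33).
    loudness = bands ** 0.33
    # The inverse DFT of the auditory spectrum gives autocorrelation-like values.
    autocorr = np.fft.irfft(loudness)[:order + 1]
    # All-pole (autoregressive) approximation of the auditory spectrum.
    return levinson_durbin(autocorr, order)
```

Calling `plp_coefficients(frame, 8000)` on a 256-sample frame returns six predictor coefficients (the leading one fixed at 1) together with the residual prediction error. The default `order=5` mirrors the 5th-order model mentioned in the abstract.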