语音识别
计算机科学
人工神经网络
隐马尔可夫模型
时滞神经网络
任务(项目管理)
反向传播
模式识别(心理学)
人工智能
字错误率
等级制度
工程类
系统工程
经济
市场经济
作者
Alexander Waibel,Toshiyuki Hanazawa,Geoffrey E. Hinton,Kiyohiro Shikano,Kevin Lang
出处
期刊:IEEE Transactions on Acoustics, Speech, and Signal Processing
[Institute of Electrical and Electronics Engineers]
日期:1989-03-01
卷期号:37 (3): 328-339
被引量:2464
摘要
The authors present a time-delay neural network (TDNN) approach to phoneme recognition which is characterized by two important properties: (1) using a three-layer arrangement of simple computing units, a hierarchy can be constructed that allows for the formation of arbitrary nonlinear decision surfaces, which the TDNN learns automatically using error backpropagation; and (2) the time-delay arrangement enables the network to discover acoustic-phonetic features and the temporal relationships between them independently of position in time and therefore not blurred by temporal shifts in the input. As a recognition task, the speaker-dependent recognition of the phonemes B, D, and G in varying phonetic contexts was chosen. For comparison, several discrete hidden Markov models (HMM) were trained to perform the same task. Performance evaluation over 1946 testing tokens from three speakers showed that the TDNN achieves a recognition rate of 98.5% correct while the rate obtained by the best of the HMMs was only 93.7%.< >
科研通智能强力驱动
Strongly Powered by AbleSci AI