计算机科学
语音识别
话筒
字错误率
特征提取
特征(语言学)
电话
词(群论)
信号(编程语言)
人工智能
模式识别(心理学)
Mel倒谱
电信
语言学
哲学
声压
程序设计语言
作者
Szu-Chen Stan Jou,Tanja Schultz
出处
期刊:Communications in computer and information science
日期:2008-01-01
卷期号:: 305-320
被引量:6
标识
DOI:10.1007/978-3-540-92219-3_23
摘要
This paper presents our studies of automatic speech recognition based on electromyographic biosignals captured from the articulatory muscles in the face using surface electrodes. We develop a phone-based speech recognizer and describe how the performance of this recognizer improves by carefully designing and tailoring the extraction of relevant speech feature toward electromyographic signals. Our experimental design includes the collection of audibly spoken speech simultaneously recorded as acoustic data using a close-speaking microphone and as electromyographic signals using electrodes. Our experiments indicate that electromyographic signals precede the acoustic signal by about 0.05-0.06 seconds. Furthermore, we introduce articulatory feature classifiers, which had recently shown to improved classical speech recognition significantly. We describe that the classification accuracy of articulatory features clearly benefits from the tailored feature extraction. Finally, these classifiers are integrated into the overall decoding framework applying a stream architecture. Our final system achieves a word error rate of 29.9% on a 100-word recognition task.
科研通智能强力驱动
Strongly Powered by AbleSci AI