Development of sEMG sensors and algorithms for silent speech recognition

语音识别计算机科学词汇字错误率面子（社会学概念）噪音（视频）人工智能社会科学语言学哲学社会学图像（数学）

作者

Geoffrey S. Meltzner,James T. Heaton,Yunbin Deng,Gianluca De Luca,Serge H. Roy,Joshua C. Kline

出处

期刊：Journal of Neural Engineering [IOP Publishing]
日期：2018-06-25 卷期号：15 (4): 046031-046031 被引量：69

链接

europepmc.org europepmc.org nih.gov nih.govdoi.org

标识

DOI：10.1088/1741-2552/aac965

摘要

Objective. Speech is among the most natural forms of human communication, thereby offering an attractive modality for human–machine interaction through automatic speech recognition (ASR). However, the limitations of ASR—including degradation in the presence of ambient noise, limited privacy and poor accessibility for those with significant speech disorders—have motivated the need for alternative non-acoustic modalities of subvocal or silent speech recognition (SSR). Approach. We have developed a new system of face- and neck-worn sensors and signal processing algorithms that are capable of recognizing silently mouthed words and phrases entirely from the surface electromyographic (sEMG) signals recorded from muscles of the face and neck that are involved in the production of speech. The algorithms were strategically developed by evolving speech recognition models: first for recognizing isolated words by extracting speech-related features from sEMG signals, then for recognizing sequences of words from patterns of sEMG signals using grammar models, and finally for recognizing a vocabulary of previously untrained words using phoneme-based models. The final recognition algorithms were integrated with specially designed multi-point, miniaturized sensors that can be arranged in flexible geometries to record high-fidelity sEMG signal measurements from small articulator muscles of the face and neck. Main results. We tested the system of sensors and algorithms during a series of subvocal speech experiments involving more than 1200 phrases generated from a 2200-word vocabulary and achieved an 8.9%-word error rate (91.1% recognition rate), far surpassing previous attempts in the field. Significance. These results demonstrate the viability of our system as an alternative modality of communication for a multitude of applications including: persons with speech impairments following a laryngectomy; military personnel requiring hands-free covert communication; or the consumer in need of privacy while speaking on a mobile phone in public.

求助该文献

Development of sEMG sensors and algorithms for silent speech recognition

今日热心研友