计算机科学
词汇
可穿戴计算机
语音识别
软件可移植性
学习迁移
卷积神经网络
电极阵列
人工智能
模式识别(心理学)
电极
哲学
语言学
化学
物理化学
嵌入式系统
程序设计语言
作者
Zhihang Deng,Xu Zhang,Xi Chen,Xiang Chen,Xun Chen,Erwei Yin
出处
期刊:IEEE Transactions on Instrumentation and Measurement
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:72: 1-11
标识
DOI:10.1109/tim.2023.3244849
摘要
Although surface electromyogram recorded from high-density electrode array is believed to carry sufficient spatial information that can benefit the decoding of motor intentions, the complexity of using the array hindered its widespread applications especially in wearable devices. This study is aimed to develop a non-acoustic modality of silent speech recognition that transfers knowledge learned from high-density array to a system using a few channels, with both high portability and performance. A convolutional neural network was established for recognizing a vocabulary of 33 Chinese words during subvocal speech production. The network was trained by the data recorded from face and neck muscles using two arrays with 64 channels in the source domain. Then it was calibrated through a transfer learning approach to grant its adaption to a new target domain with the data recorded by 8 separated electrodes, while its good capability of characterizing subvocal speech word patterns is expected to be maintained. The proposed method significantly outperformed three common classification approaches and the baseline approach without transfer learning (a network trained with data just from the target domain). Under conditions of electrode shift and cross-user variability, it still obtained performance improvements. The method is demonstrated to be viable for transfer learning across domains of electrode settings and it facilitates to improve the performance of silent speech recognition systems using separate electrode sites under the guidance from high-density of arrays.
科研通智能强力驱动
Strongly Powered by AbleSci AI