计算机科学
人工智能
拉伤
面子(社会学概念)
语音识别
模式识别(心理学)
计算机视觉
面部肌肉
运动(音乐)
声学
元音
沟通
解剖
物理
医学
社会科学
社会学
作者
Hyunjun Yoo,Eunji Kim,Jong Won Chung,Hyeon Cho,Sujin Jeong,Heeseung Kim,Dongju Jang,Hayun Kim,Jinsu Yoon,Gae Hwang Lee,Hyunbum Kang,Jooyoung Kim,Youngjun Yun,Sungroh Yoon,Yongtaek Hong
标识
DOI:10.1021/acsami.2c14918
摘要
Silent communication based on biosignals from facial muscle requires accurate detection of its directional movement and thus optimally positioning minimum numbers of sensors for higher accuracy of speech recognition with a minimal person-to-person variation. So far, previous approaches based on electromyogram or pressure sensors are ineffective in detecting the directional movement of facial muscles. Therefore, in this study, high-performance strain sensors are used for separately detecting x- and y-axis strain. Directional strain distribution data of facial muscle is obtained by applying three-dimensional digital image correlation. Deep learning analysis is utilized for identifying optimal positions of directional strain sensors. The recognition system with four directional strain sensors conformably attached to the face shows silent vowel recognition with 85.24% accuracy and even 76.95% for completely nonobserved subjects. These results show that detection of the directional strain distribution at the optimal facial points will be the key enabling technology for highly accurate silent speech recognition.
科研通智能强力驱动
Strongly Powered by AbleSci AI