语音识别
计算机科学
听力受损者
阅读(过程)
深度学习
手语
人工智能
听力学
政治学
语言学
医学
哲学
法学
作者
L. Ashok Kumar,D. Karthika Renuka,S. Lovelyn Rose,M.C. Shunmuga Priya,I Made Wartana
标识
DOI:10.1016/j.ijcce.2022.01.003
摘要
Assistive technology would be an immense benefit for hearing impaired people by using Audio Visual Speech Recognition (AVSR). Around 466 million people worldwide suffer from hearing loss. Hearing impaired student rely on lip reading for understanding the speech. Lack of trained sign language facilitators and high cost of assistive devices are some of the major challenges faced by hearing impaired students. In this work, we have identified a visual speech recognition technique using cutting edge deep learning models. Moreover, the existing VSR techniques are erroneous. Hence to address the gaps identified, we propose a novel technique by fusion the results from audio and visual speech. This study proposes a new deep learning based audio visual speech recognition model for efficient lip reading. In this paper, an effort has been made to improve the performance of the system significantly by achieving a lowered word error rate of about 6.59% for ASR system and accuracy of about 95% using lip reading model.
科研通智能强力驱动
Strongly Powered by AbleSci AI