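Computer science
Speech recognition
Electroencephalography
Regularization (linguistics)
Wearable computer
Filter (signal processing)
Pattern recognition (psychology)
Artificial intelligence
Psychology
Computer vision
Psychiatry
Embedded system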
Authors
Wouter Biesmans,Neetha Das,Tom Francart,Alexander Bertrand
Source
Journal: IEEE Transactions on Neural Systems and Rehabilitation Engineering
[Institute of Electrical and Electronics Engineers]
Date: 2016-05-24
Volume (issue): 25 (5): 402-412
Citations: 234
Identifier
DOI: 10.1109/tnsre.2016.2571900
Abstract
This paper considers the auditory attention detection (AAD) paradigm, where the goal is to determine which of two simultaneous speakers a person is attending to. The paradigm relies on recordings of the listener's brain activity, e.g., from electroencephalography (EEG). To perform AAD, decoded EEG signals are typically correlated with the temporal envelopes of the speech signals of the separate speakers. In this paper, we study how the inclusion of various degrees of auditory modelling in this speech envelope extraction process affects the AAD performance, where the best performance is found for an auditory-inspired linear filter bank followed by power law compression. These two modelling stages are computationally cheap, which is important for implementation in wearable devices, such as future neuro-steered auditory prostheses. We also introduce a more natural way to combine recordings (over trials and subjects) to train the decoder, which reduces the dependence of the algorithm on regularization parameters. Finally, we investigate the simultaneous design of the EEG decoder and the audio subband envelope recombination weights vector using either a norm-constrained least squares or a canonical correlation analysis, but conclude that this increases computational complexity without improving AAD performance.
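The correlation-based decision described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the compression exponent of 0.6, and the equal-weight sum over subbands are all assumptions made for illustration, and the subband signals are assumed to come from a filter bank applied beforehand.

```python
import numpy as np

def powerlaw_envelope(subbands, exponent=0.6):
    """Compress each subband with a power law and sum them into one envelope.

    subbands: array of shape (n_bands, n_samples), assumed to be the output
    of a filter bank that is not implemented here. The exponent and the
    equal-weight sum are illustrative assumptions.
    """
    compressed = np.abs(np.asarray(subbands, dtype=float)) ** exponent
    return compressed.sum(axis=0)

def pearson(a, b):
    """Pearson correlation coefficient between two 1-D signals."""
    a = np.asarray(a, dtype=float) - np.mean(a)
    b = np.asarray(b, dtype=float) - np.mean(b)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def detect_attended(decoded_eeg, envelope_0, envelope_1):
    """Return 0 or 1: the speaker whose envelope correlates best with the decoded EEG."""
    r0 = pearson(decoded_eeg, envelope_0)
    r1 = pearson(decoded_eeg, envelope_1)
    return 0 if r0 >= r1 else 1
```

In use, `decoded_eeg` would be the output of a trained EEG decoder over one decision window, and each envelope would be computed from the clean speech of one speaker over the same window.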