已入深夜,您辛苦了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!祝你早点完成任务,早点休息,好梦!

The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses

可理解性(哲学) 计算机科学 二进制数 降噪 语音识别 听觉场景分析 数学 人工智能 算术 哲学 认识论
作者
Nilesh Madhu,Ann Spriet,Sofie Jansen,Raphael Koning,Jan Wouters
出处
期刊:IEEE Transactions on Audio, Speech, and Language Processing [Institute of Electrical and Electronics Engineers]
卷期号:21 (1): 63-72 被引量:54
标识
DOI:10.1109/tasl.2012.2213248
摘要

Whereas state-of-the-art single-channel noise reduction algorithms for auditory prostheses demonstrate an appreciable suppression of the noise and improved speech quality, they are unable, thus far, to improve the intelligibility of noise-degraded speech signals. Alternative approaches to speech enhancement using a binary time-frequency mask have demonstrated substantial intelligibility improvements in low signal-to-noise-ratio (SNR) conditions under ideal settings, making this a promising research direction for auditory prostheses. These approaches exploit the sparsity and disjoint-ness of speech spectra in their short-time-frequency representation to preserve only the target-dominant time-frequency regions in the processed output. State-of-the-art noise reduction algorithms in contrast are soft-decision approaches which weight each time-frequency region in proportion to the prevailing SNR. However, the potential for intelligibility improvement using these approaches has not been examined systematically vis-à-vis the binary mask alternative. This contribution compares the performance of an ideal soft-decision system, exemplified by the ideal Wiener filter (IWF), and the ideal binary mask (IBM) for single-channel speech enhancement for auditory prostheses. To obtain results relevant to this application area, a (relatively) low spectral resolution, modelled using the Bark-spectrum scale, is used for both the IWF and the IBM. This spectral resolution is comparable to that being used in commercial hearing instruments. The comparison is in terms of potential for intelligibility improvement and resulting signal quality. Intelligibility tests carried out under various noise conditions and SNRs show that the IWF leads to higher intelligibility scores than the IBM in low SNR conditions. Under non-ideal parameter estimates, it is demonstrated that the IWF approach is also much less sensitive to estimation errors. Quality-wise, a preference for the IWF exists. This was evaluated using a two-stage, pair-wise preference-rating test.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
贪吃的双下巴完成签到,获得积分10
刚刚
1秒前
2秒前
舟舟发布了新的文献求助10
2秒前
传奇3应助怀民已就寝采纳,获得10
4秒前
miketyson完成签到,获得积分10
6秒前
笨笨的元绿完成签到,获得积分20
9秒前
安静妙芙完成签到,获得积分20
11秒前
14秒前
18秒前
20秒前
新酱不爱吃青椒完成签到 ,获得积分10
28秒前
flyfish完成签到,获得积分10
28秒前
情怀应助怀民已就寝采纳,获得10
31秒前
LONG完成签到,获得积分10
32秒前
tjnksy完成签到,获得积分10
33秒前
斯文败类应助lana采纳,获得10
33秒前
33秒前
从容迎夏完成签到,获得积分10
35秒前
LONG发布了新的文献求助10
35秒前
36秒前
6666发布了新的文献求助10
38秒前
40秒前
WuFen完成签到 ,获得积分10
40秒前
内向的火车完成签到 ,获得积分10
45秒前
48秒前
50秒前
momo完成签到,获得积分10
51秒前
51秒前
52秒前
343123完成签到,获得积分10
54秒前
平常丝发布了新的文献求助10
55秒前
安渝完成签到 ,获得积分10
55秒前
58秒前
hhh发布了新的文献求助10
1分钟前
ucas大菠萝完成签到,获得积分10
1分钟前
兴奋的听筠完成签到,获得积分10
1分钟前
gaberella完成签到,获得积分10
1分钟前
华仔应助wangli采纳,获得10
1分钟前
shentaii完成签到,获得积分10
1分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
2025-2031全球及中国金刚石触媒粉行业研究及十五五规划分析报告 9000
Encyclopedia of the Human Brain Second Edition 8000
Translanguaging in Action in English-Medium Classrooms: A Resource Book for Teachers 700
Real World Research, 5th Edition 680
Qualitative Data Analysis with NVivo By Jenine Beekhuyzen, Pat Bazeley · 2024 660
Superabsorbent Polymers 600
热门求助领域 (近24小时)
化学 材料科学 生物 医学 工程类 计算机科学 有机化学 物理 生物化学 纳米技术 复合材料 内科学 化学工程 人工智能 催化作用 遗传学 数学 基因 量子力学 物理化学
热门帖子
关注 科研通微信公众号,转发送积分 5681113
求助须知:如何正确求助?哪些是违规求助? 5004606
关于积分的说明 15174989
捐赠科研通 4840793
什么是DOI,文献DOI怎么找? 2594460
邀请新用户注册赠送积分活动 1547586
关于科研通互助平台的介绍 1505524