计算机科学
语音识别
神经计算语音处理
语音处理
分割
言语感知
语音分割
人工智能
神经科学
心理学
感知
作者
Xue L. Gong,Alexander G. Huth,Fatma Deniz,Keith Johnson,Jack L. Gallant,Frédéric E. Theunissen
标识
DOI:10.1038/s41467-023-39872-w
摘要
Speech processing requires extracting meaning from acoustic patterns using a set of intermediate representations based on a dynamic segmentation of the speech stream. Using whole brain mapping obtained in fMRI, we investigate the locus of cortical phonemic processing not only for single phonemes but also for short combinations made of diphones and triphones. We find that phonemic processing areas are much larger than previously described: they include not only the classical areas in the dorsal superior temporal gyrus but also a larger region in the lateral temporal cortex where diphone features are best represented. These identified phonemic regions overlap with the lexical retrieval region, but we show that short word retrieval is not sufficient to explain the observed responses to diphones. Behavioral studies have shown that phonemic processing and lexical retrieval are intertwined. Here, we also have identified candidate regions within the speech cortical network where this joint processing occurs.
科研通智能强力驱动
Strongly Powered by AbleSci AI