计算机科学
人工智能
忠诚
计算机视觉
功能磁共振成像
过程(计算)
编码(集合论)
特征(语言学)
一致性(知识库)
视觉感受
感知
模式识别(心理学)
迭代重建
视皮层
连贯性(哲学赌博策略)
磁共振弥散成像
磁共振成像
神经科学
数学
医学
电信
语言学
哲学
统计
集合(抽象数据类型)
放射科
生物
程序设计语言
操作系统
作者
Haoyu Li,Hao Wu,Badong Chen
出处
期刊:IEEE transactions on image processing
[Institute of Electrical and Electronics Engineers]
日期:2025-01-01
卷期号:: 1-1
标识
DOI:10.1109/tip.2025.3526051
摘要
Reconstructing visual stimuli from functional Magnetic Resonance Imaging (fMRI) enables fine-grained retrieval of brain activity. However, the accurate reconstruction of diverse details, including structure, background, texture, color, and more, remains challenging. The stable diffusion models inevitably result in the variability of reconstructed images, even under identical conditions. To address this challenge, we first uncover the neuroscientific perspective of diffusion methods, which primarily involve top-down creation using pre-trained knowledge from extensive image datasets, but tend to lack detail-driven bottom-up perception, leading to a loss of faithful details. In this paper, we propose NeuralDiffuser which incorporates primary visual feature guidance to provide detailed cues in the form of gradients. This extension of the bottom-up process for diffusion models achieves both semantic coherence and detail fidelity when reconstructing visual stimuli. Furthermore, we have developed a novel guidance strategy for reconstruction tasks that ensures the consistency of repeated outputs with original images rather than with various outputs. Extensive experimental results on the Natural Senses Dataset (NSD) qualitatively and quantitatively demonstrate the advancement of NeuralDiffuser by comparing it against baseline and state-of-the-art methods horizontally, as well as conducting longitudinal ablation studies. Code can be available on https://github.com/HaoyyLi/NeuralDiffuser.
科研通智能强力驱动
Strongly Powered by AbleSci AI