Back to Search
Start Over
Dual-Guided Brain Diffusion Model: Natural Image Reconstruction from Human Visual Stimulus fMRI.
- Source :
-
Bioengineering (Basel) . Oct2023, Vol. 10 Issue 10, p1117. 16p. - Publication Year :
- 2023
-
Abstract
- The reconstruction of visual stimuli from fMRI signals, which record brain activity, is a challenging task with crucial research value in the fields of neuroscience and machine learning. Previous studies tend to emphasize reconstructing pixel-level features (contours, colors, etc.) or semantic features (object category) of the stimulus image, but typically, these properties are not reconstructed together. In this context, we introduce a novel three-stage visual reconstruction approach called the Dual-guided Brain Diffusion Model (DBDM). Initially, we employ the Very Deep Variational Autoencoder (VDVAE) to reconstruct a coarse image from fMRI data, capturing the underlying details of the original image. Subsequently, the Bootstrapping Language-Image Pre-training (BLIP) model is utilized to provide a semantic annotation for each image. Finally, the image-to-image generation pipeline of the Versatile Diffusion (VD) model is utilized to recover natural images from the fMRI patterns guided by both visual and semantic information. The experimental results demonstrate that DBDM surpasses previous approaches in both qualitative and quantitative comparisons. In particular, the best performance is achieved by DBDM in reconstructing the semantic details of the original image; the Inception, CLIP and SwAV distances are 0.611, 0.225 and 0.405, respectively. This confirms the efficacy of our model and its potential to advance visual decoding research. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 23065354
- Volume :
- 10
- Issue :
- 10
- Database :
- Academic Search Index
- Journal :
- Bioengineering (Basel)
- Publication Type :
- Academic Journal
- Accession number :
- 173267155
- Full Text :
- https://doi.org/10.3390/bioengineering10101117