Title: LLM4Brain: Training a Large Language Model for Brain Video Understanding

Sep 26, 2024

Ruizhe Zheng, Lichao Sun

Abstract: Decoding visual-semantic information from brain signals, such as functional MRI (fMRI), across different subjects poses significant challenges, including low signal-to-noise ratio, limited data availability, and cross-subject variability. Recent advances in large language models (LLMs) show remarkable effectiveness in processing multimodal information. In this study, we introduce an LLM-based approach for reconstructing visual-semantic information from fMRI signals elicited by video stimuli. Specifically, we fine-tune an fMRI encoder equipped with adaptors to transform brain responses into latent representations aligned with the video stimuli. These representations are then mapped to the textual modality by an LLM. In particular, we integrate self-supervised domain adaptation methods to enhance the alignment between visual-semantic information and brain responses. Our proposed method achieves good results on various quantitative semantic metrics, and its outputs are similar to the ground-truth information.
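The abstract does not specify the adaptor design or the alignment objective, so the following is only a minimal illustrative sketch of the general idea, not the authors' implementation. It assumes a generic residual bottleneck adaptor applied to a frozen encoder's output and a cosine-distance alignment loss against a video embedding. All names (`adapter`, `alignment_loss`, `W_down`, `W_up`) and all dimensions are hypothetical, and the random vectors stand in for real fMRI and video features.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def adapter(h, W_down, W_up):
    """Residual bottleneck adaptor (assumed design): h + W_up @ relu(W_down @ h)."""
    z = [max(0.0, v) for v in matvec(W_down, h)]
    return [hi + ui for hi, ui in zip(h, matvec(W_up, z))]


def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)


def alignment_loss(brain_latent, video_embedding):
    """Cosine distance (assumed objective): 0 when the two vectors point the same way."""
    return 1.0 - cosine(brain_latent, video_embedding)


rng = random.Random(0)
dim, bottleneck = 8, 2  # hypothetical sizes

# Stand-ins for a frozen fMRI encoder output and a video-stimulus embedding.
h = [rng.gauss(0, 1) for _ in range(dim)]
video = [rng.gauss(0, 1) for _ in range(dim)]

# Only the adaptor weights would be trained; W_up starts at zero so the
# adaptor is initially an identity mapping on the encoder output.
W_down = [[rng.gauss(0, 0.1) for _ in range(dim)] for _ in range(bottleneck)]
W_up = [[0.0] * bottleneck for _ in range(dim)]

out = adapter(h, W_down, W_up)
loss = alignment_loss(out, video)
print(f"alignment loss before training: {loss:.4f}")
```

In a setup like this, training would update only `W_down` and `W_up` to reduce the loss, leaving the encoder weights untouched; the aligned latent would then be passed to the language model for text generation.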

* ECCV2024 Workshop
