Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models

Lopez, Eleonora; Sigillo, Luigi; Colonnese, Federica; Panella, Massimo; Comminiello, Danilo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2410.02780 (cs)

[Submitted on 17 Sep 2024]

Title:Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models

Authors:Eleonora Lopez, Luigi Sigillo, Federica Colonnese, Massimo Panella, Danilo Comminiello

View PDF HTML (experimental)

Abstract:Generating images from brain waves is gaining increasing attention due to its potential to advance brain-computer interface (BCI) systems by understanding how brain signals encode visual cues. Most of the literature has focused on fMRI-to-Image tasks as fMRI is characterized by high spatial resolution. However, fMRI is an expensive neuroimaging modality and does not allow for real-time BCI. On the other hand, electroencephalography (EEG) is a low-cost, non-invasive, and portable neuroimaging technique, making it an attractive option for future real-time applications. Nevertheless, EEG presents inherent challenges due to its low spatial resolution and susceptibility to noise and artifacts, which makes generating images from EEG more difficult. In this paper, we address these problems with a streamlined framework based on the ControlNet adapter for conditioning a latent diffusion model (LDM) through EEG signals. We conduct experiments and ablation studies on popular benchmarks to demonstrate that the proposed method beats other state-of-the-art models. Unlike these methods, which often require extensive preprocessing, pretraining, different losses, and captioning models, our approach is efficient and straightforward, requiring only minimal preprocessing and a few components. Code will be available after publication.

Comments:	Submitted to ICASSP 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2410.02780 [cs.CV]
	(or arXiv:2410.02780v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2410.02780

Submission history

From: Eleonora Lopez [view email]
[v1] Tue, 17 Sep 2024 19:07:13 UTC (20,601 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators