The Illusion-Illusion: Vision Language Models See Illusions Where There are None

Ullman, Tomer

arXiv:2412.18613 (q-bio)

[Submitted on 7 Dec 2024]

Abstract: Illusions are entertaining, but they are also a useful diagnostic tool in cognitive science, philosophy, and neuroscience. A typical illusion shows a gap between how something "really is" and how something "appears to be", and this gap helps us understand the mental processing that leads to how something appears to be. Illusions are also useful for investigating artificial systems, and much research has examined whether computational models of perception fall prey to the same illusions as people. Here, I invert the standard use of perceptual illusions to examine basic processing errors in current vision language models. I present these models with illusory-illusions, neighbors of common illusions that should not elicit processing errors. These include such things as perfectly reasonable ducks, crooked lines that truly are crooked, circles that seem to have different sizes because they are, in fact, of different sizes, and so on. I show that many current vision language systems mistakenly see these illusion-illusions as illusions. I suggest that such failures are part of broader failures already discussed in the literature.

Comments:	9 pages, 5 figures
Subjects:	Neurons and Cognition (q-bio.NC); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.18613 [q-bio.NC]
	(or arXiv:2412.18613v1 [q-bio.NC] for this version)
	https://doi.org/10.48550/arXiv.2412.18613

Submission history

From: Tomer Ullman
[v1] Sat, 7 Dec 2024 03:30:51 UTC (1,878 KB)
