MIEB: Massive Image Embedding Benchmark

Xiao, Chenghao; Chung, Isaac; Kerboua, Imene; Stirling, Jamie; Zhang, Xin; Kardos, Márton; Solomatin, Roman; Moubayed, Noura Al; Enevoldsen, Kenneth; Muennighoff, Niklas

Computer Science > Computer Vision and Pattern Recognition

arXiv:2504.10471 (cs)

[Submitted on 14 Apr 2025]

Title:MIEB: Massive Image Embedding Benchmark

Authors:Chenghao Xiao, Isaac Chung, Imene Kerboua, Jamie Stirling, Xin Zhang, Márton Kardos, Roman Solomatin, Noura Al Moubayed, Kenneth Enevoldsen, Niklas Muennighoff

View PDF HTML (experimental)

Abstract:Image representations are often evaluated through disjointed, task-specific protocols, leading to a fragmented understanding of model capabilities. For instance, it is unclear whether an image embedding model adept at clustering images is equally good at retrieving relevant images given a piece of text. We introduce the Massive Image Embedding Benchmark (MIEB) to evaluate the performance of image and image-text embedding models across the broadest spectrum to date. MIEB spans 38 languages across 130 individual tasks, which we group into 8 high-level categories. We benchmark 50 models across our benchmark, finding that no single method dominates across all task categories. We reveal hidden capabilities in advanced vision models such as their accurate visual representation of texts, and their yet limited capabilities in interleaved encodings and matching images and texts in the presence of confounders. We also show that the performance of vision encoders on MIEB correlates highly with their performance when used in multimodal large language models. Our code, dataset, and leaderboard are publicly available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
Cite as:	arXiv:2504.10471 [cs.CV]
	(or arXiv:2504.10471v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.10471

Submission history

From: Chenghao Xiao [view email]
[v1] Mon, 14 Apr 2025 17:54:28 UTC (22,289 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MIEB: Massive Image Embedding Benchmark

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MIEB: Massive Image Embedding Benchmark

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators