M-MRE: Extending the Mutual Reinforcement Effect to Multimodal Information Extraction

Gan, Chengguang; Lee, Sunbowen; Cai, Zhixi; Wei, Yanbin; Zheng, Lei; Liang, Yunhao; Ni, Shiwen; Mori, Tatsunori

Computer Science > Computation and Language

arXiv:2504.17353 (cs)

[Submitted on 24 Apr 2025]

Title:M-MRE: Extending the Mutual Reinforcement Effect to Multimodal Information Extraction

Authors:Chengguang Gan, Sunbowen Lee, Zhixi Cai, Yanbin Wei, Lei Zheng, Yunhao Liang, Shiwen Ni, Tatsunori Mori

View PDF HTML (experimental)

Abstract:Mutual Reinforcement Effect (MRE) is an emerging subfield at the intersection of information extraction and model interpretability. MRE aims to leverage the mutual understanding between tasks of different granularities, enhancing the performance of both coarse-grained and fine-grained tasks through joint modeling. While MRE has been explored and validated in the textual domain, its applicability to visual and multimodal domains remains unexplored. In this work, we extend MRE to the multimodal information extraction domain for the first time. Specifically, we introduce a new task: Multimodal Mutual Reinforcement Effect (M-MRE), and construct a corresponding dataset to support this task. To address the challenges posed by M-MRE, we further propose a Prompt Format Adapter (PFA) that is fully compatible with various Large Vision-Language Models (LVLMs). Experimental results demonstrate that MRE can also be observed in the M-MRE task, a multimodal text-image understanding scenario. This provides strong evidence that MRE facilitates mutual gains across three interrelated tasks, confirming its generalizability beyond the textual domain.

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
Cite as:	arXiv:2504.17353 [cs.CL]
	(or arXiv:2504.17353v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.17353

Submission history

From: Chengguang Gan [view email]
[v1] Thu, 24 Apr 2025 08:14:36 UTC (1,170 KB)

Computer Science > Computation and Language

Title:M-MRE: Extending the Mutual Reinforcement Effect to Multimodal Information Extraction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:M-MRE: Extending the Mutual Reinforcement Effect to Multimodal Information Extraction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators