Continual Multimodal Knowledge Graph Construction

Chen, Xiang; Zhang, Jintian; Wang, Xiaohan; Wu, Tongtong; Deng, Shumin; Wang, Yongheng; Si, Luo; Chen, Huajun; Zhang, Ningyu

arXiv:2305.08698v1 (cs)

[Submitted on 15 May 2023 (this version), latest version 26 May 2024 (v3)]

Abstract: Multimodal Knowledge Graph Construction (MMKC) refers to the process of creating a structured representation of entities and relationships from multiple modalities such as text, images, and videos. However, existing MMKC models are limited in handling the new entities and relations that the dynamic nature of the real world introduces. Moreover, most state-of-the-art studies in MMKC only consider entity and relation extraction from text data while neglecting other multimodal sources. Meanwhile, the current continual setting for knowledge graph construction only considers entity and relation extraction from text data while neglecting other multimodal sources. Therefore, there is a need to explore the challenge of continual multimodal knowledge graph construction, to address catastrophic forgetting and ensure the retention of past knowledge extracted from different forms of data. This research investigates this topic by developing lifelong multimodal benchmark datasets. Based on the empirical finding that several state-of-the-art MMKC models, when trained on multimedia data, might unexpectedly underperform those that use only textual resources in a continual setting, we propose a Lifelong MultiModal Consistent Transformer Framework (LMC) for continual multimodal knowledge graph construction. By combining the advantages of consistent KGC strategies within the context of continual learning, we achieve a better balance between stability and plasticity. Our experiments demonstrate the superior performance of our method over prevailing continual learning techniques and multimodal approaches in dynamic scenarios. Code and datasets can be found at this https URL.

Comments:	Work in progress
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB); Machine Learning (cs.LG); Multimedia (cs.MM)
Cite as:	arXiv:2305.08698 [cs.CL]
	(or arXiv:2305.08698v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.08698

Submission history

From: Ningyu Zhang
[v1] Mon, 15 May 2023 14:58:28 UTC (32,540 KB)
[v2] Tue, 1 Aug 2023 10:23:20 UTC (24,294 KB)
[v3] Sun, 26 May 2024 16:29:05 UTC (19,536 KB)
