ReMatch: Retrieval Enhanced Schema Matching with LLMs

Sheetrit, Eitam; Brief, Menachem; Mishaeli, Moshik; Elisha, Oren

arXiv:2403.01567v2 (cs)

[Submitted on 3 Mar 2024 (v1), last revised 30 May 2024 (this version, v2)]

Abstract: Schema matching is a crucial task in data integration, involving the alignment of a source schema with a target schema to establish correspondence between their elements. This task is challenging due to textual and semantic heterogeneity, as well as differences in schema sizes. Although machine-learning-based solutions have been explored in numerous studies, they often suffer from low accuracy, require manual mapping of the schemas for model training, or need access to source schema data which might be unavailable due to privacy concerns. In this paper we present a novel method, named ReMatch, for matching schemas using retrieval-enhanced Large Language Models (LLMs). Our method avoids the need for predefined mapping, any model training, or access to data in the source database. Our experimental results on large real-world schemas demonstrate that ReMatch is an effective matcher. By eliminating the requirement for training data, ReMatch becomes a viable solution for real-world scenarios.
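
The abstract does not describe the retrieval or prompting steps in detail, so the following is only a minimal sketch of the general retrieve-then-prompt pattern that "retrieval-enhanced" matching suggests, not the authors' implementation. It assumes a simple token-overlap cosine similarity as the retriever (a real system would more likely use learned embeddings), and the function names (`tokenize`, `cosine`, `retrieve_candidates`, `build_prompt`) and the example schemas are hypothetical. No LLM is called; the sketch stops at building the prompt text. It uses only schema element names, in keeping with the abstract's statement that no source data or training is required.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Insert a space at camelCase boundaries, split on non-alphanumerics, lowercase.
    spaced = re.sub(r"([a-z0-9])([A-Z])", r"\1 \2", text)
    return [t.lower() for t in re.split(r"[^A-Za-z0-9]+", spaced) if t]


def cosine(a, b):
    # Cosine similarity between two token-count vectors (Counter objects).
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve_candidates(source_attr, target_attrs, k=2):
    # Rank target attributes by similarity to the source attribute and keep the top k.
    # Ties are broken alphabetically so the ranking is deterministic.
    src = Counter(tokenize(source_attr))
    scored = [(cosine(src, Counter(tokenize(t))), t) for t in target_attrs]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [t for _, t in scored[:k]]


def build_prompt(source_attr, candidates):
    # Assemble prompt text asking an LLM to choose among the shortlisted candidates.
    lines = [f"Source attribute: {source_attr}", "Candidate target attributes:"]
    lines += [f"{i}. {c}" for i, c in enumerate(candidates, 1)]
    lines.append("Which candidate, if any, corresponds to the source attribute?")
    return "\n".join(lines)


# Hypothetical schemas, invented for illustration only.
target = ["person.birth_date", "person.gender", "visit.start_date", "drug.dose_unit"]
candidates = retrieve_candidates("patients.dateOfBirth", target, k=2)
prompt = build_prompt("patients.dateOfBirth", candidates)
```

Shortlisting candidates before prompting keeps the prompt small even when the target schema is large, which is one plausible way to cope with the schema-size differences the abstract mentions.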

Subjects:	Databases (cs.DB); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.01567 [cs.DB]
	(or arXiv:2403.01567v2 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.2403.01567

Submission history

From: Menachem Brief [view email]
[v1] Sun, 3 Mar 2024 17:14:40 UTC (611 KB)
[v2] Thu, 30 May 2024 14:33:46 UTC (606 KB)
