Leveraging Large Language Models for Entity Matching

Huang, Qianyu; Zhao, Tongfang

Computer Science > Computation and Language

arXiv:2405.20624 (cs)

[Submitted on 31 May 2024]

Title:Leveraging Large Language Models for Entity Matching

Authors:Qianyu Huang, Tongfang Zhao

View PDF HTML (experimental)

Abstract:Entity matching (EM) is a critical task in data integration, aiming to identify records across different datasets that refer to the same real-world entities. Traditional methods often rely on manually engineered features and rule-based systems, which struggle with diverse and unstructured data. The emergence of Large Language Models (LLMs) such as GPT-4 offers transformative potential for EM, leveraging their advanced semantic understanding and contextual capabilities. This vision paper explores the application of LLMs to EM, discussing their advantages, challenges, and future research directions. Additionally, we review related work on applying weak supervision and unsupervised approaches to EM, highlighting how LLMs can enhance these methods.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.20624 [cs.CL]
	(or arXiv:2405.20624v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.20624

Submission history

From: Qianyu Huang [view email]
[v1] Fri, 31 May 2024 05:22:07 UTC (625 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2024-05

Change to browse by:

cs
cs.CL

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Leveraging Large Language Models for Entity Matching

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Leveraging Large Language Models for Entity Matching

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators