NASTyLinker: NIL-Aware Scalable Transformer-based Entity Linker

Heist, Nicolas; Paulheim, Heiko

Computer Science > Computation and Language

arXiv:2303.04426 (cs)

[Submitted on 8 Mar 2023 (v1), last revised 13 Mar 2023 (this version, v2)]

Title:NASTyLinker: NIL-Aware Scalable Transformer-based Entity Linker

Authors:Nicolas Heist, Heiko Paulheim

View PDF

Abstract:Entity Linking (EL) is the task of detecting mentions of entities in text and disambiguating them to a reference knowledge base. Most prevalent EL approaches assume that the reference knowledge base is complete. In practice, however, it is necessary to deal with the case of linking to an entity that is not contained in the knowledge base (NIL entity). Recent works have shown that, instead of focusing only on affinities between mentions and entities, considering inter-mention affinities can be used to represent NIL entities by producing clusters of mentions. At the same time, inter-mention affinities can help to substantially improve linking performance for known entities. With NASTyLinker, we introduce an EL approach that is aware of NIL entities and produces corresponding mention clusters while maintaining high linking performance for known entities. The approach clusters mentions and entities based on dense representations from Transformers and resolves conflicts (if more than one entity is assigned to a cluster) by computing transitive mention-entity affinities. We show the effectiveness and scalability of NASTyLinker on NILK, a dataset that is explicitly constructed to evaluate EL with respect to NIL entities. Further, we apply the presented approach to an actual EL task, namely to knowledge graph population by linking entities in Wikipedia listings, and provide an analysis of the outcome.

Comments:	Preprint of a paper in the research track of the 20th Extended Semantic Web Conference (ESWC'23)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:2303.04426 [cs.CL]
	(or arXiv:2303.04426v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2303.04426

Submission history

From: Nicolas Heist [view email]
[v1] Wed, 8 Mar 2023 08:08:57 UTC (424 KB)
[v2] Mon, 13 Mar 2023 08:43:27 UTC (424 KB)

Computer Science > Computation and Language

Title:NASTyLinker: NIL-Aware Scalable Transformer-based Entity Linker

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:NASTyLinker: NIL-Aware Scalable Transformer-based Entity Linker

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators