Multilingual End to End Entity Linking
Authors:
Mikhail Plekhanov,
Nora Kassner,
Kashyap Popat,
Louis Martin,
Simone Merello,
Borislav Kozlovskii,
Frédéric A. Dreyer,
Nicola Cancedda
Abstract:
Entity Linking is one of the most common Natural Language Processing tasks in practical applications, but so far efficient end-to-end solutions with multilingual coverage have been lacking, leading to complex model stacks. To fill this gap, we release and open source BELA, the first fully end-to-end multilingual entity linking model that efficiently detects and links entities in texts in any of 97…
▽ More
Entity Linking is one of the most common Natural Language Processing tasks in practical applications, but so far efficient end-to-end solutions with multilingual coverage have been lacking, leading to complex model stacks. To fill this gap, we release and open source BELA, the first fully end-to-end multilingual entity linking model that efficiently detects and links entities in texts in any of 97 languages. We provide here a detailed description of the model and report BELA's performance on four entity linking datasets covering high- and low-resource languages.
△ Less
Submitted 15 June, 2023;
originally announced June 2023.
Polar Ducks and Where to Find Them: Enhancing Entity Linking with Duck Ty** and Polar Box Embeddings
Authors:
Mattia Atzeni,
Mikhail Plekhanov,
Frédéric A. Dreyer,
Nora Kassner,
Simone Merello,
Louis Martin,
Nicola Cancedda
Abstract:
Entity linking methods based on dense retrieval are an efficient and widely used solution in large-scale applications, but they fall short of the performance of generative models, as they are sensitive to the structure of the embedding space. In order to address this issue, this paper introduces DUCK, an approach to infusing structural information in the space of entity representations, using prio…
▽ More
Entity linking methods based on dense retrieval are an efficient and widely used solution in large-scale applications, but they fall short of the performance of generative models, as they are sensitive to the structure of the embedding space. In order to address this issue, this paper introduces DUCK, an approach to infusing structural information in the space of entity representations, using prior knowledge of entity types. Inspired by duck ty** in programming languages, we propose to define the type of an entity based on the relations that it has with other entities in a knowledge graph. Then, porting the concept of box embeddings to spherical polar coordinates, we propose to represent relations as boxes on the hypersphere. We optimize the model to cluster entities of similar type by placing them inside the boxes corresponding to their relations. Our experiments show that our method sets new state-of-the-art results on standard entity-disambiguation benchmarks, it improves the performance of the model by up to 7.9 F1 points, outperforms other type-aware approaches, and matches the results of generative models with 18 times more parameters.
△ Less
Submitted 20 October, 2023; v1 submitted 19 May, 2023;
originally announced May 2023.