Skip to main content

Showing 1–50 of 64 results for author: Schockaert, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09529  [pdf, ps, other

    cs.AI cs.LG

    Differentiable Reasoning about Knowledge Graphs with Region-based Graph Neural Networks

    Authors: Aleksandar Pavlovic, Emanuel Sallinger, Steven Schockaert

    Abstract: Methods for knowledge graph (KG) completion need to capture semantic regularities and use these regularities to infer plausible knowledge that is not explicitly stated. Most embedding-based methods are opaque in the kinds of regularities they can capture, although region-based KG embedding models have emerged as a more transparent alternative. By modeling relations as geometric regions in high-dim… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2403.17216  [pdf, other

    cs.CL

    Ontology Completion with Natural Language Inference and Concept Embeddings: An Analysis

    Authors: Na Li, Thomas Bailleux, Zied Bouraoui, Steven Schockaert

    Abstract: We consider the problem of finding plausible knowledge that is missing from a given ontology, as a generalisation of the well-studied taxonomy expansion task. One line of work treats this task as a Natural Language Inference (NLI) problem, thus relying on the knowledge captured by language models to identify the missing knowledge. Another line of work uses concept embeddings to identify what diffe… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  3. arXiv:2403.16984  [pdf, other

    cs.AI cs.CL

    Modelling Commonsense Commonalities with Multi-Facet Concept Embeddings

    Authors: Hanane Kteich, Na Li, Usashi Chatterjee, Zied Bouraoui, Steven Schockaert

    Abstract: Concept embeddings offer a practical and efficient mechanism for injecting commonsense knowledge into downstream tasks. Their core purpose is often not to predict the commonsense properties of concepts themselves, but rather to identify commonalities, i.e.\ sets of concepts which share some property of interest. Such commonalities are the basis for inductive generalisation, hence high-quality conc… ▽ More

    Submitted 4 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  4. arXiv:2402.15337  [pdf, other

    cs.CL cs.LG

    Ranking Entities along Conceptual Space Dimensions with LLMs: An Analysis of Fine-Tuning Strategies

    Authors: Nitesh Kumar, Usashi Chatterjee, Steven Schockaert

    Abstract: Conceptual spaces represent entities in terms of their primitive semantic features. Such representations are highly valuable but they are notoriously difficult to learn, especially when it comes to modelling perceptual and subjective features. Distilling conceptual spaces from Large Language Models (LLMs) has recently emerged as a promising strategy, but existing work has been limited to probing p… ▽ More

    Submitted 5 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted in ACL 2024 (Findings)

  5. arXiv:2401.16270  [pdf, other

    cs.AI

    Capturing Knowledge Graphs and Rules with Octagon Embeddings

    Authors: Victor Charpenay, Steven Schockaert

    Abstract: Region based knowledge graph embeddings represent relations as geometric regions. This has the advantage that the rules which are captured by the model are made explicit, making it straightforward to incorporate prior knowledge and to inspect learned models. Unfortunately, existing approaches are severely restricted in their ability to model relational composition, and hence also their ability to… ▽ More

    Submitted 18 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted for IJCAI 2024

  6. arXiv:2312.11062  [pdf, other

    cs.CL

    Entity or Relation Embeddings? An Analysis of Encoding Strategies for Relation Extraction

    Authors: Frank Mtumbuka, Steven Schockaert

    Abstract: Relation extraction is essentially a text classification problem, which can be tackled by fine-tuning a pre-trained language model (LM). However, a key challenge arises from the fact that relation extraction cannot straightforwardly be reduced to sequence or token classification. Existing approaches therefore solve the problem in an indirect way: they fine-tune an LM to learn embeddings of the hea… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  7. arXiv:2310.14793  [pdf, other

    cs.CL cs.AI cs.LG

    What do Deck Chairs and Sun Hats Have in Common? Uncovering Shared Properties in Large Concept Vocabularies

    Authors: Amit Gajbhiye, Zied Bouraoui, Na Li, Usashi Chatterjee, Luis Espinosa Anke, Steven Schockaert

    Abstract: Concepts play a central role in many applications. This includes settings where concepts have to be modelled in the absence of sentence context. Previous work has therefore focused on distilling decontextualised concept embeddings from language models. But concepts can be modelled from different perspectives, whereas concept embeddings typically mostly capture taxonomic structure. To address this… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted for EMNLP 2023

  8. arXiv:2310.12379  [pdf, other

    cs.CL cs.AI

    Solving Hard Analogy Questions with Relation Embedding Chains

    Authors: Nitesh Kumar, Steven Schockaert

    Abstract: Modelling how concepts are related is a central topic in Lexical Semantics. A common strategy is to rely on knowledge graphs (KGs) such as ConceptNet, and to model the relation between two concepts as a set of paths. However, KGs are limited to a fixed set of relation types, and they are incomplete and often noisy. Another strategy is to distill relation embeddings from a fine-tuned language model… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  9. arXiv:2310.05481  [pdf, other

    cs.CL cs.AI

    Cabbage Sweeter than Cake? Analysing the Potential of Large Language Models for Learning Conceptual Spaces

    Authors: Usashi Chatterjee, Amit Gajbhiye, Steven Schockaert

    Abstract: The theory of Conceptual Spaces is an influential cognitive-linguistic framework for representing the meaning of concepts. Conceptual spaces are constructed from a set of quality dimensions, which essentially correspond to primitive perceptual features (e.g. hue or size). These quality dimensions are usually learned from human judgements, which means that applications of conceptual spaces tend to… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted for EMNLP 2023

  10. arXiv:2310.00299  [pdf, other

    cs.CL

    RelBERT: Embedding Relations with Language Models

    Authors: Asahi Ushio, Jose Camacho-Collados, Steven Schockaert

    Abstract: Many applications need access to background knowledge about how different concepts and entities are related. Although Knowledge Graphs (KG) and Large Language Models (LLM) can address this need to some extent, KGs are inevitably incomplete and their relational schema is often too coarse-grained, while LLMs are inefficient and difficult to control. As an alternative, we propose to extract relation… ▽ More

    Submitted 8 October, 2023; v1 submitted 30 September, 2023; originally announced October 2023.

  11. arXiv:2309.15217  [pdf, other

    cs.CL

    RAGAS: Automated Evaluation of Retrieval Augmented Generation

    Authors: Shahul Es, Jithin James, Luis Espinosa-Anke, Steven Schockaert

    Abstract: We introduce RAGAs (Retrieval Augmented Generation Assessment), a framework for reference-free evaluation of Retrieval Augmented Generation (RAG) pipelines. RAG systems are composed of a retrieval and an LLM based generation module, and provide LLMs with knowledge from a reference textual database, which enables them to act as a natural language layer between a user and textual databases, reducing… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: Reference-free (not tied to having ground truth available) evaluation framework for retrieval agumented generation

  12. arXiv:2308.07942  [pdf, other

    cs.AI cs.LG cs.SI

    Inductive Knowledge Graph Completion with GNNs and Rules: An Analysis

    Authors: Akash Anil, Víctor Gutiérrez-Basulto, Yazmín Ibañéz-García, Steven Schockaert

    Abstract: The task of inductive knowledge graph completion requires models to learn inference patterns from a training graph, which can then be used to make predictions on a disjoint test graph. Rule-based methods seem like a natural fit for this task, but in practice they significantly underperform state-of-the-art methods based on Graph Neural Networks (GNNs), such as NBFNet. We hypothesise that the under… ▽ More

    Submitted 24 March, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

  13. arXiv:2305.15002  [pdf, other

    cs.CL cs.LG

    A RelEntLess Benchmark for Modelling Graded Relations between Named Entities

    Authors: Asahi Ushio, Jose Camacho Collados, Steven Schockaert

    Abstract: Relations such as "is influenced by", "is known for" or "is a competitor of" are inherently graded: we can rank entity pairs based on how well they satisfy these relations, but it is hard to draw a line between those pairs that satisfy them and those that do not. Such graded relations play a central role in many applications, yet they are typically not covered by existing Knowledge Graphs. In this… ▽ More

    Submitted 30 January, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EACL 2024 main conference

  14. arXiv:2305.12924  [pdf, other

    cs.CL

    EnCore: Fine-Grained Entity Ty** by Pre-Training Entity Encoders on Coreference Chains

    Authors: Frank Mtumbuka, Steven Schockaert

    Abstract: Entity ty** is the task of assigning semantic types to the entities that are mentioned in a text. In the case of fine-grained entity ty** (FET), a large set of candidate type labels is considered. Since obtaining sufficient amounts of manual annotations is then prohibitively expensive, FET models are typically trained using distant supervision. In this paper, we propose to improve on this proc… ▽ More

    Submitted 27 January, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: To appear at EACL 2024

  15. arXiv:2305.12802  [pdf, other

    cs.CL cs.AI

    Ultra-Fine Entity Ty** with Prior Knowledge about Labels: A Simple Clustering Based Strategy

    Authors: Na Li, Zied Bouraoui, Steven Schockaert

    Abstract: Ultra-fine entity ty** (UFET) is the task of inferring the semantic types, from a large set of fine-grained candidates, that apply to a given entity mention. This task is especially challenging because we only have a small number of training examples for many of the types, even with distant supervision strategies. State-of-the-art models, therefore, have to rely on prior knowledge about the type… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  16. arXiv:2305.09785  [pdf, other

    cs.CL cs.AI

    Distilling Semantic Concept Embeddings from Contrastively Fine-Tuned Language Models

    Authors: Na Li, Hanane Kteich, Zied Bouraoui, Steven Schockaert

    Abstract: Learning vectors that capture the meaning of concepts remains a fundamental challenge. Somewhat surprisingly, perhaps, pre-trained language models have thus far only enabled modest improvements to the quality of such concept embeddings. Current strategies for using language models typically represent a concept by averaging the contextualised representations of its mentions in some corpus. This is… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  17. arXiv:2305.08414  [pdf, other

    cs.CL cs.AI

    What's the Meaning of Superhuman Performance in Today's NLU?

    Authors: Simone Tedeschi, Johan Bos, Thierry Declerck, Jan Hajic, Daniel Hershcovich, Eduard H. Hovy, Alexander Koller, Simon Krek, Steven Schockaert, Rico Sennrich, Ekaterina Shutova, Roberto Navigli

    Abstract: In the last five years, there has been a significant focus in Natural Language Processing (NLP) on develo** larger Pretrained Language Models (PLMs) and introducing benchmarks such as SuperGLUE and SQuAD to measure their abilities in language understanding, reasoning, and reading comprehension. These PLMs have achieved impressive results on these benchmarks, even surpassing human performance in… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 9 pages, long paper at ACL 2023 proceedings

  18. arXiv:2210.05723  [pdf, other

    cs.AI cs.LG

    Embeddings as Epistemic States: Limitations on the Use of Pooling Operators for Accumulating Knowledge

    Authors: Steven Schockaert

    Abstract: Various neural network architectures rely on pooling operators to aggregate information coming from different sources. It is often implicitly assumed in such contexts that vectors encode epistemic states, i.e. that vectors capture the evidence that has been obtained about some properties of interest, and that pooling these vectors yields a vector that combines this evidence. We study, for a number… ▽ More

    Submitted 3 July, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  19. arXiv:2210.02771  [pdf, other

    cs.CL cs.AI cs.LG

    Modelling Commonsense Properties using Pre-Trained Bi-Encoders

    Authors: Amit Gajbhiye, Luis Espinosa-Anke, Steven Schockaert

    Abstract: Gras** the commonsense properties of everyday concepts is an important prerequisite to language understanding. While contextualised language models are reportedly capable of predicting such commonsense properties with human-level accuracy, we argue that such results have been inflated because of the high similarity between training and test concepts. This means that models which capture concept… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: COLING 2022

  20. arXiv:2112.01037  [pdf, other

    cs.CV cs.CL

    Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention

    Authors: Kun Yan, Chenbin Zhang, Jun Hou, ** Wang, Zied Bouraoui, Shoaib Jameel, Steven Schockaert

    Abstract: Multi-label few-shot image classification (ML-FSIC) is the task of assigning descriptive labels to previously unseen images, based on a small number of training examples. A key feature of the multi-label setting is that images often have multiple labels, which typically refer to different regions of the image. When estimating prototypes, in a metric-based setting, it is thus important to determine… ▽ More

    Submitted 7 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI2022

  21. Distilling Relation Embeddings from Pre-trained Language Models

    Authors: Asahi Ushio, Jose Camacho-Collados, Steven Schockaert

    Abstract: Pre-trained language models have been found to capture a surprisingly rich amount of lexical knowledge, ranging from commonsense properties of everyday concepts to detailed factual knowledge about named entities. Among others, this makes it possible to distill high-quality word vectors from pre-trained language models. However, it is currently unclear to what extent it is possible to distill relat… ▽ More

    Submitted 21 September, 2021; originally announced October 2021.

    Comments: EMNLP 2021 main conference

  22. arXiv:2106.14431  [pdf, ps, other

    cs.AI

    Modelling Monotonic and Non-Monotonic Attribute Dependencies with Embeddings: A Theoretical Analysis

    Authors: Steven Schockaert

    Abstract: During the last decade, entity embeddings have become ubiquitous in Artificial Intelligence. Such embeddings essentially serve as compact but semantically meaningful representations of the entities of interest. In most approaches, vectors are used for representing the entities themselves, as well as for representing their associated attributes. An important advantage of using attribute embeddings… ▽ More

    Submitted 14 September, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted for AKBC 2021

  23. arXiv:2106.07947  [pdf, other

    cs.CL

    Deriving Word Vectors from Contextualized Language Models using Topic-Aware Mention Selection

    Authors: Yixiao Wang, Zied Bouraoui, Luis Espinosa Anke, Steven Schockaert

    Abstract: One of the long-standing challenges in lexical semantics consists in learning representations of words which reflect their semantic properties. The remarkable success of word embeddings for this purpose suggests that high-quality representations can be obtained by summarizing the sentence contexts of word mentions. In this paper, we propose a method for learning word representations that follows t… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

  24. arXiv:2106.07285  [pdf, other

    cs.CL cs.AI

    Probing Pre-Trained Language Models for Disease Knowledge

    Authors: Israa Alghanmi, Luis Espinosa-Anke, Steven Schockaert

    Abstract: Pre-trained language models such as ClinicalBERT have achieved impressive results on tasks such as medical Natural Language Inference. At first glance, this may suggest that these models are able to perform medical reasoning tasks, such as map** symptoms to diseases. However, we find that standard benchmarks such as MedNLI contain relatively few examples that require such forms of reasoning. To… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL 2021 Findings

  25. arXiv:2105.10195  [pdf, other

    cs.CV

    Aligning Visual Prototypes with BERT Embeddings for Few-Shot Learning

    Authors: Kun Yan, Zied Bouraoui, ** Wang, Shoaib Jameel, Steven Schockaert

    Abstract: Few-shot learning (FSL) is the task of learning to recognize previously unseen categories of images from a small number of training examples. This is a challenging task, as the available examples may not be enough to unambiguously determine which visual features are most characteristic of the considered categories. To alleviate this issue, we propose a method that additionally takes into account t… ▽ More

    Submitted 21 May, 2021; originally announced May 2021.

    Comments: Accepted by ICMR2021

  26. arXiv:2105.04949  [pdf, other

    cs.CL cs.LG

    BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?

    Authors: Asahi Ushio, Luis Espinosa-Anke, Steven Schockaert, Jose Camacho-Collados

    Abstract: Analogies play a central role in human commonsense reasoning. The ability to recognize analogies such as "eye is to seeing what ear is to hearing", sometimes referred to as analogical proportions, shape how we structure knowledge and understand language. Surprisingly, however, the task of identifying such analogies has not yet received much attention in the language model era. In this paper, we an… ▽ More

    Submitted 9 September, 2022; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: Accepted by ACL 2021 main conference

  27. arXiv:2105.04620  [pdf, ps, other

    cs.AI

    A Description Logic for Analogical Reasoning

    Authors: Steven Schockaert, Yazmín Ibáñez-García, Víctor Gutiérrez-Basulto

    Abstract: Ontologies formalise how the concepts from a given domain are interrelated. Despite their clear potential as a backbone for explainable AI, existing ontologies tend to be highly incomplete, which acts as a significant barrier to their more widespread adoption. To mitigate this issue, we present a mechanism to infer plausible missing knowledge, which relies on reasoning by analogy. To the best of o… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: Accepted for IJCAI 2021

  28. arXiv:2102.00801  [pdf, ps, other

    cs.CV cs.AI

    Few-shot Image Classification with Multi-Facet Prototypes

    Authors: Kun Yan, Zied Bouraoui, ** Wang, Shoaib Jameel, Steven Schockaert

    Abstract: The aim of few-shot learning (FSL) is to learn how to recognize image categories from a small number of training examples. A central challenge is that the available training examples are normally insufficient to determine which visual features are most characteristic of the considered categories. To address this challenge, we organize these visual features into facets, which intuitively group feat… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted by ICASSP2021

  29. arXiv:2012.07580  [pdf, other

    cs.CL cs.AI

    Modelling General Properties of Nouns by Selectively Averaging Contextualised Embeddings

    Authors: Na Li, Zied Bouraoui, Jose Camacho Collados, Luis Espinosa-Anke, Qing Gu, Steven Schockaert

    Abstract: While the success of pre-trained language models has largely eliminated the need for high-quality static word vectors in many NLP applications, such vectors continue to play an important role in tasks where words need to be modelled in the absence of linguistic context. In this paper, we explore how the contextualised embeddings predicted by BERT can be used to produce high-quality word vectors fo… ▽ More

    Submitted 17 May, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

  30. arXiv:2011.08320  [pdf, ps, other

    cs.CL

    Don't Patronize Me! An Annotated Dataset with Patronizing and Condescending Language towards Vulnerable Communities

    Authors: Carla Pérez-Almendros, Luis Espinosa-Anke, Steven Schockaert

    Abstract: In this paper, we introduce a new annotated dataset which is aimed at supporting the development of NLP models to identify and categorize language that is patronizing or condescending towards vulnerable communities (e.g. refugees, homeless people, poor families). While the prevalence of such language in the general media has long been shown to have harmful effects, it differs from other types of h… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

  31. arXiv:2006.14437  [pdf, other

    cs.AI

    Plausible Reasoning about EL-Ontologies using Concept Interpolation

    Authors: Yazmín Ibáñez-García, Víctor Gutiérrez-Basulto, Steven Schockaert

    Abstract: Description logics (DLs) are standard knowledge representation languages for modelling ontologies, i.e. knowledge about concepts and the relations between them. Unfortunately, DL ontologies are difficult to learn from data and time-consuming to encode manually. As a result, ontologies for broad domains are almost inevitably incomplete. In recent years, several data-driven approaches have been prop… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: 16 pages, 3 figures, accepted at KR 2020

  32. arXiv:1912.06612  [pdf, ps, other

    cs.AI

    From Shallow to Deep Interactions Between Knowledge Representation, Reasoning and Machine Learning (Kay R. Amel group)

    Authors: Zied Bouraoui, Antoine Cornuéjols, Thierry Denœux, Sébastien Destercke, Didier Dubois, Romain Guillaume, João Marques-Silva, Jérôme Mengin, Henri Prade, Steven Schockaert, Mathieu Serrurier, Christel Vrain

    Abstract: This paper proposes a tentative and original survey of meeting points between Knowledge Representation and Reasoning (KRR) and Machine Learning (ML), two areas which have been develo** quite separately in the last three decades. Some common concerns are identified and discussed such as the types of used representation, the roles of knowledge and data, the lack or the excess of information, or th… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

    Comments: 53 pages

  33. arXiv:1912.01220  [pdf, other

    cs.CL cs.AI

    Modelling Semantic Categories using Conceptual Neighborhood

    Authors: Zied Bouraoui, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

    Abstract: While many methods for learning vector space embeddings have been proposed in the field of Natural Language Processing, these methods typically do not distinguish between categories and individuals. Intuitively, if individuals are represented as vectors, we can think of categories as (soft) regions in the embedding space. Unfortunately, meaningful regions can be difficult to estimate, especially s… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted to AAAI 2020

  34. arXiv:1911.12753  [pdf, ps, other

    cs.CL cs.AI

    Inducing Relational Knowledge from BERT

    Authors: Zied Bouraoui, Jose Camacho-Collados, Steven Schockaert

    Abstract: One of the most remarkable properties of word embeddings is the fact that they capture certain types of semantic and syntactic relationships. Recently, pre-trained language models such as BERT have achieved groundbreaking results across a wide range of Natural Language Processing tasks. However, it is unclear to what extent such models capture relational knowledge beyond what is already captured b… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: Accepted to AAAI 2020

  35. arXiv:1910.07221  [pdf, other

    cs.CL

    Meemi: A Simple Method for Post-processing and Integrating Cross-lingual Word Embeddings

    Authors: Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

    Abstract: Word embeddings have become a standard resource in the toolset of any Natural Language Processing practitioner. While monolingual word embeddings encode information about words in the context of a particular language, cross-lingual embeddings define a multilingual space where word embeddings from two or more languages are integrated together. Current state-of-the-art approaches learn these embeddi… ▽ More

    Submitted 11 November, 2020; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: 22 pages, 2 figures, 9 tables. Preprint submitted to Natural Language Engineering

    MSC Class: 68T50

  36. arXiv:1909.06414  [pdf, other

    cs.CL cs.AI cs.LG

    Learning Household Task Knowledge from WikiHow Descriptions

    Authors: Yilun Zhou, Julie A. Shah, Steven Schockaert

    Abstract: Commonsense procedural knowledge is important for AI agents and robots that operate in a human environment. While previous attempts at constructing procedural knowledge are mostly rule- and template-based, recent advances in deep learning provide the possibility of acquiring such knowledge directly from natural language sources. As a first step in this direction, we propose a model to learn embedd… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: IJCAI 2019 Workshop on Semantic Deep Learning

  37. arXiv:1908.07742  [pdf, other

    cs.CL

    On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning

    Authors: Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

    Abstract: Cross-lingual word embeddings are vector representations of words in different languages where words with similar meaning are represented by similar vectors, regardless of the language. Recent developments which construct these embeddings by aligning monolingual spaces have shown that accurate alignments can be obtained with little or no supervision. However, the focus has been on a particular con… ▽ More

    Submitted 3 March, 2020; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: 11 pages, 2 figures, 7 tables. Camera-ready submitted to LREC 2020

  38. arXiv:1906.01373  [pdf, other

    cs.CL

    Relational Word Embeddings

    Authors: Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

    Abstract: While word embeddings have been shown to implicitly encode various forms of attributional knowledge, the extent to which they capture relational information is far more limited. In previous work, this limitation has been addressed by incorporating relational knowledge from external knowledge bases when learning the word embedding. Such strategies may not be optimal, however, as they are limited by… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: To appear at ACL 2019. 11 pages

  39. arXiv:1905.07358  [pdf, other

    cs.CL cs.SI

    Learning Cross-lingual Embeddings from Twitter via Distant Supervision

    Authors: Jose Camacho-Collados, Yerai Doval, Eugenio Martínez-Cámara, Luis Espinosa-Anke, Francesco Barbieri, Steven Schockaert

    Abstract: Cross-lingual embeddings represent the meaning of words from different languages in the same vector space. Recent work has shown that it is possible to construct such representations by aligning independently learned monolingual embedding spaces, and that accurate alignments can be obtained even without external bilingual data. In this paper we explore a research direction that has been surprising… ▽ More

    Submitted 31 March, 2020; v1 submitted 17 May, 2019; originally announced May 2019.

    Comments: Accepted to ICWSM 2020. 11 pages, 1 appendix. Pre-trained embeddings available at https://github.com/pedrada88/crossembeddings-twitter

  40. Predicting ConceptNet Path Quality Using Crowdsourced Assessments of Naturalness

    Authors: Yilun Zhou, Steven Schockaert, Julie A. Shah

    Abstract: In many applications, it is important to characterize the way in which two concepts are semantically related. Knowledge graphs such as ConceptNet provide a rich source of information for such characterizations by encoding relations between concepts as edges in a graph. When two concepts are not directly connected by an edge, their relationship can still be described in terms of the paths that conn… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: In Proceedings of the Web Conference (WWW) 2019

  41. arXiv:1810.12091  [pdf, other

    cs.IR cs.CL cs.CV cs.LG stat.ML

    Embedding Geographic Locations for Modelling the Natural Environment using Flickr Tags and Structured Data

    Authors: Shelan S. Jeawak, Christopher B. Jones, Steven Schockaert

    Abstract: Meta-data from photo-sharing websites such as Flickr can be used to obtain rich bag-of-words descriptions of geographic locations, which have proven valuable, among others, for modelling and predicting ecological features. One important insight from previous work is that the descriptions obtained from Flickr tend to be complementary to the structured information that is available from traditional… ▽ More

    Submitted 12 October, 2018; originally announced October 2018.

  42. arXiv:1808.08780  [pdf, other

    cs.CL

    Improving Cross-Lingual Word Embeddings by Meeting in the Middle

    Authors: Yerai Doval, Jose Camacho-Collados, Luis Espinosa-Anke, Steven Schockaert

    Abstract: Cross-lingual word embeddings are becoming increasingly important in multilingual NLP. Recently, it has been shown that these embeddings can be effectively learned by aligning two disjoint monolingual vector spaces through linear transformations, using no more than a small bilingual dictionary as supervision. In this work, we propose to apply an additional transformation after the initial alignmen… ▽ More

    Submitted 27 August, 2018; originally announced August 2018.

    Comments: 11 pages, 4 tables, 1 figure. EMNLP 2018 camera-ready

  43. arXiv:1808.06068  [pdf, ps, other

    cs.CL

    SeVeN: Augmenting Word Embeddings with Unsupervised Relation Vectors

    Authors: Luis Espinosa-Anke, Steven Schockaert

    Abstract: We present SeVeN (Semantic Vector Networks), a hybrid resource that encodes relationships between words in the form of a graph. Different from traditional semantic networks, these relations are represented as vectors in a continuous vector space. We propose a simple pipeline for learning such relation vectors, which is based on word vector averaging in combination with an ad hoc autoencoder. We sh… ▽ More

    Submitted 18 August, 2018; originally announced August 2018.

  44. arXiv:1805.10461  [pdf, other

    cs.AI

    From Knowledge Graph Embedding to Ontology Embedding? An Analysis of the Compatibility between Vector Space Representations and Rules

    Authors: Víctor Gutiérrez-Basulto, Steven Schockaert

    Abstract: Recent years have witnessed the successful application of low-dimensional vector space representations of knowledge graphs to predict missing facts or find erroneous ones. However, it is not yet well-understood to what extent ontological knowledge, e.g. given as a set of (existential) rules, can be embedded in a principled way. To address this shortcoming, in this paper we introduce a general fram… ▽ More

    Submitted 21 August, 2018; v1 submitted 26 May, 2018; originally announced May 2018.

    Comments: Full version of a paper accepted at KR-2018

  45. arXiv:1805.01276  [pdf, other

    cs.AI

    Learning Conceptual Space Representations of Interrelated Concepts

    Authors: Zied Bouraoui, Steven Schockaert

    Abstract: Several recently proposed methods aim to learn conceptual space representations from large text collections. These learned representations asso- ciate each object from a given domain of interest with a point in a high-dimensional Euclidean space, but they do not model the concepts from this do- main, and can thus not directly be used for catego- rization and related cognitive tasks. A natural solu… ▽ More

    Submitted 4 May, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

  46. arXiv:1804.06188  [pdf, ps, other

    cs.LG cs.AI stat.ML

    VC-Dimension Based Generalization Bounds for Relational Learning

    Authors: Ondrej Kuzelka, Yuyi Wang, Steven Schockaert

    Abstract: In many applications of relational learning, the available data can be seen as a sample from a larger relational structure (e.g. we may be given a small fragment from some social network). In this paper we are particularly concerned with scenarios in which we can assume that (i) the domain elements appearing in the given sample have been uniformly sampled without replacement from the (unknown) ful… ▽ More

    Submitted 4 July, 2018; v1 submitted 17 April, 2018; originally announced April 2018.

    Comments: Longer version of paper accepted at ECML PKDD 2018

  47. arXiv:1803.05768  [pdf, ps, other

    cs.AI cs.LG

    PAC-Reasoning in Relational Domains

    Authors: Ondrej Kuzelka, Yuyi Wang, Jesse Davis, Steven Schockaert

    Abstract: We consider the problem of predicting plausible missing facts in relational data, given a set of imperfect logical rules. In particular, our aim is to provide bounds on the (expected) number of incorrect inferences that are made in this way. Since for classical inference it is in general impossible to bound this number in a non-trivial way, we consider two inference relations that weaken, but rema… ▽ More

    Submitted 4 July, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: Longer version of paper appearing in UAI 2018

  48. arXiv:1711.05294  [pdf, other

    cs.CL

    Modeling Semantic Relatedness using Global Relation Vectors

    Authors: Shoaib Jameel, Zied Bouraoui, Steven Schockaert

    Abstract: Word embedding models such as GloVe rely on co-occurrence statistics from a large corpus to learn vector representations of word meaning. These vectors have proven to capture surprisingly fine-grained semantic and syntactic information. While we may similarly expect that co-occurrence statistics can be used to capture rich information about the relationships between different words, existing appro… ▽ More

    Submitted 14 November, 2017; originally announced November 2017.

  49. arXiv:1710.02221  [pdf, other

    cs.LG cs.AI stat.ML

    Stacked Structure Learning for Lifted Relational Neural Networks

    Authors: Gustav Sourek, Martin Svatos, Filip Zelezny, Steven Schockaert, Ondrej Kuzelka

    Abstract: Lifted Relational Neural Networks (LRNNs) describe relational domains using weighted first-order rules which act as templates for constructing feed-forward neural networks. While previous work has shown that using LRNNs can lead to state-of-the-art results in various ILP tasks, these results depended on hand-crafted rules. In this paper, we extend the framework of LRNNs with structure learning, th… ▽ More

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: Presented at ILP 2017

  50. arXiv:1709.05825  [pdf, ps, other

    cs.AI

    Relational Marginal Problems: Theory and Estimation

    Authors: Ondrej Kuzelka, Yuyi Wang, Jesse Davis, Steven Schockaert

    Abstract: In the propositional setting, the marginal problem is to find a (maximum-entropy) distribution that has some given marginals. We study this problem in a relational setting and make the following contributions. First, we compare two different notions of relational marginals. Second, we show a duality between the resulting relational marginal problems and the maximum likelihood estimation of the par… ▽ More

    Submitted 25 April, 2018; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Long version of a paper that appeared in AAAI 2018; added a paragraph to Related Work