Skip to main content

Showing 1–8 of 8 results for author: Herbelot, A

.
  1. arXiv:2310.10262  [pdf, other

    cs.CL

    Enhancing Interpretability using Human Similarity Judgements to Prune Word Embeddings

    Authors: Natalia Flechas Manrique, Wanqian Bao, Aurelie Herbelot, Uri Hasson

    Abstract: Interpretability methods in NLP aim to provide insights into the semantics underlying specific system architectures. Focusing on word embeddings, we present a supervised-learning method that, for a given domain (e.g., sports, professions), identifies a subset of model features that strongly improve prediction of human similarity judgments. We show this method keeps only 20-40% of the original embe… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted for presentation at the BlackboxNLP workshop at EMNLP 2023

  2. arXiv:2302.03589  [pdf, other

    cs.CL

    CALaMo: a Constructionist Assessment of Language Models

    Authors: Ludovica Pannitto, Aurélie Herbelot

    Abstract: This paper presents a novel framework for evaluating Neural Language Models' linguistic abilities using a constructionist approach. Not only is the usage-based model in line with the underlying stochastic philosophy of neural architectures, but it also allows the linguist to keep meaning as a determinant factor in the analysis. We outline the framework and present two possible scenarios for its ap… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  3. arXiv:2104.10270  [pdf, other

    cs.CL cs.AI

    Novel Aficionados and Doppelgängers: a referential task for semantic representations of individual entities

    Authors: Andrea Bruera, Aurélie Herbelot

    Abstract: In human semantic cognition, proper names (names which refer to individual entities) are harder to learn and retrieve than common nouns. This seems to be the case for machine learning algorithms too, but the linguistic and distributional reasons for this behaviour have not been investigated in depth so far. To tackle this issue, we show that the semantic distinction between proper names and common… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

  4. arXiv:2010.04637  [pdf, other

    cs.CL

    Recurrent babbling: evaluating the acquisition of grammar from limited input data

    Authors: Ludovica Pannitto, Aurélie Herbelot

    Abstract: Recurrent Neural Networks (RNNs) have been shown to capture various aspects of syntax from raw linguistic input. In most previous experiments, however, learning happens over unrealistic corpora, which do not reflect the type and amount of data a child would be exposed to. This paper remedies this state of affairs by training a Long Short-Term Memory network (LSTM) over a realistically sized subset… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  5. arXiv:2009.07936  [pdf, other

    cs.CL

    How to marry a star: probabilistic constraints for meaning in context

    Authors: Katrin Erk, Aurelie Herbelot

    Abstract: In this paper, we derive a notion of 'word meaning in context' that characterizes meaning as both intensional and conceptual. We introduce a framework for specifying local as well as global constraints on word meaning in context, together with their interactions, thus modelling the wide range of lexical shifts and ambiguities observed in utterance interpretation. We represent sentence meaning as a… ▽ More

    Submitted 12 September, 2022; v1 submitted 16 September, 2020; originally announced September 2020.

  6. arXiv:1707.06556  [pdf, other

    cs.CL cs.LG

    High-risk learning: acquiring new word vectors from tiny data

    Authors: Aurelie Herbelot, Marco Baroni

    Abstract: Distributional semantics models are known to struggle with small data. It is generally accepted that in order to learn 'a good vector' for a word, a model must have sufficient examples of its usage. This contradicts the fact that humans can guess the meaning of a word from a few occurrences only. In this paper, we show that a neural language model such as Word2Vec only necessitates minor modificat… ▽ More

    Submitted 20 July, 2017; originally announced July 2017.

    Comments: Accepted as short paper at EMNLP 2017

  7. arXiv:1705.01359  [pdf, other

    cs.CV cs.CL cs.MM

    FOIL it! Find One mismatch between Image and Language caption

    Authors: Ravi Shekhar, Sandro Pezzelle, Yauhen Klimovich, Aurelie Herbelot, Moin Nabi, Enver Sangineto, Raffaella Bernardi

    Abstract: In this paper, we aim to understand whether current language and vision (LaVi) models truly grasp the interaction between the two modalities. To this end, we propose an extension of the MSCOCO dataset, FOIL-COCO, which associates images with both correct and "foil" captions, that is, descriptions of the image that are highly similar to the original ones, but contain one single mistake ("foil word"… ▽ More

    Submitted 3 May, 2017; originally announced May 2017.

    Comments: To appear at ACL 2017

  8. arXiv:1704.02923  [pdf, other

    cs.CL cs.AI cs.CV

    Pay Attention to Those Sets! Learning Quantification from Images

    Authors: Ionut Sorodoc, Sandro Pezzelle, Aurélie Herbelot, Mariella Dimiccoli, Raffaella Bernardi

    Abstract: Major advances have recently been made in merging language and vision representations. But most tasks considered so far have confined themselves to the processing of objects and lexicalised relations amongst objects (content words). We know, however, that humans (even pre-school children) can abstract over raw data to perform certain types of higher-level reasoning, expressed in natural language b… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: Submitted to Journal Paper, 28 pages, 12 figures, 5 tables