Skip to main content

Showing 1–6 of 6 results for author: Gelderloos, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2105.05582  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Discrete representations in neural models of spoken language

    Authors: Bertrand Higy, Lieke Gelderloos, Afra Alishahi, Grzegorz Chrupała

    Abstract: The distributed and continuous representations used by neural networks are at odds with representations employed in linguistics, which are typically symbolic. Vector quantization has been proposed as a way to induce discrete neural representations that are closer in nature to their linguistic counterparts. However, it is not clear which metrics are the best-suited to analyze such discrete represen… ▽ More

    Submitted 16 September, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: Accepted for publication at BlackboxNLP 2021

  2. arXiv:2005.02721  [pdf, other

    cs.CL

    Learning to Understand Child-directed and Adult-directed Speech

    Authors: Lieke Gelderloos, Grzegorz Chrupała, Afra Alishahi

    Abstract: Speech directed to children differs from adult-directed speech in linguistic aspects such as repetition, word choice, and sentence length, as well as in aspects of the speech signal itself, such as prosodic and phonemic variation. Human language acquisition research indicates that child-directed speech helps language learners. This study explores the effect of child-directed speech when learning t… ▽ More

    Submitted 16 July, 2021; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: Authors found an error in preprocessing of transcriptions before they were fed to SBERT. After correction, the experiments were rerun. The updated results can be found in this version. Importantly, - Most scores were affected to a small degree (performance was slightly worse). - The effect was consistent across conditions. Therefore, the general patterns remain the same

  3. arXiv:1906.01530  [pdf, other

    cs.CL cs.AI cs.CV

    The PhotoBook Dataset: Building Common Ground through Visually-Grounded Dialogue

    Authors: Janosch Haber, Tim Baumgärtner, Ece Takmaz, Lieke Gelderloos, Elia Bruni, Raquel Fernández

    Abstract: This paper introduces the PhotoBook dataset, a large-scale collection of visually-grounded, task-oriented dialogues in English designed to investigate shared dialogue history accumulating during conversation. Taking inspiration from seminal work on dialogue analysis, we propose a data-collection task formulated as a collaborative game prompting two online participants to refer to images utilising… ▽ More

    Submitted 26 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: Updates 26-06-2019: Changed caption sizes to comply with the ACL style guidelines and corrected some references

  4. arXiv:1803.08869  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    On the difficulty of a distributional semantics of spoken language

    Authors: Grzegorz Chrupała, Lieke Gelderloos, Ákos Kádár, Afra Alishahi

    Abstract: In the domain of unsupervised learning most work on speech has focused on discovering low-level constructs such as phoneme inventories or word-like units. In contrast, for written language, where there is a large body of work on unsupervised induction of semantic representations of words, whole sentences and longer texts. In this study we examine the challenges of adapting these approaches from wr… ▽ More

    Submitted 26 October, 2018; v1 submitted 23 March, 2018; originally announced March 2018.

    Comments: Proceedings of the Society for Computation in Linguistics 2019

  5. arXiv:1702.01991  [pdf, other

    cs.CL cs.AI cs.LG

    Representations of language in a model of visually grounded speech signal

    Authors: Grzegorz Chrupała, Lieke Gelderloos, Afra Alishahi

    Abstract: We present a visually grounded model of speech perception which projects spoken utterances and images to a joint semantic space. We use a multi-layer recurrent highway network to model the temporal nature of spoken speech, and show that it learns to extract both form and meaning-based linguistic knowledge from the input signal. We carry out an in-depth analysis of the representations used by diffe… ▽ More

    Submitted 30 June, 2017; v1 submitted 7 February, 2017; originally announced February 2017.

    Comments: Accepted at ACL 2017

  6. arXiv:1610.03342  [pdf, other

    cs.CL cs.LG

    From phonemes to images: levels of representation in a recurrent neural model of visually-grounded language learning

    Authors: Lieke Gelderloos, Grzegorz Chrupała

    Abstract: We present a model of visually-grounded language learning based on stacked gated recurrent neural networks which learns to predict visual features given an image description in the form of a sequence of phonemes. The learning task resembles that faced by human language learners who need to discover both structure and meaning from noisy and ambiguous data across modalities. We show that our model i… ▽ More

    Submitted 11 October, 2016; originally announced October 2016.

    Comments: Accepted at COLING 2016