Skip to main content

Showing 1–5 of 5 results for author: Desbordes, T

.
  1. arXiv:2311.16978  [pdf, other

    cs.CL

    Assessing the influence of attractor-verb distance on grammatical agreement in humans and language models

    Authors: Christos-Nikolaos Zacharopoulos, Théo Desbordes, Mathias Sablé-Meyer

    Abstract: Subject-verb agreement in the presence of an attractor noun located between the main noun and the verb elicits complex behavior: judgments of grammaticality are modulated by the grammatical features of the attractor. For example, in the sentence "The girl near the boys likes climbing", the attractor (boys) disagrees in grammatical number with the verb (likes), creating a locally implausible transi… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 10 pages (5 main, 2 refs, 3 supplementary) ; 5 figures (3 main, 2 supplementary) ; accepted at EMNLP 2023 (no DOI yet)

  2. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  3. arXiv:2110.07240  [pdf, other

    cs.CL

    Causal Transformers Perform Below Chance on Recursive Nested Constructions, Unlike Humans

    Authors: Yair Lakretz, Théo Desbordes, Dieuwke Hupkes, Stanislas Dehaene

    Abstract: Recursive processing is considered a hallmark of human linguistic abilities. A recent study evaluated recursive processing in recurrent neural language models (RNN-LMs) and showed that such models perform below chance level on embedded dependencies within nested constructions -- a prototypical example of recursion in natural language. Here, we study if state-of-the-art Transformer LMs do any bette… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: None

  4. arXiv:2101.02258  [pdf, other

    cs.CL

    Can RNNs learn Recursive Nested Subject-Verb Agreements?

    Authors: Yair Lakretz, Théo Desbordes, Jean-Rémi King, Benoît Crabbé, Maxime Oquab, Stanislas Dehaene

    Abstract: One of the fundamental principles of contemporary linguistics states that language processing requires the ability to extract recursively nested tree structures. However, it remains unclear whether and how this code could be implemented in neural circuits. Recent advances in Recurrent Neural Networks (RNNs), which achieve near-human performance in some language tasks, provide a compelling model to… ▽ More

    Submitted 6 January, 2021; originally announced January 2021.

  5. arXiv:1903.07435  [pdf, other

    cs.CL

    The emergence of number and syntax units in LSTM language models

    Authors: Yair Lakretz, German Kruszewski, Theo Desbordes, Dieuwke Hupkes, Stanislas Dehaene, Marco Baroni

    Abstract: Recent work has shown that LSTMs trained on a generic language modeling objective capture syntax-sensitive generalizations such as long-distance number agreement. We have however no mechanistic understanding of how they accomplish this remarkable feat. Some have conjectured it depends on heuristics that do not truly take hierarchical structure into account. We present here a detailed study of the… ▽ More

    Submitted 2 April, 2019; v1 submitted 18 March, 2019; originally announced March 2019.

    Comments: To appear in Proceedings of NAACL, Minneapolis, MN, 2019