Showing 1–9 of 9 results for author: Trott, S

  1. arXiv:2406.14678  [pdf, other]

    cs.CL

    Bidirectional Transformer Representations of (Spanish) Ambiguous Words in Context: A New Lexical Resource and Empirical Analysis

    Authors: Pamela D. Rivière, Anne L. Beatty-Martínez, Sean Trott

    Abstract: Lexical ambiguity -- where a single wordform takes on distinct, context-dependent meanings -- serves as a useful tool to compare across different large language models' (LLMs') ability to form distinct, contextualized representations of the same stimulus. Few studies have systematically compared LLMs' contextualized word embeddings for languages beyond English. Here, we evaluate multiple bidirecti…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 16 pages, 12 figures, submitted to conference (EMNLP 2024)

  2. Do language models capture implied discourse meanings? An investigation with exhaustivity implicatures of Korean morphology

    Authors: Hagyeong Shin, Sean Trott

    Abstract: Markedness in natural language is often associated with non-literal meanings in discourse. Differential Object Marking (DOM) in Korean is one instance of this phenomenon, where post-positional markers are selected based on both the semantic features of the noun phrases and the discourse features that are orthogonal to the semantic features. Previous work has shown that distributional models of lan…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Proceedings of the Society for Computation in Linguistics (SCiL) 2024, Association for Computational Linguistics (ACL) Anthology

  3. arXiv:2403.13754  [pdf, other]

    cs.CL

    Different Tokenization Schemes Lead to Comparable Performance in Spanish Number Agreement

    Authors: Catherine Arnett, Pamela D. Rivière, Tyler A. Chang, Sean Trott

    Abstract: The relationship between language model tokenization and performance is an open area of research. Here, we investigate how different tokenization schemes impact number agreement in Spanish plurals. We find that morphologically-aligned tokenization performs similarly to other tokenization schemes, even when induced artificially for words that would not be tokenized that way during training. We then…

    Submitted 20 March, 2024; originally announced March 2024.

  4. arXiv:2209.01515  [pdf, other]

    cs.CL cs.AI

    Do Large Language Models know what humans know?

    Authors: Sean Trott, Cameron Jones, Tyler Chang, James Michaelov, Benjamin Bergen

    Abstract: Humans can attribute beliefs to others. However, it is unknown to what extent this ability results from an innate biological endowment or from experience accrued through child development, particularly exposure to language describing others' mental states. We test the viability of the language exposure hypothesis by assessing whether models exposed to large quantities of human language display sen…

    Submitted 31 May, 2023; v1 submitted 3 September, 2022; originally announced September 2022.

  5. arXiv:2203.05648  [pdf, other]

    cs.CL

    Contextualized Sensorimotor Norms: multi-dimensional measures of sensorimotor strength for ambiguous English words, in context

    Authors: Sean Trott, Benjamin Bergen

    Abstract: Most large language models are trained on linguistic input alone, yet humans appear to ground their understanding of words in sensorimotor experience. A natural solution is to augment LM representations with human judgments of a word's sensorimotor associations (e.g., the Lancaster Sensorimotor Norms), but this raises another challenge: most words are ambiguous, and judgments of words in isolation…

    Submitted 10 March, 2022; originally announced March 2022.

  6. arXiv:2105.13266  [pdf, other]

    cs.CL

    RAW-C: Relatedness of Ambiguous Words--in Context (A New Lexical Resource for English)

    Authors: Sean Trott, Benjamin Bergen

    Abstract: Most words are ambiguous--i.e., they convey distinct meanings in different contexts--and even the meanings of unambiguous words are context-dependent. Both phenomena present a challenge for NLP. Recently, the advent of contextualized word embeddings has led to success on tasks involving lexical ambiguity, such as Word Sense Disambiguation. However, there are few tasks that directly evaluate how we…

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: ACL-IJCNLP 2021 camera-ready

  7. arXiv:2005.09099  [pdf, other]

    cs.CL

    (Re)construing Meaning in NLP

    Authors: Sean Trott, Tiago Timponi Torrent, Nancy Chang, Nathan Schneider

    Abstract: Human speakers have an extensive toolkit of ways to express themselves. In this paper, we engage with an idea largely absent from discussions of meaning in natural language understanding--namely, that the way something is expressed reflects different ways of conceptualizing or construing the information being conveyed. We first define this phenomenon more precisely, drawing on considerable prior w…

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: ACL 2020 camera-ready

  8. arXiv:1607.06875  [pdf, other]

    cs.AI cs.CL cs.HC cs.RO

    Processing Natural Language About Ongoing Actions

    Authors: Steve Doubleday, Sean Trott, Jerome Feldman

    Abstract: Actions may not proceed as planned; they may be interrupted, resumed or overridden. This is a challenge to handle in a natural language understanding system. We describe extensions to an existing implementation for the control of autonomous systems by natural language, to enable such systems to handle incoming language requests regarding actions. Language Communication with Autonomous Systems (LCA…

    Submitted 30 July, 2016; v1 submitted 22 July, 2016; originally announced July 2016.

    Comments: 6 pages, 8 figures. Updated with PIPE citations

  9. arXiv:1604.06721  [pdf, other]

    cs.AI cs.CL cs.RO

    Exploiting Deep Semantics and Compositionality of Natural Language for Human-Robot-Interaction

    Authors: Manfred Eppe, Sean Trott, Jerome Feldman

    Abstract: We develop a natural language interface for human robot interaction that implements reasoning about deep semantics in natural language. To realize the required deep analysis, we employ methods from cognitive linguistics, namely the modular and compositional framework of Embodied Construction Grammar (ECG) [Feldman, 2009]. Using ECG, robots are able to solve fine-grained reference resolution proble…

    Submitted 22 April, 2016; originally announced April 2016.