Skip to main content

Showing 1–17 of 17 results for author: Pierrehumbert, J B

.
  1. arXiv:2404.18543  [pdf, other

    cs.CL cs.CE cs.LG

    Time Machine GPT

    Authors: Felix Drinkall, Eghbal Rahimikia, Janet B. Pierrehumbert, Stefan Zohren

    Abstract: Large language models (LLMs) are often trained on extensive, temporally indiscriminate text corpora, reflecting the lack of datasets with temporal metadata. This approach is not aligned with the evolving nature of language. Conventional methods for creating temporally adapted language models often depend on further pre-training static models on time-specific data. This paper presents a new approac… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: NAACL Findings 2024

    MSC Class: I.2.1; I.2.7

  2. arXiv:2404.03301  [pdf, other

    cs.CL

    Probing Large Language Models for Scalar Adjective Lexical Semantics and Scalar Diversity Pragmatics

    Authors: Fangru Lin, Daniel Altshuler, Janet B. Pierrehumbert

    Abstract: Scalar adjectives pertain to various domain scales and vary in intensity within each scale (e.g. certain is more intense than likely on the likelihood scale). Scalar implicatures arise from the consideration of alternative statements which could have been made. They can be triggered by scalar adjectives and require listeners to reason pragmatically about them. Some scalar adjectives are more likel… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted for the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  3. arXiv:2403.15885  [pdf, other

    cs.CL

    STEntConv: Predicting Disagreement with Stance Detection and a Signed Graph Convolutional Network

    Authors: Isabelle Lorge, Li Zhang, Xiaowen Dong, Janet B. Pierrehumbert

    Abstract: The rise of social media platforms has led to an increase in polarised online discussions, especially on political and socio-cultural topics such as elections and climate change. We propose a simple and novel unsupervised method to predict whether the authors of two posts agree or disagree, leveraging user stances about named entities obtained from their posts. We present STEntConv, a model which… ▽ More

    Submitted 26 March, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted for the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  4. arXiv:2402.02805  [pdf, other

    cs.AI cs.CL cs.LG

    Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

    Authors: Fangru Lin, Emanuele La Malfa, Valentin Hofmann, Elle Michelle Yang, Anthony Cohn, Janet B. Pierrehumbert

    Abstract: Planning is a fundamental property of human intelligence. Reasoning about asynchronous plans is challenging since it requires sequential and parallel planning to optimize time costs. Can large language models (LLMs) succeed at this task? Here, we present the first large-scale study investigating this question. We find that a representative set of closed and open-source LLMs, including GPT-4 and LL… ▽ More

    Submitted 3 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML-2024

  5. arXiv:2212.07547  [pdf, other

    cs.CL cs.AI cs.SI

    Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology

    Authors: Valentin Hofmann, Janet B. Pierrehumbert, Hinrich Schütze

    Abstract: We propose a fully unsupervised method to detect bias in contextualized embeddings. The method leverages the assortative information latently encoded by social networks and combines orthogonality regularization, structured sparsity learning, and graph neural networks to find the embedding subspace capturing this information. As a concrete example, we focus on the phenomenon of ideological bias: we… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: ICML 2022

  6. arXiv:2205.10408  [pdf, other

    cs.CL cs.SI

    Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts

    Authors: Felix Drinkall, Stefan Zohren, Janet B. Pierrehumbert

    Abstract: We present a novel approach incorporating transformer-based language models into infectious disease modelling. Text-derived features are quantified by tracking high-density clusters of sentence-level representations of Reddit posts within specific US states' COVID-19 subreddits. We benchmark these clustered embedding features against features extracted from other high-quality datasets. In a thresh… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  7. arXiv:2203.08565  [pdf, other

    cs.CL

    Geographic Adaptation of Pretrained Language Models

    Authors: Valentin Hofmann, Goran Glavaš, Nikola Ljubešić, Janet B. Pierrehumbert, Hinrich Schütze

    Abstract: While pretrained language models (PLMs) have been shown to possess a plethora of linguistic knowledge, the existing body of research has largely neglected extralinguistic knowledge, which is generally difficult to obtain by pretraining on text alone. Here, we contribute to closing this gap by examining geolinguistic knowledge, i.e., knowledge about geographic variation in language. We introduce ge… ▽ More

    Submitted 28 January, 2024; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: TACL 2024 (pre-MIT Press publication version)

  8. arXiv:2112.07475  [pdf, other

    cs.CL

    Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks

    Authors: Paul Röttger, Bertie Vidgen, Dirk Hovy, Janet B. Pierrehumbert

    Abstract: Labelled data is the foundation of most natural language processing tasks. However, labelling data is difficult and there often are diverse valid beliefs about what the correct data labels should be. So far, dataset creators have acknowledged annotator subjectivity, but rarely actively managed it in the annotation process. This has led to partly-subjective datasets that fail to serve a clear downs… ▽ More

    Submitted 29 April, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted at NAACL 2022 (Main Conference)

  9. arXiv:2104.08829  [pdf, other

    cs.CL cs.AI cs.SI

    Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity

    Authors: Valentin Hofmann, Xiaowen Dong, Janet B. Pierrehumbert, Hinrich Schütze

    Abstract: The increasing polarization of online political discourse calls for computational tools that automatically detect and monitor ideological divides in social media. We introduce a minimally supervised method that leverages the network structure of online discussion forums, specifically Reddit, to detect polarized concepts. We model polarization along the dimensions of salience and framing, drawing u… ▽ More

    Submitted 14 December, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: NAACL 2022 (Findings)

  10. arXiv:2104.08116  [pdf, other

    cs.CL

    Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social Media

    Authors: Paul Röttger, Janet B. Pierrehumbert

    Abstract: Language use differs between domains and even within a domain, language use changes over time. For pre-trained language models like BERT, domain adaptation through continued pre-training has been shown to improve performance on in-domain downstream tasks. In this article, we investigate whether temporal adaptation can bring additional benefits. For this purpose, we introduce a corpus of social med… ▽ More

    Submitted 8 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted at EMNLP 2021 (Findings)

  11. arXiv:2101.00403  [pdf, other

    cs.CL

    Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words

    Authors: Valentin Hofmann, Janet B. Pierrehumbert, Hinrich Schütze

    Abstract: How does the input segmentation of pretrained language models (PLMs) affect their interpretations of complex words? We present the first study investigating this question, taking BERT as the example PLM and focusing on its semantic representations of English derivatives. We show that PLMs can be interpreted as serial dual-route models, i.e., the meanings of complex words are either stored or else… ▽ More

    Submitted 2 June, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: ACL 2021

  12. HateCheck: Functional Tests for Hate Speech Detection Models

    Authors: Paul Röttger, Bertram Vidgen, Dong Nguyen, Zeerak Waseem, Helen Margetts, Janet B. Pierrehumbert

    Abstract: Detecting online hate is a difficult task that even state-of-the-art models struggle with. Typically, hate speech detection models are evaluated by measuring their performance on held-out test data using metrics such as accuracy and F1 score. However, this approach makes it difficult to identify specific model weak points. It also risks overestimating generalisable model performance due to increas… ▽ More

    Submitted 27 May, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: Accepted at ACL 2021 (Main Conference)

  13. arXiv:2010.12684  [pdf, other

    cs.CL

    Dynamic Contextualized Word Embeddings

    Authors: Valentin Hofmann, Janet B. Pierrehumbert, Hinrich Schütze

    Abstract: Static word embeddings that represent words by a single vector cannot capture the variability of word meaning in different linguistic and extralinguistic contexts. Building on prior work on contextualized and dynamic word embeddings, we introduce dynamic contextualized word embeddings that represent words as a function of both linguistic and extralinguistic context. Based on a pretrained language… ▽ More

    Submitted 8 June, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: ACL 2021

  14. arXiv:2005.00672  [pdf, other

    cs.CL

    DagoBERT: Generating Derivational Morphology with a Pretrained Language Model

    Authors: Valentin Hofmann, Janet B. Pierrehumbert, Hinrich Schütze

    Abstract: Can pretrained language models (PLMs) generate derivationally complex words? We present the first study investigating this question, taking BERT as the example PLM. We examine BERT's derivational capabilities in different settings, ranging from using the unmodified pretrained model to full finetuning. Our best model, DagoBERT (Derivationally and generatively optimized BERT), clearly outperforms th… ▽ More

    Submitted 7 October, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

  15. arXiv:1408.1985  [pdf, ps, other

    cs.CL nlin.AO physics.soc-ph

    A model of grassroots changes in linguistic systems

    Authors: Janet B. Pierrehumbert, Forrest Stonedahl, Robert Daland

    Abstract: Linguistic norms emerge in human communities because people imitate each other. A shared linguistic system provides people with the benefits of shared knowledge and coordinated planning. Once norms are in place, why would they ever change? This question, echoing broad questions in the theory of social dynamics, has particular force in relation to language. By definition, an innovator is in the min… ▽ More

    Submitted 8 August, 2014; originally announced August 2014.

    Comments: 30 pages, 7 figures

  16. arXiv:1009.3321  [pdf, other

    cs.CL cond-mat.dis-nn nlin.AO physics.soc-ph q-bio.PE

    Niche as a determinant of word fate in online groups

    Authors: Eduardo G. Altmann, Janet B. Pierrehumbert, Adilson E. Motter

    Abstract: Patterns of word use both reflect and influence a myriad of human activities and interactions. Like other entities that are reproduced and evolve, words rise or decline depending upon a complex interplay between {their intrinsic properties and the environments in which they function}. Using Internet discussion communities as model systems, we define the concept of a word niche as the relationship… ▽ More

    Submitted 2 June, 2011; v1 submitted 16 September, 2010; originally announced September 2010.

    Comments: Supporting Information is available here: http://www.plosone.org/article/fetchSingleRepresentation.action?uri=info:doi/10.1371/journal.pone.0019009.s001

    Journal ref: PLoS ONE 6(5), e19009 (2011)

  17. arXiv:0901.2349  [pdf, other

    cs.CL cond-mat.dis-nn physics.data-an physics.soc-ph

    Beyond word frequency: Bursts, lulls, and scaling in the temporal distributions of words

    Authors: Eduardo G. Altmann, Janet B. Pierrehumbert, Adilson E. Motter

    Abstract: Background: Zipf's discovery that word frequency distributions obey a power law established parallels between biological and physical processes, and language, laying the groundwork for a complex systems perspective on human communication. More recent research has also identified scaling regularities in the dynamics underlying the successive occurrences of events, suggesting the possibility of si… ▽ More

    Submitted 11 November, 2009; v1 submitted 15 January, 2009; originally announced January 2009.

    Journal ref: PLoS ONE 4 (11): e7678 (2009)