Skip to main content

Showing 1–3 of 3 results for author: McGovern, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14057  [pdf, other

    cs.CL cs.AI

    Your Large Language Models Are Leaving Fingerprints

    Authors: Hope McGovern, Rickard Stureborg, Yoshi Suhara, Dimitris Alikaniotis

    Abstract: It has been shown that finetuned transformers and other supervised detectors effectively distinguish between human and machine-generated text in some situations arXiv:2305.13242, but we find that even simple classifiers on top of n-gram and part-of-speech features can achieve very robust performance on both in- and out-of-domain data. To understand how this is possible, we analyze machine-generate… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2311.08886  [pdf, other

    cs.CL

    CLIMB: Curriculum Learning for Infant-inspired Model Building

    Authors: Richard Diehl Martinez, Zebulon Goriely, Hope McGovern, Christopher Davis, Andrew Caines, Paula Buttery, Lisa Beinborn

    Abstract: We describe our team's contribution to the STRICT-SMALL track of the BabyLM Challenge. The challenge requires training a language model from scratch using only a relatively small training dataset of ten million words. We experiment with three variants of cognitively-motivated curriculum learning and analyze their effect on the performance of the model on linguistic evaluation tasks. In the vocabul… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  3. arXiv:2106.13382  [pdf, other

    cs.CL cs.LG

    A Source-Criticism Debiasing Method for GloVe Embeddings

    Authors: Hope McGovern

    Abstract: It is well-documented that word embeddings trained on large public corpora consistently exhibit known human social biases. Although many methods for debiasing exist, almost all fixate on completely eliminating biased information from the embeddings and often diminish training set size in the process. In this paper, we present a simple yet effective method for debiasing GloVe word embeddings (Penni… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.