Skip to main content

Showing 1–4 of 4 results for author: Feucht, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.20086  [pdf, other

    cs.CL cs.LG

    Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs

    Authors: Sheridan Feucht, David Atkinson, Byron Wallace, David Bau

    Abstract: LLMs process text as sequences of tokens that roughly correspond to words, where less common words are represented by multiple tokens. However, individual tokens are often semantically unrelated to the meanings of the words/concepts they comprise. For example, Llama-2-7b's tokenizer splits the word "northeastern" into the tokens ['_n', 'ort', 'he', 'astern'], none of which correspond to semantical… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 13 pages, 14 figures. Code and data at https://footprints.baulab.info/

    ACM Class: I.2.7

  2. arXiv:2310.09612  [pdf, other

    cs.CV cs.AI

    Deep Neural Networks Can Learn Generalizable Same-Different Visual Relations

    Authors: Alexa R. Tartaglini, Sheridan Feucht, Michael A. Lepori, Wai Keen Vong, Charles Lovering, Brenden M. Lake, Ellie Pavlick

    Abstract: Although deep neural networks can achieve human-level performance on many object recognition benchmarks, prior work suggests that these same models fail to learn simple abstract relations, such as determining whether two objects are the same or different. Much of this prior work focuses on training convolutional neural networks to classify images of two same or two different abstract shapes, testi… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  3. arXiv:2205.15661  [pdf, other

    cs.CL

    NEWTS: A Corpus for News Topic-Focused Summarization

    Authors: Seyed Ali Bahrainian, Sheridan Feucht, Carsten Eickhoff

    Abstract: Text summarization models are approaching human levels of fidelity. Existing benchmarking corpora provide concordant pairs of full and abridged versions of Web, news or, professional content. To date, all summarization datasets operate under a one-size-fits-all paradigm that may not reflect the full range of organic summarization needs. Several recently proposed models (e.g., plug and play languag… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

  4. arXiv:2111.05940  [pdf, other

    cs.CL

    A Novel Corpus of Discourse Structure in Humans and Computers

    Authors: Babak Hemmatian, Sheridan Feucht, Rachel Avram, Alexander Wey, Muskaan Garg, Kate Spitalnic, Carsten Eickhoff, Ellie Pavlick, Bjorn Sandstede, Steven Sloman

    Abstract: We present a novel corpus of 445 human- and computer-generated documents, comprising about 27,000 clauses, annotated for semantic clause types and coherence relations that allow for nuanced comparison of artificial and natural discourse modes. The corpus covers both formal and informal discourse, and contains documents generated using fine-tuned GPT-2 (Zellers et al., 2019) and GPT-3(Brown et al.,… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: In the 2nd Workshop on Computational Approaches to Discourse (CODI) at EMNLP 2021 (extended abstract). 3 pages