Skip to main content

Showing 1–10 of 10 results for author: Petty, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.08819  [pdf, other

    cs.LG cs.CC cs.CL cs.FL

    The Illusion of State in State-Space Models

    Authors: William Merrill, Jackson Petty, Ashish Sabharwal

    Abstract: State-space models (SSMs) have emerged as a potential alternative architecture for building large language models (LLMs) compared to the previously ubiquitous transformer architecture. One theoretical weakness of transformers is that they cannot express certain kinds of sequential computation and state tracking (Merrill & Sabharwal, 2023), which SSMs are explicitly designed to address via their cl… ▽ More

    Submitted 4 June, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: To appear at ICML 2024. 9 pages + appendices

  2. arXiv:2311.12022  [pdf, other

    cs.AI cs.CL

    GPQA: A Graduate-Level Google-Proof Q&A Benchmark

    Authors: David Rein, Betty Li Hou, Asa Cooper Stickland, Jackson Petty, Richard Yuanzhe Pang, Julien Dirani, Julian Michael, Samuel R. Bowman

    Abstract: We present GPQA, a challenging dataset of 448 multiple-choice questions written by domain experts in biology, physics, and chemistry. We ensure that the questions are high-quality and extremely difficult: experts who have or are pursuing PhDs in the corresponding domains reach 65% accuracy (74% when discounting clear mistakes the experts identified in retrospect), while highly skilled non-expert v… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 28 pages, 5 figures, 7 tables

  3. arXiv:2311.08702  [pdf, other

    cs.AI cs.CL

    Debate Helps Supervise Unreliable Experts

    Authors: Julian Michael, Salsabila Mahdi, David Rein, Jackson Petty, Julien Dirani, Vishakh Padmakumar, Samuel R. Bowman

    Abstract: As AI systems are used to answer more difficult questions and potentially help create new knowledge, judging the truthfulness of their outputs becomes more difficult and more important. How can we supervise unreliable experts, which have access to the truth but may not accurately report it, to give answers that are systematically true and don't just superficially seem true, when the supervisor can… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 84 pages, 13 footnotes, 5 figures, 4 tables, 28 debate transcripts; data and code at https://github.com/julianmichael/debate/tree/2023-nyu-experiments

    ACM Class: I.2.0

  4. arXiv:2311.07811  [pdf, other

    cs.CL

    In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax

    Authors: Aaron Mueller, Albert Webson, Jackson Petty, Tal Linzen

    Abstract: In-context learning (ICL) is now a common method for teaching large language models (LLMs) new tasks: given labeled examples in the input context, the LLM learns to perform the task without weight updates. Do models guided via ICL infer the underlying structure of the task defined by the context, or do they rely on superficial heuristics that only generalize to identically distributed examples? We… ▽ More

    Submitted 10 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024

  5. arXiv:2311.04900  [pdf, other

    cs.CL

    How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure

    Authors: Michael Wilson, Jackson Petty, Robert Frank

    Abstract: Language models are typically evaluated on their success at predicting the distribution of specific words in specific contexts. Yet linguistic knowledge also encodes relationships between contexts, allowing inferences between word distributions. We investigate the degree to which pre-trained Transformer-based large language models (LLMs) represent such relationships, focusing on the domain of argu… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted to TACL; Presented at EMNLP 2023

  6. arXiv:2310.19956  [pdf, other

    cs.CL

    The Impact of Depth on Compositional Generalization in Transformer Language Models

    Authors: Jackson Petty, Sjoerd van Steenkiste, Ishita Dasgupta, Fei Sha, Dan Garrette, Tal Linzen

    Abstract: To process novel sentences, language models (LMs) must generalize compositionally -- combine familiar elements in new ways. What aspects of a model's structure promote compositional generalization? Focusing on transformers, we test the hypothesis, motivated by theoretical and empirical work, that deeper transformers generalize more compositionally. Simply adding layers increases the total number o… ▽ More

    Submitted 10 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to NAACL 2024

  7. arXiv:2212.10003  [pdf, other

    cs.CL

    (QA)$^2$: Question Answering with Questionable Assumptions

    Authors: Najoung Kim, Phu Mon Htut, Samuel R. Bowman, Jackson Petty

    Abstract: Naturally occurring information-seeking questions often contain questionable assumptions -- assumptions that are false or unverifiable. Questions containing questionable assumptions are challenging because they require a distinct answer strategy that deviates from typical answers for information-seeking questions. For instance, the question "When did Marie Curie discover Uranium?" cannot be answer… ▽ More

    Submitted 29 August, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 camera-ready

  8. arXiv:2202.03611  [pdf, other

    cs.CL

    Do Language Models Learn Position-Role Map**s?

    Authors: Jackson Petty, Michael Wilson, Robert Frank

    Abstract: How is knowledge of position-role map**s in natural language learned? We explore this question in a computational setting, testing whether a variety of well-performing pertained language models (BERT, RoBERTa, and DistilBERT) exhibit knowledge of these map**s, and whether this knowledge persists across alternations in syntactic, structural, and lexical alternations. In Experiment 1, we show th… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

    Comments: To appear in the BUCLD 46 Proceedings

  9. arXiv:2109.12036  [pdf, other

    cs.CL

    Transformers Generalize Linearly

    Authors: Jackson Petty, Robert Frank

    Abstract: Natural language exhibits patterns of hierarchically governed dependencies, in which relations between words are sensitive to syntactic structure rather than linear ordering. While re-current network models often fail to generalize in a hierarchically sensitive way (McCoy et al.,2020) when trained on ambiguous data, the improvement in performance of newer Trans-former language models (Vaswani et a… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

  10. arXiv:2011.00682  [pdf, other

    cs.CL

    Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora

    Authors: Robert Frank, Jackson Petty

    Abstract: Reflexive anaphora present a challenge for semantic interpretation: their meaning varies depending on context in a way that appears to require abstract variables. Past work has raised doubts about the ability of recurrent networks to meet this challenge. In this paper, we explore this question in the context of a fragment of English that incorporates the relevant sort of contextual variability. We… ▽ More

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: 10 pages, 4 figures, 3 tables, accepted at CRAC 2020