Skip to main content

Showing 1–12 of 12 results for author: Logan, R L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2212.09955  [pdf, other

    cs.CL

    BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics

    Authors: Liang Ma, Shuyang Cao, Robert L. Logan IV, Di Lu, Shihao Ran, Ke Zhang, Joel Tetreault, Alejandro Jaimes

    Abstract: The proliferation of automatic faithfulness metrics for summarization has produced a need for benchmarks to evaluate them. While existing benchmarks measure the correlation with human judgements of faithfulness on model-generated summaries, they are insufficient for diagnosing whether metrics are: 1) consistent, i.e., indicate lower faithfulness as errors are introduced into a summary, 2) effectiv… ▽ More

    Submitted 4 June, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted as a long main conference paper at ACL 2023

  2. arXiv:2210.10258  [pdf, other

    cs.CL

    Continued Pretraining for Better Zero- and Few-Shot Promptability

    Authors: Zhaofeng Wu, Robert L. Logan IV, Pete Walsh, Akshita Bhagia, Dirk Groeneveld, Sameer Singh, Iz Beltagy

    Abstract: Recently introduced language model prompting methods can achieve high accuracy in zero- and few-shot settings while requiring few to no learned task-specific parameters. Nevertheless, these methods still often trail behind full model finetuning. In this work, we investigate if a dedicated continued pretraining stage could improve "promptability", i.e., zero-shot performance with natural language p… ▽ More

    Submitted 20 October, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  3. arXiv:2202.07206  [pdf, other

    cs.CL cs.LG

    Impact of Pretraining Term Frequencies on Few-Shot Reasoning

    Authors: Yasaman Razeghi, Robert L. Logan IV, Matt Gardner, Sameer Singh

    Abstract: Pretrained Language Models (LMs) have demonstrated ability to perform numerical reasoning by extrapolating from a few examples in few-shot settings. However, the extent to which this extrapolation relies on robust reasoning is unclear. In this paper, we investigate how well these models reason with terms that are less frequent in the pretraining data. In particular, we examine the correlations bet… ▽ More

    Submitted 23 May, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  4. arXiv:2112.08634  [pdf, other

    cs.CL

    FRUIT: Faithfully Reflecting Updated Information in Text

    Authors: Robert L. Logan IV, Alexandre Passos, Sameer Singh, Ming-Wei Chang

    Abstract: Textual knowledge bases such as Wikipedia require considerable effort to keep up to date and consistent. While automated writing assistants could potentially ease this burden, the problem of suggesting edits grounded in external knowledge has been under-explored. In this paper, we introduce the novel generation task of *faithfully reflecting updated information in text* (FRUIT) where the goal is t… ▽ More

    Submitted 13 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: v2.0, NAACL 2022

  5. arXiv:2106.13353  [pdf, other

    cs.CL cs.LG

    Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models

    Authors: Robert L. Logan IV, Ivana Balažević, Eric Wallace, Fabio Petroni, Sameer Singh, Sebastian Riedel

    Abstract: Prompting language models (LMs) with training examples and task descriptions has been seen as critical to recent successes in few-shot learning. In this work, we show that finetuning LMs in the few-shot setting can considerably reduce the need for prompt engineering. In fact, one can use null prompts, prompts that contain neither task-specific templates nor training examples, and achieve competiti… ▽ More

    Submitted 1 July, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

  6. arXiv:2010.15980  [pdf, other

    cs.CL cs.LG

    AutoPrompt: Eliciting Knowledge from Language Models with Automatically Generated Prompts

    Authors: Taylor Shin, Yasaman Razeghi, Robert L. Logan IV, Eric Wallace, Sameer Singh

    Abstract: The remarkable success of pretrained language models has motivated the study of what kinds of knowledge these models learn during pretraining. Reformulating tasks as fill-in-the-blanks problems (e.g., cloze tests) is a natural approach for gauging such knowledge, however, its usage is limited by the manual effort and guesswork required to write suitable prompts. To address this, we develop AutoPro… ▽ More

    Submitted 7 November, 2020; v1 submitted 29 October, 2020; originally announced October 2020.

    Comments: v2: Fixed error in Figure 2

  7. arXiv:2010.06694  [pdf, other

    cs.HC

    Easy, Reproducible and Quality-Controlled Data Collection with Crowdaq

    Authors: Qiang Ning, Hao Wu, Pradeep Dasigi, Dheeru Dua, Matt Gardner, Robert L. Logan IV, Ana Marasovic, Zhen Nie

    Abstract: High-quality and large-scale data are key to success for AI systems. However, large-scale data annotation efforts are often confronted with a set of common challenges: (1) designing a user-friendly annotation interface; (2) training enough annotators efficiently; and (3) reproducibility. To address these problems, we introduce Crowdaq, an open-source platform that standardizes the data collection… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to the demo track of EMNLP 2020

  8. arXiv:2002.06532  [pdf, other

    stat.ML cs.LG

    Active Bayesian Assessment for Black-Box Classifiers

    Authors: Disi Ji, Robert L. Logan IV, Padhraic Smyth, Mark Steyvers

    Abstract: Recent advances in machine learning have led to increased deployment of black-box classifiers across a wide variety of applications. In many such situations there is a critical need to both reliably assess the performance of these pre-trained models and to perform this assessment in a label-efficient manner (given that labels may be scarce and costly to collect). In this paper, we introduce an act… ▽ More

    Submitted 15 March, 2021; v1 submitted 16 February, 2020; originally announced February 2020.

  9. arXiv:1909.04164  [pdf, other

    cs.CL

    Knowledge Enhanced Contextual Word Representations

    Authors: Matthew E. Peters, Mark Neumann, Robert L. Logan IV, Roy Schwartz, Vidur Joshi, Sameer Singh, Noah A. Smith

    Abstract: Contextual word representations, typically trained on unstructured, unlabeled text, do not contain any explicit grounding to real world entities and are often unable to remember facts about those entities. We propose a general method to embed multiple knowledge bases (KBs) into large scale models, and thereby enhance their representations with structured, human-curated knowledge. For each KB, we f… ▽ More

    Submitted 30 October, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  10. arXiv:1906.07241  [pdf, other

    cs.CL

    Barack's Wife Hillary: Using Knowledge-Graphs for Fact-Aware Language Modeling

    Authors: Robert L. Logan IV, Nelson F. Liu, Matthew E. Peters, Matt Gardner, Sameer Singh

    Abstract: Modeling human language requires the ability to not only generate fluent text but also encode factual knowledge. However, traditional language models are only capable of remembering facts seen at training time, and often have difficulty recalling them. To address this, we introduce the knowledge graph language model (KGLM), a neural language model with mechanisms for selecting and copying facts fr… ▽ More

    Submitted 20 June, 2019; v1 submitted 17 June, 2019; originally announced June 2019.

  11. arXiv:1904.03111  [pdf, other

    cs.CL

    PoMo: Generating Entity-Specific Post-Modifiers in Context

    Authors: Jun Seok Kang, Robert L. Logan IV, Zewei Chu, Yang Chen, Dheeru Dua, Kevin Gimpel, Sameer Singh, Niranjan Balasubramanian

    Abstract: We introduce entity post-modifier generation as an instance of a collaborative writing task. Given a sentence about a target entity, the task is to automatically generate a post-modifier phrase that provides contextually relevant information about the entity. For example, for the sentence, "Barack Obama, _______, supported the #MeToo movement.", the phrase "a father of two girls" is a contextually… ▽ More

    Submitted 8 April, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: NAACL-HLT 2019

  12. arXiv:1711.11118  [pdf, other

    cs.CL

    Multimodal Attribute Extraction

    Authors: Robert L. Logan IV, Samuel Humeau, Sameer Singh

    Abstract: The broad goal of information extraction is to derive structured information from unstructured data. However, most existing methods focus solely on text, ignoring other types of unstructured data such as images, video and audio which comprise an increasing portion of the information on the web. To address this shortcoming, we propose the task of multimodal attribute extraction. Given a collection… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

    Comments: AKBC 2017 Workshop Paper