Skip to main content

Showing 1–4 of 4 results for author: Kavumba, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.09295  [pdf, other

    cs.CL

    Are Prompt-based Models Clueless?

    Authors: Pride Kavumba, Ryo Takahashi, Yusuke Oda

    Abstract: Finetuning large pre-trained language models with a task-specific head has advanced the state-of-the-art on many natural language understanding benchmarks. However, models with a task-specific head require a lot of training data, making them susceptible to learning and exploiting dataset-specific superficial cues that do not generalize to other datasets. Prompting has reduced the data requirement… ▽ More

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

  2. arXiv:2201.06777  [pdf, other

    cs.CL

    COPA-SSE: Semi-structured Explanations for Commonsense Reasoning

    Authors: Ana Brassard, Benjamin Heinzerling, Pride Kavumba, Kentaro Inui

    Abstract: We present Semi-Structured Explanations for COPA (COPA-SSE), a new crowdsourced dataset of 9,747 semi-structured, English common sense explanations for Choice of Plausible Alternatives (COPA) questions. The explanations are formatted as a set of triple-like common sense statements with ConceptNet relations but freely written concepts. This semi-structured format strikes a balance between the high… ▽ More

    Submitted 11 May, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: 6 pages, 6 figures, LREC 2022. Data available at https://github.com/a-brassard/copa-sse

  3. arXiv:2104.11514  [pdf, other

    cs.CL

    Learning to Learn to be Right for the Right Reasons

    Authors: Pride Kavumba, Benjamin Heinzerling, Ana Brassard, Kentaro Inui

    Abstract: Improving model generalization on held-out data is one of the core objectives in commonsense reasoning. Recent work has shown that models trained on the dataset with superficial cues tend to perform well on the easy test set with superficial cues but perform poorly on the hard test set without superficial cues. Previous approaches have resorted to manual methods of encouraging models not to overfi… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  4. arXiv:1911.00225  [pdf, other

    cs.CL

    When Choosing Plausible Alternatives, Clever Hans can be Clever

    Authors: Pride Kavumba, Naoya Inoue, Benjamin Heinzerling, Keshav Singh, Paul Reisert, Kentaro Inui

    Abstract: Pretrained language models, such as BERT and RoBERTa, have shown large improvements in the commonsense reasoning benchmark COPA. However, recent work found that many improvements in benchmarks of natural language understanding are not due to models learning the task, but due to their increasing ability to exploit superficial cues, such as tokens that occur more often in the correct answer than the… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: Accepted to the COmmonsense INference in Natural Language Processing workshop (COIN)