Showing 1–6 of 6 results for author: Utama, P A

  1. arXiv:2205.06009  [pdf, other]

    cs.CL

    Falsesum: Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization

    Authors: Prasetya Ajie Utama, Joshua Bambrick, Nafise Sadat Moosavi, Iryna Gurevych

    Abstract: Neural abstractive summarization models are prone to generate summaries which are factually inconsistent with their source documents. Previous work has introduced the task of recognizing such factual inconsistency as a downstream application of natural language inference (NLI). However, state-of-the-art NLI models perform poorly in this context due to their inability to generalize to the target ta…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted to appear at NAACL 2022

  2. arXiv:2109.04144  [pdf, other]

    cs.CL cs.AI

    Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning

    Authors: Prasetya Ajie Utama, Nafise Sadat Moosavi, Victor Sanh, Iryna Gurevych

    Abstract: Recent prompt-based approaches allow pretrained language models to achieve strong performances on few-shot finetuning by reformulating downstream tasks as a language modeling problem. In this work, we demonstrate that, despite its advantages on low data regimes, finetuned prompt-based models for sentence pair classification tasks still suffer from a common pitfall of adopting inference heuristics…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021

  3. arXiv:2010.12510  [pdf, other]

    cs.CL

    Improving Robustness by Augmenting Training Sentences with Predicate-Argument Structures

    Authors: Nafise Sadat Moosavi, Marcel de Boer, Prasetya Ajie Utama, Iryna Gurevych

    Abstract: Existing NLP datasets contain various biases, and models tend to quickly learn those biases, which in turn limits their robustness. Existing approaches to improve robustness against dataset biases mostly focus on changing the training objective so that models learn less from biased examples. Besides, they mostly focus on addressing a specific bias, and while they improve the performance on adversa…

    Submitted 23 October, 2020; originally announced October 2020.

  4. arXiv:2009.12303  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Debiasing NLU Models from Unknown Biases

    Authors: Prasetya Ajie Utama, Nafise Sadat Moosavi, Iryna Gurevych

    Abstract: NLU models often exploit biases to achieve high dataset-specific performance without properly learning the intended task. Recently proposed debiasing methods are shown to be effective in mitigating this tendency. However, these methods rely on a major assumption that the types of bias should be known a-priori, which limits their application to many NLU tasks and datasets. In this work, we present…

    Submitted 13 October, 2020; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: Accepted at EMNLP 2020

  5. arXiv:2005.00315  [pdf, other]

    cs.CL

    Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance

    Authors: Prasetya Ajie Utama, Nafise Sadat Moosavi, Iryna Gurevych

    Abstract: Models for natural language understanding (NLU) tasks often rely on the idiosyncratic biases of the dataset, which make them brittle against test cases outside the training distribution. Recently, several proposed debiasing methods are shown to be very effective in improving out-of-distribution performance. However, their improvements come at the expense of performance drop when models are evaluat…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: To appear at ACL 2020

  6. arXiv:1909.08940  [pdf, ps, other]

    cs.CL

    Improving Generalization by Incorporating Coverage in Natural Language Inference

    Authors: Nafise Sadat Moosavi, Prasetya Ajie Utama, Andreas Rücklé, Iryna Gurevych

    Abstract: The task of natural language inference (NLI) is to identify the relation between the given premise and hypothesis. While recent NLI models achieve very high performance on individual datasets, they fail to generalize across similar datasets. This indicates that they are solving NLI datasets instead of the task itself. In order to improve generalization, we propose to extend the input representatio…

    Submitted 19 September, 2019; originally announced September 2019.