Showing 1–26 of 26 results for author: Camburu, O

  1. arXiv:2404.03189  [pdf, other]

    cs.CL cs.AI

    The Probabilities Also Matter: A More Faithful Metric for Faithfulness of Free-Text Explanations in Large Language Models

    Authors: Noah Y. Siegel, Oana-Maria Camburu, Nicolas Heess, Maria Perez-Ortiz

    Abstract: In order to oversee advanced AI systems, it is important to understand their underlying decision-making process. When prompted, large language models (LLMs) can provide natural language explanations or reasoning traces that sound plausible and receive high ratings from human annotators. However, it is unclear to what extent these explanations are faithful, i.e., truly capture the factors responsib…

    Submitted 7 June, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: To be published in ACL 2024. 19 pages, 2 figures

  2. arXiv:2311.08968  [pdf, other]

    cs.CL cs.AI

    Identifying Linear Relational Concepts in Large Language Models

    Authors: David Chanin, Anthony Hunter, Oana-Maria Camburu

    Abstract: Transformer language models (LMs) have been shown to represent concepts as directions in the latent space of hidden activations. However, for any human-interpretable concept, how can we find its direction in the latent space? We present a technique called linear relational concepts (LRC) for finding concept directions corresponding to human-interpretable concepts by first modeling the relation bet…

    Submitted 29 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: To be published in NAACL 2024

  3. arXiv:2311.07556  [pdf, other]

    cs.CL

    Using Natural Language Explanations to Improve Robustness of In-context Learning

    Authors: Xuanli He, Yuxiang Wu, Oana-Maria Camburu, Pasquale Minervini, Pontus Stenetorp

    Abstract: Recent studies demonstrated that large language models (LLMs) can excel in many tasks via in-context learning (ICL). However, recent works show that ICL-prompted models tend to produce inaccurate results when presented with adversarial inputs. In this work, we investigate whether augmenting ICL with natural language explanations (NLEs) improves the robustness of LLMs on adversarial datasets coveri…

    Submitted 20 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Accepted to ACL 2024 (main)

  4. arXiv:2306.02980  [pdf, other]

    cs.CL cs.AI

    KNOW How to Make Up Your Mind! Adversarially Detecting and Alleviating Inconsistencies in Natural Language Explanations

    Authors: Myeongjun Jang, Bodhisattwa Prasad Majumder, Julian McAuley, Thomas Lukasiewicz, Oana-Maria Camburu

    Abstract: While recent works have been considerably improving the quality of the natural language explanations (NLEs) generated by a model to justify its predictions, there is very limited research in detecting and alleviating inconsistencies among generated NLEs. In this work, we leverage external knowledge bases to significantly improve on an existing adversarial attack for detecting inconsistent NLEs. We…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Short paper, ACL 2023

    Journal ref: The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)

  5. arXiv:2305.18029  [pdf, other]

    cs.CL cs.AI

    Faithfulness Tests for Natural Language Explanations

    Authors: Pepa Atanasova, Oana-Maria Camburu, Christina Lioma, Thomas Lukasiewicz, Jakob Grue Simonsen, Isabelle Augenstein

    Abstract: Explanations of neural models aim to reveal a model's decision-making process for its predictions. However, recent work shows that current methods giving explanations such as saliency maps or counterfactuals can be misleading, as they are prone to present reasons that are unfaithful to the model's inner workings. This work explores the challenging question of evaluating the faithfulness of natural…

    Submitted 30 June, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: Short paper, ACL 2023

    MSC Class: 68T50

    ACM Class: I.2.7

    Journal ref: The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)

  6. arXiv:2305.13235  [pdf, other]

    cs.CL cs.AI

    SPARSEFIT: Few-shot Prompting with Sparse Fine-tuning for Jointly Generating Predictions and Natural Language Explanations

    Authors: Jesus Solano, Oana-Maria Camburu, Pasquale Minervini

    Abstract: Explaining the decisions of neural models is crucial for ensuring their trustworthiness at deployment time. Using Natural Language Explanations (NLEs) to justify a model's predictions has recently gained increasing interest. However, this approach usually demands large datasets of human-written NLEs for the ground-truth answers, which are expensive and potentially infeasible for some applications.…

    Submitted 23 May, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  7. arXiv:2305.13214  [pdf, other]

    cs.CL

    Logical Reasoning for Natural Language Inference Using Generated Facts as Atoms

    Authors: Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei

    Abstract: State-of-the-art neural models can now reach human performance levels across various natural language understanding tasks. However, despite this impressive performance, models are known to learn from annotation artefacts at the expense of the underlying task. While interpretability methods can identify influential features for each prediction, there are no guarantees that these features are respon…

    Submitted 22 May, 2023; originally announced May 2023.

    ACM Class: I.2.7

  8. arXiv:2302.05674  [pdf, other]

    cs.CL cs.AI

    Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns

    Authors: Zhongbin Xie, Vid Kocijan, Thomas Lukasiewicz, Oana-Maria Camburu

    Abstract: Bias-measuring datasets play a critical role in detecting biased behavior of language models and in evaluating progress of bias mitigation methods. In this work, we focus on evaluating gender bias through coreference resolution, where previous datasets are either hand-crafted or fail to reliably measure an explicitly defined bias. To overcome these shortcomings, we propose a novel method to collec…

    Submitted 11 February, 2023; originally announced February 2023.

    Comments: Long Paper at EACL 2023

  9. Rationalizing Predictions by Adversarial Information Calibration

    Authors: Lei Sha, Oana-Maria Camburu, Thomas Lukasiewicz

    Abstract: Explaining the predictions of AI models is paramount in safety-critical applications, such as in legal or medical domains. One form of explanation for a prediction is an extractive rationale, i.e., a subset of features of an instance that lead the model to give its prediction on that instance. For example, the subphrase "he stole the mobile phone" can be an extractive rationale for the predictio…

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.08884

    Journal ref: Artificial Intelligence, Volume 315, February 2023

  10. arXiv:2207.04343  [pdf, other]

    cs.CV cs.AI cs.CL

    Explaining Chest X-ray Pathologies in Natural Language

    Authors: Maxime Kayser, Cornelius Emde, Oana-Maria Camburu, Guy Parsons, Bartlomiej Papiez, Thomas Lukasiewicz

    Abstract: Most deep learning algorithms lack explanations for their predictions, which limits their deployment in clinical practice. Approaches to improve explainability, especially in medical imaging, have often been shown to convey limited information, be overly reassuring, or lack robustness. In this work, we introduce the task of generating natural language explanations (NLEs) to justify predictions mad…

    Submitted 9 July, 2022; originally announced July 2022.

    Journal ref: MICCAI 2022

  11. arXiv:2112.06204  [pdf, other]

    cs.CL

    Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations in a Label-Abundant Setup

    Authors: Yordan Yordanov, Vid Kocijan, Thomas Lukasiewicz, Oana-Maria Camburu

    Abstract: Training a model to provide natural language explanations (NLEs) for its predictions usually requires the acquisition of task-specific NLEs, which is time- and resource-consuming. A potential solution is the few-shot out-of-domain transfer of NLEs from a parent task with many NLEs to a child task. In this work, we examine the setup in which the child task has few NLEs but abundant labels. We estab…

    Submitted 22 October, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

    Comments: Accepted to the EMNLP Findings 2022

    ACM Class: I.2.7

  12. arXiv:2106.13876  [pdf, other]

    cs.CL cs.AI cs.LG

    Knowledge-Grounded Self-Rationalization via Extractive and Natural Language Explanations

    Authors: Bodhisattwa Prasad Majumder, Oana-Maria Camburu, Thomas Lukasiewicz, Julian McAuley

    Abstract: Models that generate extractive rationales (i.e., subsets of features) or natural language explanations (NLEs) for their predictions are important for explainable AI. While an extractive rationale provides a quick view of the features most responsible for a prediction, an NLE allows for a comprehensive description of the decision-making process behind a prediction. However, current models that gen…

    Submitted 16 September, 2022; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Accepted in ICML 2022 as a spotlight

  13. arXiv:2105.03761  [pdf, other]

    cs.CV cs.CL cs.LG

    e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks

    Authors: Maxime Kayser, Oana-Maria Camburu, Leonard Salewski, Cornelius Emde, Virginie Do, Zeynep Akata, Thomas Lukasiewicz

    Abstract: Recently, there has been an increasing number of efforts to introduce models capable of generating natural language explanations (NLEs) for their predictions on vision-language (VL) tasks. Such models are appealing, because they can provide human-friendly and comprehensive explanations. However, there is a lack of comparison between existing methods, which is due to a lack of re-usable evaluation…

    Submitted 18 August, 2021; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: Accepted at ICCV 2021 (camera-ready version)

  14. arXiv:2012.08884  [pdf, other]

    cs.CL cs.AI cs.LG

    Learning from the Best: Rationalizing Prediction by Adversarial Information Calibration

    Authors: Lei Sha, Oana-Maria Camburu, Thomas Lukasiewicz

    Abstract: Explaining the predictions of AI models is paramount in safety-critical applications, such as in legal or medical domains. One form of explanation for a prediction is an extractive rationale, i.e., a subset of features of an instance that lead the model to give its prediction on the instance. Previous works on generating extractive rationales usually employ a two-phase model: a selector that selec…

    Submitted 18 December, 2020; v1 submitted 16 December, 2020; originally announced December 2020.

    Journal ref: Proceedings of the 35th AAAI Conference on Artificial Intelligence, 2021

  15. arXiv:2011.01837  [pdf, other]

    cs.CL

    The Gap on GAP: Tackling the Problem of Differing Data Distributions in Bias-Measuring Datasets

    Authors: Vid Kocijan, Oana-Maria Camburu, Thomas Lukasiewicz

    Abstract: Diagnostic datasets that can detect biased models are an important prerequisite for bias reduction within natural language processing. However, undesired patterns in the collected data can make such tests incorrect. For example, if the feminine subset of a gender-bias-measuring coreference resolution dataset contains sentences with a longer average distance between the pronoun and the correct cand…

    Submitted 15 December, 2020; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted to AAAI 2021 conference and AFCI workshop at NeurIPS 2020 conference

    Journal ref: AAAI 2021

  16. arXiv:2010.02570  [pdf, other]

    cs.CL

    Does the Objective Matter? Comparing Training Objectives for Pronoun Resolution

    Authors: Yordan Yordanov, Oana-Maria Camburu, Vid Kocijan, Thomas Lukasiewicz

    Abstract: Hard cases of pronoun resolution have been used as a long-standing benchmark for commonsense reasoning. In the recent literature, pre-trained language models have been used to obtain state-of-the-art results on pronoun resolution. Overall, four categories of training and evaluation objectives have been introduced. The variety of training datasets and pre-trained language models used in these works…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted to the EMNLP 2020 conference

    ACM Class: I.2.7

  17. arXiv:2010.01496  [pdf, other]

    cs.CL cs.AI

    Explaining Deep Neural Networks

    Authors: Oana-Maria Camburu

    Abstract: Deep neural networks are becoming more and more popular due to their revolutionary success in diverse areas, such as computer vision, natural language processing, and speech recognition. However, the decision-making processes of these models are generally not interpretable to users. In various domains, such as healthcare, finance, or law, it is critical to know the reasons behind a decision made b…

    Submitted 13 October, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: PhD Thesis, University of Oxford

  18. arXiv:2009.11023  [pdf, ps, other]

    cs.CL

    The Struggles of Feature-Based Explanations: Shapley Values vs. Minimal Sufficient Subsets

    Authors: Oana-Maria Camburu, Eleonora Giunchiglia, Jakob Foerster, Thomas Lukasiewicz, Phil Blunsom

    Abstract: For neural models to garner widespread public trust and ensure fairness, we must have human-intelligible explanations for their predictions. Recently, an increasing number of works focus on explaining the predictions of neural models in terms of the relevance of the input features. In this work, we show that feature-based explanations pose problems even for explaining trivial models. We show that,…

    Submitted 14 December, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Journal ref: Explainable Agency in Artificial Intelligence Workshop at AAAI 2021

  19. arXiv:2004.03744  [pdf, other]

    cs.CL cs.AI cs.CV

    e-SNLI-VE: Corrected Visual-Textual Entailment with Natural Language Explanations

    Authors: Virginie Do, Oana-Maria Camburu, Zeynep Akata, Thomas Lukasiewicz

    Abstract: The recently proposed SNLI-VE corpus for recognising visual-textual entailment is a large, real-world dataset for fine-grained multimodal reasoning. However, the automatic way in which SNLI-VE has been assembled (via combining parts of two related datasets) gives rise to a large number of errors in the labels of this corpus. In this paper, we first present a data collection effort to correct the c…

    Submitted 19 August, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

    Journal ref: IEEE CVPR Workshop on Fair, Data Efficient and Trusted Computer Vision, 2020

  20. arXiv:1910.03065  [pdf, ps, other]

    cs.CL cs.AI

    Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations

    Authors: Oana-Maria Camburu, Brendan Shillingford, Pasquale Minervini, Thomas Lukasiewicz, Phil Blunsom

    Abstract: To increase trust in artificial intelligence systems, a promising research direction consists of designing neural models capable of generating natural language explanations for their predictions. In this work, we show that such models are nonetheless prone to generating mutually inconsistent explanations, such as "Because there is a dog in the image" and "Because there is no dog in the [same] imag…

    Submitted 2 May, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

    Journal ref: Short Paper at ACL, 2020

  21. arXiv:1910.02065  [pdf, other]

    cs.CL cs.LG

    Can I Trust the Explainer? Verifying Post-hoc Explanatory Methods

    Authors: Oana-Maria Camburu, Eleonora Giunchiglia, Jakob Foerster, Thomas Lukasiewicz, Phil Blunsom

    Abstract: For AI systems to garner widespread public acceptance, we must develop methods capable of explaining the decisions of black-box models such as neural networks. In this work, we identify two issues of current explanatory methods. First, we show that two prevalent perspectives on explanations --- feature-additivity and feature-selection --- lead to fundamentally different instance-wise explanations.…

    Submitted 5 December, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

    Journal ref: NeurIPS 2019 Workshop on Safety and Robustness in Decision Making, Vancouver, Canada

  22. arXiv:1908.08025  [pdf, other]

    cs.CL

    WikiCREM: A Large Unsupervised Corpus for Coreference Resolution

    Authors: Vid Kocijan, Oana-Maria Camburu, Ana-Maria Cretu, Yordan Yordanov, Phil Blunsom, Thomas Lukasiewicz

    Abstract: Pronoun resolution is a major area of natural language understanding. However, large-scale training sets are still scarce, since manually labelling data is costly. In this work, we introduce WikiCREM (Wikipedia CoREferences Masked), a large-scale, yet accurate dataset of pronoun disambiguation instances. We use a language-model-based approach for pronoun resolution in combination with our WikiCREM…

    Submitted 13 October, 2019; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: Accepted to the EMNLP 2019 conference

    Journal ref: IJCNLP-EMNLP 2019

  23. A Surprisingly Robust Trick for Winograd Schema Challenge

    Authors: Vid Kocijan, Ana-Maria Cretu, Oana-Maria Camburu, Yordan Yordanov, Thomas Lukasiewicz

    Abstract: The Winograd Schema Challenge (WSC) dataset WSC273 and its inference counterpart WNLI are popular benchmarks for natural language understanding and commonsense reasoning. In this paper, we show that the performance of three language models on WSC273 strongly improves when fine-tuned on a similar pronoun disambiguation problem dataset (denoted WSCR). We additionally generate a large unsupervised WS…

    Submitted 4 August, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: Appeared as part of the ACL 2019 conference

  24. arXiv:1812.01193  [pdf, ps, other]

    cs.CL

    e-SNLI: Natural Language Inference with Natural Language Explanations

    Authors: Oana-Maria Camburu, Tim Rocktäschel, Thomas Lukasiewicz, Phil Blunsom

    Abstract: In order for machine learning to garner widespread public adoption, models must be able to provide interpretable and robust explanations for their decisions, as well as learn from human-provided explanations at train time. In this work, we extend the Stanford Natural Language Inference dataset with an additional layer of human-annotated natural language explanations of the entailment relations. We…

    Submitted 6 December, 2018; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: NeurIPS 2018

  25. arXiv:1511.02283  [pdf, other]

    cs.CV cs.CL cs.LG cs.RO

    Generation and Comprehension of Unambiguous Object Descriptions

    Authors: Junhua Mao, Jonathan Huang, Alexander Toshev, Oana Camburu, Alan Yuille, Kevin Murphy

    Abstract: We propose a method that can generate an unambiguous description (known as a referring expression) of a specific object or region in an image, and which can also comprehend or interpret such an expression to infer which object is being described. We show that our method outperforms previous methods that generate descriptions of objects without taking into account other potentially ambiguous object…

    Submitted 10 April, 2016; v1 submitted 6 November, 2015; originally announced November 2015.

    Comments: We have released the Google Refexp dataset together with a toolbox for visualization and evaluation, see https://github.com/mjhucla/Google_Refexp_toolbox. Camera ready version for CVPR 2016

    ACM Class: I.2.6; I.2.7; I.2.10

  26. arXiv:1507.07098  [pdf, ps, other]

    math.NT

    Cyclotomic coefficients: gaps and jumps

    Authors: Oana-Maria Camburu, Emil-Alexandru Ciolan, Florian Luca, Pieter Moree, Igor E. Shparlinski

    Abstract: We improve several recent results by Hong, Lee, Lee and Park (2012) on gaps and Bzdęga (2014) on jumps amongst the coefficients of cyclotomic polynomials. Besides direct improvements, we also introduce several new techniques that have never been used in this area.

    Submitted 25 July, 2015; originally announced July 2015.

    Comments: 25 pages

    MSC Class: 11B83; 11L07; 11N25

    Journal ref: J. Number Theory 163 (2016), 211–237