Showing 1–6 of 6 results for author: Wilming, R

  1. arXiv:2406.11547  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    GECOBench: A Gender-Controlled Text Dataset and Benchmark for Quantifying Biases in Explanations

    Authors: Rick Wilming, Artur Dox, Hjalmar Schulz, Marta Oliveira, Benedict Clark, Stefan Haufe

    Abstract: Large pre-trained language models have become popular for many applications and form an important backbone of many downstream tasks in natural language processing (NLP). Applying 'explainable artificial intelligence' (XAI) techniques to enrich such models' outputs is considered crucial for assuring their quality and shedding light on their inner workings. However, large language models are trained…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Under review

  2. arXiv:2405.12261  [pdf]

    cs.LG cs.AI

    EXACT: Towards a platform for empirically benchmarking Machine Learning model explanation methods

    Authors: Benedict Clark, Rick Wilming, Artur Dox, Paul Eschenbach, Sami Hached, Daniel ** Wodke, Michias Taye Zewdie, Uladzislau Bruila, Marta Oliveira, Hjalmar Schulz, Luca Matteo Cornils, Danny Panknin, Ahcène Boubekki, Stefan Haufe

    Abstract: The evolving landscape of explainable artificial intelligence (XAI) aims to improve the interpretability of intricate machine learning (ML) models, yet faces challenges in formalisation and empirical validation, being an inherently unsupervised process. In this paper, we bring together various benchmark datasets and novel performance metrics in an initial benchmarking platform, the Explainable AI…

    Submitted 20 May, 2024; originally announced May 2024.

  3. arXiv:2306.12816  [pdf, other]

    cs.LG cs.AI cs.CV

    XAI-TRIS: Non-linear image benchmarks to quantify false positive post-hoc attribution of feature importance

    Authors: Benedict Clark, Rick Wilming, Stefan Haufe

    Abstract: The field of 'explainable' artificial intelligence (XAI) has produced highly cited methods that seek to make the decisions of complex machine learning (ML) methods 'understandable' to humans, for example by attributing 'importance' scores to input features. Yet, a lack of formal underpinning leaves it unclear as to what conclusions can safely be drawn from the results of a given XAI method and has…

    Submitted 7 December, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: Under review

  4. arXiv:2306.12150  [pdf, other]

    cs.CV cs.AI cs.LG

    Benchmark data to study the influence of pre-training on explanation performance in MR image classification

    Authors: Marta Oliveira, Rick Wilming, Benedict Clark, Céline Budding, Fabian Eitel, Kerstin Ritter, Stefan Haufe

    Abstract: Convolutional Neural Networks (CNNs) are frequently and successfully used in medical prediction tasks. They are often used in combination with transfer learning, leading to improved performance when training data for the task are scarce. The resulting models are highly complex and typically do not provide any insight into their predictive mechanisms, motivating the field of 'explainable' artificia…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Under review

  5. arXiv:2306.01464  [pdf, other]

    cs.LG cs.AI stat.ML

    Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables

    Authors: Rick Wilming, Leo Kieslich, Benedict Clark, Stefan Haufe

    Abstract: In recent years, the community of 'explainable artificial intelligence' (XAI) has created a vast body of methods to bridge a perceived gap between model 'complexity' and 'interpretability'. However, a concrete problem to be solved by XAI methods has not yet been formally stated. As a result, XAI methods are lacking theoretical and empirical evidence for the 'correctness' of their explanations, lim…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023

  6. arXiv:2111.07473  [pdf, other]

    stat.ML cs.AI cs.LG

    Scrutinizing XAI using linear ground-truth data with suppressor variables

    Authors: Rick Wilming, Céline Budding, Klaus-Robert Müller, Stefan Haufe

    Abstract: Machine learning (ML) is increasingly often used to inform high-stakes decisions. As complex ML models (e.g., deep neural networks) are often considered black boxes, a wealth of procedures has been developed to shed light on their inner workings and the ways in which their predictions come about, defining the field of 'explainable AI' (XAI). Saliency methods rank input features according to some m…

    Submitted 22 June, 2023; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: Corrected typos

    MSC Class: 68T01; 68T07

    ACM Class: I.2; I.5