Skip to main content

Showing 1–15 of 15 results for author: Stammer, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09949  [pdf, other

    cs.AI cs.LG cs.SC

    Neural Concept Binder

    Authors: Wolfgang Stammer, Antonia Wüst, David Steinmann, Kristian Kersting

    Abstract: The challenge in object-based visual reasoning lies in generating descriptive yet distinct concept representations. Moreover, doing this in an unsupervised fashion requires human users to understand a model's learned concepts and potentially revise false concepts. In addressing this challenge, we introduce the Neural Concept Binder, a new framework for deriving discrete concept representations res… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2402.08280  [pdf, other

    cs.AI cs.CV cs.LG

    Pix2Code: Learning to Compose Neural Visual Concepts as Programs

    Authors: Antonia Wüst, Wolfgang Stammer, Quentin Delfosse, Devendra Singh Dhami, Kristian Kersting

    Abstract: The challenge in learning abstract concepts from images in an unsupervised fashion lies in the required integration of visual perception and generalizable relational reasoning. Moreover, the unsupervised nature of this task makes it necessary for human users to be able to understand a model's learnt concepts and potentially revise false behaviours. To tackle both the generalizability and interpret… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2402.06434  [pdf, other

    cs.LG stat.ML

    Where is the Truth? The Risk of Getting Confounded in a Continual World

    Authors: Florian Peter Busch, Roshni Kamath, Rupert Mitchell, Wolfgang Stammer, Kristian Kersting, Martin Mundt

    Abstract: A dataset is confounded if it is most easily solved via a spurious correlation, which fails to generalize to new data. In this work, we show that, in a continual learning setting where confounders may vary in time across tasks, the challenge of mitigating the effect of confounders far exceeds the standard forgetting problem normally considered. In particular, we provide a formal description of suc… ▽ More

    Submitted 15 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  4. arXiv:2401.05821  [pdf, other

    cs.LG cs.SC

    Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents

    Authors: Quentin Delfosse, Sebastian Sztwiertnia, Mark Rothermel, Wolfgang Stammer, Kristian Kersting

    Abstract: Goal misalignment, reward sparsity and difficult credit assignment are only a few of the many issues that make it difficult for deep reinforcement learning (RL) agents to learn optimal policies. Unfortunately, the black-box nature of deep neural networks impedes the inclusion of domain experts for inspecting the model and revising suboptimal policies. To this end, we introduce *Successive Concept… ▽ More

    Submitted 24 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 20 pages, 8 of main text, 8 of appendix, 3 main figures

  5. arXiv:2309.08395  [pdf, other

    cs.AI cs.LG

    Learning by Self-Explaining

    Authors: Wolfgang Stammer, Felix Friedrich, David Steinmann, Manuel Brack, Hikaru Shindo, Kristian Kersting

    Abstract: Current AI research mainly treats explanations as a means for model inspection. Yet, this neglects findings from human psychology that describe the benefit of self-explanations in an agent's learning process. Motivated by this, we introduce a novel approach in the context of image classification, termed Learning by Self-Explaining (LSX). LSX utilizes aspects of self-refining AI and human-guided ex… ▽ More

    Submitted 5 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  6. arXiv:2308.13453  [pdf, other

    cs.LG cs.AI

    Learning to Intervene on Concept Bottlenecks

    Authors: David Steinmann, Wolfgang Stammer, Felix Friedrich, Kristian Kersting

    Abstract: While deep learning models often lack interpretability, concept bottleneck models (CBMs) provide inherent explanations via their concept representations. Moreover, they allow users to perform interventional interactions on these concepts by updating the concept values and thus correcting the predictive output of the model. Up to this point, these interventions were typically applied to the model j… ▽ More

    Submitted 4 June, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

  7. arXiv:2306.07743  [pdf, other

    cs.AI cs.CV cs.LG

    V-LoL: A Diagnostic Dataset for Visual Logical Learning

    Authors: Lukas Helff, Wolfgang Stammer, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

    Abstract: Despite the successes of recent developments in visual AI, different shortcomings still exist; from missing exact logical reasoning, to abstract generalization abilities, to understanding complex and noisy scenes. Unfortunately, existing benchmarks, were not designed to capture more than a few of these aspects. Whereas deep learning datasets focus on visually complex data but simple visual reasoni… ▽ More

    Submitted 3 July, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  8. Boosting Object Representation Learning via Motion and Object Continuity

    Authors: Quentin Delfosse, Wolfgang Stammer, Thomas Rothenbacher, Dwarak Vittal, Kristian Kersting

    Abstract: Recent unsupervised multi-object detection models have shown impressive performance improvements, largely attributed to novel architectural inductive biases. Unfortunately, they may produce suboptimal object encodings for downstream tasks. To overcome this, we propose to exploit object motion and continuity, i.e., objects do not pop in and out of existence. This is accomplished through two mechani… ▽ More

    Submitted 21 February, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: 8 pages main text, 32 tables, 21 Figures

    Journal ref: Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14172. Springer, Cham

  9. arXiv:2210.10332  [pdf, other

    cs.CL cs.AI cs.HC

    Revision Transformers: Instructing Language Models to Change their Values

    Authors: Felix Friedrich, Wolfgang Stammer, Patrick Schramowski, Kristian Kersting

    Abstract: Current transformer language models (LM) are large-scale models with billions of parameters. They have been shown to provide high performances on a variety of tasks but are also prone to shortcut learning and bias. Addressing such incorrect model behavior via parameter adjustments is very costly. This is particularly problematic for updating dynamic concepts, such as moral values, which vary cultu… ▽ More

    Submitted 25 July, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  10. arXiv:2207.14526  [pdf, other

    cs.LG

    Leveraging Explanations in Interactive Machine Learning: An Overview

    Authors: Stefano Teso, Öznur Alkan, Wolfang Stammer, Elizabeth Daly

    Abstract: Explanations have gained an increasing level of interest in the AI and Machine Learning (ML) communities in order to improve model transparency and allow users to form a mental model of a trained ML model. However, explanations can go beyond this one way communication as a mechanism to elicit user control, because once users understand, they can then provide feedback. The goal of this paper is to… ▽ More

    Submitted 9 October, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

  11. arXiv:2203.03668  [pdf, other

    cs.LG cs.AI cs.HC

    A Typology for Exploring the Mitigation of Shortcut Behavior

    Authors: Felix Friedrich, Wolfgang Stammer, Patrick Schramowski, Kristian Kersting

    Abstract: As machine learning models become increasingly larger, trained weakly supervised on large, possibly uncurated data sets, it becomes increasingly important to establish mechanisms for inspecting, interacting, and revising models to mitigate learning shortcuts and guarantee their learned knowledge is aligned with human knowledge. The recently proposed XIL framework was developed for this purpose, an… ▽ More

    Submitted 14 March, 2024; v1 submitted 4 March, 2022; originally announced March 2022.

  12. arXiv:2112.02290  [pdf, other

    cs.CV cs.LG

    Interactive Disentanglement: Learning Concepts by Interacting with their Prototype Representations

    Authors: Wolfgang Stammer, Marius Memmel, Patrick Schramowski, Kristian Kersting

    Abstract: Learning visual concepts from raw images without strong supervision is a challenging task. In this work, we show the advantages of prototype representations for understanding and revising the latent space of neural concept learners. For this purpose, we introduce interactive Concept Swap** Networks (iCSNs), a novel framework for learning concept-grounded representations via weak supervision and… ▽ More

    Submitted 29 March, 2022; v1 submitted 4 December, 2021; originally announced December 2021.

    Comments: To be published in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

  13. arXiv:2110.03395  [pdf, other

    cs.AI

    SLASH: Embracing Probabilistic Circuits into Neural Answer Set Programming

    Authors: Arseny Skryagin, Wolfgang Stammer, Daniel Ochs, Devendra Singh Dhami, Kristian Kersting

    Abstract: The goal of combining the robustness of neural networks and the expressivity of symbolic methods has rekindled the interest in neuro-symbolic AI. Recent advancements in neuro-symbolic AI often consider specifically-tailored architectures consisting of disjoint neural and symbolic components, and thus do not exhibit desired gains that can be achieved by integrating them into a unifying framework. W… ▽ More

    Submitted 23 November, 2021; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: 18 pages, 7 figures and 6 tables

    ACM Class: I.2.5; D.3.2

  14. arXiv:2011.12854  [pdf, other

    cs.LG cs.AI

    Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting with their Explanations

    Authors: Wolfgang Stammer, Patrick Schramowski, Kristian Kersting

    Abstract: Most explanation methods in deep learning map importance estimates for a model's prediction back to the original input space. These "visual" explanations are often insufficient, as the model's actual concept remains elusive. Moreover, without insights into the model's semantic concept, it is difficult -- if not impossible -- to intervene on the model's behavior via its explanations, called Explana… ▽ More

    Submitted 21 June, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021, p. 3619-3629

  15. arXiv:2001.05371  [pdf, other

    cs.LG cs.AI stat.ML

    Making deep neural networks right for the right scientific reasons by interacting with their explanations

    Authors: Patrick Schramowski, Wolfgang Stammer, Stefano Teso, Anna Brugger, Xiaoting Shao, Hans-Georg Luigs, Anne-Katrin Mahlein, Kristian Kersting

    Abstract: Deep neural networks have shown excellent performances in many real-world applications. Unfortunately, they may show "Clever Hans"-like behavior -- making use of confounding factors within datasets -- to achieve high performance. In this work, we introduce the novel learning setting of "explanatory interactive learning" (XIL) and illustrate its benefits on a plant phenoty** research task. XIL ad… ▽ More

    Submitted 5 March, 2024; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1805.08578