Skip to main content

Showing 1–14 of 14 results for author: Delfosse, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16748  [pdf, other

    cs.LG cs.CL

    OCALM: Object-Centric Assessment with Language Models

    Authors: Timo Kaufmann, Jannis Blüml, Antonia Wüst, Quentin Delfosse, Kristian Kersting, Eyke Hüllermeier

    Abstract: Properly defining a reward signal to efficiently train a reinforcement learning (RL) agent is a challenging task. Designing balanced objective functions from which a desired behavior can emerge requires expert knowledge, especially for complex environments. Learning rewards from human feedback or using large language models (LLMs) to directly provide rewards are promising alternatives, allowing no… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted at the RLBRew Workshop at RLC 2024

  2. arXiv:2406.06107  [pdf, other

    cs.AI

    EXPIL: Explanatory Predicate Invention for Learning in Games

    Authors: **gyuan Sha, Hikaru Shindo, Quentin Delfosse, Kristian Kersting, Devendra Singh Dhami

    Abstract: Reinforcement learning (RL) has proven to be a powerful tool for training agents that excel in various games. However, the black-box nature of neural network models often hinders our ability to understand the reasoning behind the agent's actions. Recent research has attempted to address this issue by using the guidance of pretrained neural agents to encode logic-based policies, allowing for interp… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 pages references, 8 figures, 3 tables

  3. arXiv:2406.03997  [pdf, other

    cs.AI cs.LG

    HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning

    Authors: Quentin Delfosse, Jannis Blüml, Bjarne Gregori, Kristian Kersting

    Abstract: Artificial agents' adaptability to novelty and alignment with intended behavior is crucial for their effective deployment. Reinforcement learning (RL) leverages novelty as a means of exploration, yet agents often struggle to handle novel situations, hindering generalization. To address these issues, we propose HackAtari, a framework introducing controlled novelty to the most common RL benchmark, t… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 9 main pages, 4 pages references, 19 pages of appendix

  4. arXiv:2405.14956  [pdf, other

    cs.AI cs.LG

    Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning

    Authors: Hector Kohler, Quentin Delfosse, Riad Akrour, Kristian Kersting, Philippe Preux

    Abstract: Deep reinforcement learning agents are prone to goal misalignments. The black-box nature of their policies hinders the detection and correction of such misalignments, and the trust necessary for real-world deployment. So far, solutions learning interpretable policies are inefficient or require many human priors. We propose INTERPRETER, a fast distillation method producing INTerpretable Editable tR… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2404.10906  [pdf, other

    cs.AI cs.HC cs.LG cs.SC

    Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop

    Authors: Hector Kohler, Quentin Delfosse, Paul Festor, Philippe Preux

    Abstract: Embracing the pursuit of intrinsically explainable reinforcement learning raises crucial questions: what distinguishes explainability from interpretability? Should explainable and interpretable agents be developed outside of domains where transparency is imperative? What advantages do interpretable policies offer over neural networks? How can we rigorously define and measure interpretability in po… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  6. arXiv:2402.08280  [pdf, other

    cs.AI cs.CV cs.LG

    Pix2Code: Learning to Compose Neural Visual Concepts as Programs

    Authors: Antonia Wüst, Wolfgang Stammer, Quentin Delfosse, Devendra Singh Dhami, Kristian Kersting

    Abstract: The challenge in learning abstract concepts from images in an unsupervised fashion lies in the required integration of visual perception and generalizable relational reasoning. Moreover, the unsupervised nature of this task makes it necessary for human users to be able to understand a model's learnt concepts and potentially revise false behaviours. To tackle both the generalizability and interpret… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  7. arXiv:2401.05821  [pdf, other

    cs.LG cs.SC

    Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents

    Authors: Quentin Delfosse, Sebastian Sztwiertnia, Mark Rothermel, Wolfgang Stammer, Kristian Kersting

    Abstract: Goal misalignment, reward sparsity and difficult credit assignment are only a few of the many issues that make it difficult for deep reinforcement learning (RL) agents to learn optimal policies. Unfortunately, the black-box nature of deep neural networks impedes the inclusion of domain experts for inspecting the model and revising suboptimal policies. To this end, we introduce *Successive Concept… ▽ More

    Submitted 24 May, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 20 pages, 8 of main text, 8 of appendix, 3 main figures

  8. arXiv:2306.08649  [pdf, other

    cs.LG cs.AI cs.CV

    OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments

    Authors: Quentin Delfosse, Jannis Blüml, Bjarne Gregori, Sebastian Sztwiertnia, Kristian Kersting

    Abstract: Cognitive science and psychology suggest that object-centric representations of complex scenes are a promising step towards enabling efficient abstract reasoning from low-level perceptual features. Yet, most deep reinforcement learning approaches only rely on pixel-based representations that do not capture the compositional properties of natural scenes. For this, we need environments and datasets… ▽ More

    Submitted 27 February, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: 26 pages, 8 main paper pages, 36 appendix pages. In main paper: 4 figures, 3 tables

  9. arXiv:2306.01439  [pdf, other

    cs.LG cs.AI cs.CL cs.LO cs.SC

    Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction

    Authors: Quentin Delfosse, Hikaru Shindo, Devendra Dhami, Kristian Kersting

    Abstract: The limited priors required by neural networks make them the dominating choice to encode and learn policies using reinforcement learning (RL). However, they are also black-boxes, making it hard to understand the agent's behaviour, especially when working on the image level. Therefore, neuro-symbolic RL aims at creating policies that are interpretable in the first place. Unfortunately, interpretabi… ▽ More

    Submitted 25 October, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 9 main pages + appendix (19 in total)

  10. Boosting Object Representation Learning via Motion and Object Continuity

    Authors: Quentin Delfosse, Wolfgang Stammer, Thomas Rothenbacher, Dwarak Vittal, Kristian Kersting

    Abstract: Recent unsupervised multi-object detection models have shown impressive performance improvements, largely attributed to novel architectural inductive biases. Unfortunately, they may produce suboptimal object encodings for downstream tasks. To overcome this, we propose to exploit object motion and continuity, i.e., objects do not pop in and out of existence. This is accomplished through two mechani… ▽ More

    Submitted 21 February, 2024; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: 8 pages main text, 32 tables, 21 Figures

    Journal ref: Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14172. Springer, Cham

  11. arXiv:2205.01549  [pdf, other

    cs.CL cs.AI

    Adaptable Adapters

    Authors: Nafise Sadat Moosavi, Quentin Delfosse, Kristian Kersting, Iryna Gurevych

    Abstract: State-of-the-art pretrained NLP models contain a hundred million to trillion parameters. Adapters provide a parameter-efficient alternative for the full finetuning in which we can only finetune lightweight neural network layers on top of pretrained weights. Adapter layers are initialized randomly. However, existing work uses the same adapter architecture -- i.e., the same adapter layer on top of e… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Accepted at NAACL-2022 main conference

  12. arXiv:2110.03331  [pdf, other

    cs.LG

    CLEVA-Compass: A Continual Learning EValuation Assessment Compass to Promote Research Transparency and Comparability

    Authors: Martin Mundt, Steven Lang, Quentin Delfosse, Kristian Kersting

    Abstract: What is the state of the art in continual machine learning? Although a natural question for predominant static benchmarks, the notion to train systems in a lifelong manner entails a plethora of additional challenges with respect to set-up and evaluation. The latter have recently sparked a growing amount of critiques on prominent algorithm-centric perspectives and evaluation protocols being too nar… ▽ More

    Submitted 1 February, 2022; v1 submitted 7 October, 2021; originally announced October 2021.

    Comments: International Conference on Learning Representations (ICLR) 2022

  13. arXiv:2108.04328  [pdf, other

    cs.NE cs.CV

    Generative Adversarial Neural Cellular Automata

    Authors: Maximilian Otte, Quentin Delfosse, Johannes Czech, Kristian Kersting

    Abstract: Motivated by the interaction between cells, the recently introduced concept of Neural Cellular Automata shows promising results in a variety of tasks. So far, this concept was mostly used to generate images for a single scenario. As each scenario requires a new model, this type of generation seems contradictory to the adaptability of cells in nature. To address this contradiction, we introduce a c… ▽ More

    Submitted 19 July, 2021; originally announced August 2021.

    Comments: 8 pages with 12 figures

    ACM Class: I.2.m

  14. arXiv:2102.09407  [pdf, other

    cs.LG

    Adaptive Rational Activations to Boost Deep Reinforcement Learning

    Authors: Quentin Delfosse, Patrick Schramowski, Martin Mundt, Alejandro Molina, Kristian Kersting

    Abstract: Latest insights from biology show that intelligence not only emerges from the connections between neurons but that individual neurons shoulder more computational responsibility than previously anticipated. This perspective should be critical in the context of constantly changing distinct reinforcement learning environments, yet current approaches still primarily employ static activation functions.… ▽ More

    Submitted 16 March, 2024; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Main paper: 9 pages, References: 4 pages, Appendix: 11 pages. Main paper: 5 figures, Appendix: 6 figures, 6 tables. Rational Activation Functions repository: https://github.com/k4ntz/activation-functions Rational Reinforcement Learning: https://github.com/ml-research/rational_rl