Skip to main content

Showing 1–9 of 9 results for author: Wermter, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.00104  [pdf, other

    cs.LG cs.AI stat.ME

    Causal State Distillation for Explainable Reinforcement Learning

    Authors: Wenhao Lu, Xufeng Zhao, Thilo Fryen, Jae Hee Lee, Mengdi Li, Sven Magg, Stefan Wermter

    Abstract: Reinforcement learning (RL) is a powerful technique for training intelligent agents, but understanding why these agents make specific decisions can be quite challenging. This lack of transparency in RL models has been a long-standing problem, making it difficult for users to grasp the reasons behind an agent's behaviour. Various approaches have been explored to address this problem, with one promi… ▽ More

    Submitted 1 April, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: https://lukaswill.github.io/; Accepted as oral by CLeaR 2024

  2. arXiv:2005.03420  [pdf, other

    cs.LG cs.RO stat.ML

    Curious Hierarchical Actor-Critic Reinforcement Learning

    Authors: Frank Röder, Manfred Eppe, Phuong D. H. Nguyen, Stefan Wermter

    Abstract: Hierarchical abstraction and curiosity-driven exploration are two common paradigms in current reinforcement learning approaches to break down difficult problems into a sequence of simpler ones and to overcome reward sparsity. However, there is a lack of approaches that combine these paradigms, and it is currently unknown whether curiosity also helps to perform the hierarchical abstraction. As a no… ▽ More

    Submitted 17 August, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: 12 pages, 4 figures

  3. arXiv:2004.08830  [pdf

    cs.LG cs.AI cs.RO stat.ML

    Improving Robot Dual-System Motor Learning with Intrinsically Motivated Meta-Control and Latent-Space Experience Imagination

    Authors: Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

    Abstract: Combining model-based and model-free learning systems has been shown to improve the sample efficiency of learning to perform complex robotic tasks. However, dual-system approaches fail to consider the reliability of the learned model when it is applied to make multiple-step predictions, resulting in a compounding of prediction errors and performance degradation. In this paper, we present a novel d… ▽ More

    Submitted 1 November, 2020; v1 submitted 19 April, 2020; originally announced April 2020.

    Journal ref: Robotics and Autonomous Systems 133 (2020) 103630

  4. arXiv:2001.06338  [pdf, other

    cs.CV cs.LG stat.ML

    Efficient Facial Feature Learning with Wide Ensemble-based Convolutional Neural Networks

    Authors: Henrique Siqueira, Sven Magg, Stefan Wermter

    Abstract: Ensemble methods, traditionally built with independently trained de-correlated models, have proven to be efficient methods for reducing the remaining residual generalization error, which results in robust and accurate methods for real-world applications. In the context of deep learning, however, training an ensemble of deep networks is costly and generates high redundancy which is inefficient. In… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: Accepted at the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), 1-1, New York, USA

  5. arXiv:1910.04729  [pdf

    cs.LG cs.AI cs.RO stat.ML

    Efficient Intrinsically Motivated Robotic Gras** with Learning-Adaptive Imagination in Latent Space

    Authors: Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

    Abstract: Combining model-based and model-free deep reinforcement learning has shown great promise for improving sample efficiency on complex control tasks while still retaining high performance. Incorporating imagination is a recent effort in this direction inspired by human mental simulation of motor behavior. We propose a learning-adaptive imagination approach which, unlike previous approaches, takes int… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: In: Proceedings of the Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics (ICDL-EpiRob), Oslo, Norway, Aug. 19-22, 2019

  6. arXiv:1905.01718  [pdf, ps, other

    cs.LG cs.AI cs.RO stat.ML

    Curious Meta-Controller: Adaptive Alternation between Model-Based and Model-Free Control in Deep Reinforcement Learning

    Authors: Muhammad Burhan Hafez, Cornelius Weber, Matthias Kerzel, Stefan Wermter

    Abstract: Recent success in deep reinforcement learning for continuous control has been dominated by model-free approaches which, unlike model-based approaches, do not suffer from representational limitations in making assumptions about the world dynamics and model errors inevitable in complex domains. However, they require a lot of experiences compared to model-based approaches that are typically more samp… ▽ More

    Submitted 5 May, 2019; originally announced May 2019.

    Comments: Accepted at IJCNN 2019

  7. arXiv:1809.06146  [pdf, other

    cs.LG stat.ML

    Curriculum goal masking for continuous deep reinforcement learning

    Authors: Manfred Eppe, Sven Magg, Stefan Wermter

    Abstract: Deep reinforcement learning has recently gained a focus on problems where policy or value functions are independent of goals. Evidence exists that the sampling of goals has a strong effect on the learning performance, but there is a lack of general mechanisms that focus on optimizing the goal sampling process. In this work, we present a simple and general goal masking method that also allows us to… ▽ More

    Submitted 13 February, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

  8. arXiv:1802.07569  [pdf, other

    cs.LG q-bio.NC stat.ML

    Continual Lifelong Learning with Neural Networks: A Review

    Authors: German I. Parisi, Ronald Kemker, Jose L. Part, Christopher Kanan, Stefan Wermter

    Abstract: Humans and animals have the ability to continually acquire, fine-tune, and transfer knowledge and skills throughout their lifespan. This ability, referred to as lifelong learning, is mediated by a rich set of neurocognitive mechanisms that together contribute to the development and specialization of our sensorimotor skills as well as to long-term memory consolidation and retrieval. Consequently, l… ▽ More

    Submitted 10 February, 2019; v1 submitted 21 February, 2018; originally announced February 2018.

  9. arXiv:1801.07654  [pdf, other

    cs.LG cs.AI cs.SD q-bio.NC stat.ML

    Expectation Learning for Adaptive Crossmodal Stimuli Association

    Authors: Pablo Barros, German I. Parisi, Di Fu, Xun Liu, Stefan Wermter

    Abstract: The human brain is able to learn, generalize, and predict crossmodal stimuli. Learning by expectation fine-tunes crossmodal processing at different levels, thus enhancing our power of generalization and adaptation in highly dynamic environments. In this paper, we propose a deep neural architecture trained by using expectation learning accounting for unsupervised learning tasks. Our learning model… ▽ More

    Submitted 23 January, 2018; originally announced January 2018.

    Comments: 3 pages 2017 EUCog meeting abstract