Skip to main content

Showing 1–6 of 6 results for author: Becker-Ehmck, P

.
  1. arXiv:2404.18896  [pdf, other

    cs.LG

    Overcoming Knowledge Barriers: Online Imitation Learning from Observation with Pretrained World Models

    Authors: Xingyuan Zhang, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl

    Abstract: Incorporating the successful paradigm of pretraining and finetuning from Computer Vision and Natural Language Processing into decision-making has become increasingly popular in recent years. In this paper, we study Imitation Learning from Observation with pretrained models and find existing approaches such as BCO and AIME face knowledge barriers, specifically the Embodiment Knowledge Barrier (EKB)… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 19 pages, 7 figures

  2. arXiv:2312.02019  [pdf, other

    cs.LG cs.AI

    Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models

    Authors: Xingyuan Zhang, Philip Becker-Ehmck, Patrick van der Smagt, Maximilian Karl

    Abstract: Unlike most reinforcement learning agents which require an unrealistic amount of environment interactions to learn a new behaviour, humans excel at learning quickly by merely observing and imitating others. This ability highly depends on the fact that humans have a model of their own embodiment that allows them to infer the most likely actions that led to the observed behaviour. In this paper, we… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023

  3. arXiv:2003.08876  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    Learning to Fly via Deep Model-Based Reinforcement Learning

    Authors: Philip Becker-Ehmck, Maximilian Karl, Jan Peters, Patrick van der Smagt

    Abstract: Learning to control robots without requiring engineered models has been a long-term goal, promising diverse and novel applications. Yet, reinforcement learning has only achieved limited impact on real-time robot control due to its high demand of real-world interactions. In this work, by leveraging a learnt probabilistic model of drone dynamics, we learn a thrust-attitude controller for a quadrotor… ▽ More

    Submitted 4 August, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

  4. arXiv:1911.00756  [pdf, other

    cs.LG stat.ML

    Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations

    Authors: Neha Das, Maximilian Karl, Philip Becker-Ehmck, Patrick van der Smagt

    Abstract: Learning a model of dynamics from high-dimensional images can be a core ingredient for success in many applications across different domains, especially in sequential decision making. However, currently prevailing methods based on latent-variable models are limited to working with low resolution images only. In this work, we show that some of the issues with using high-dimensional observations ari… ▽ More

    Submitted 2 November, 2019; originally announced November 2019.

  5. arXiv:1905.12434  [pdf, other

    stat.ML cs.LG

    Switching Linear Dynamics for Variational Bayes Filtering

    Authors: Philip Becker-Ehmck, Jan Peters, Patrick van der Smagt

    Abstract: System identification of complex and nonlinear systems is a central problem for model predictive control and model-based reinforcement learning. Despite their complexity, such systems can often be approximated well by a set of linear dynamical systems if broken into appropriate subsequences. This mechanism not only helps us find good approximations of dynamics, but also gives us deeper insight int… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: Appears in Proceedings of the 36th International Conference on Machine Learning (ICML)

  6. arXiv:1710.05101  [pdf, other

    stat.ML

    Unsupervised Real-Time Control through Variational Empowerment

    Authors: Maximilian Karl, Maximilian Soelch, Philip Becker-Ehmck, Djalel Benbouzid, Patrick van der Smagt, Justin Bayer

    Abstract: We introduce a methodology for efficiently computing a lower bound to empowerment, allowing it to be used as an unsupervised cost function for policy learning in real-time control. Empowerment, being the channel capacity between actions and states, maximises the influence of an agent on its near future. It has been shown to be a good model of biological behaviour in the absence of an extrinsic goa… ▽ More

    Submitted 13 October, 2017; originally announced October 2017.