Skip to main content

Showing 1–10 of 10 results for author: Arenz, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.00599  [pdf, other

    cs.LG cs.AI cs.RO

    LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning

    Authors: Firas Al-Hafez, Davide Tateo, Oleg Arenz, Guo** Zhao, Jan Peters

    Abstract: Recent methods for imitation learning directly learn a $Q$-function using an implicit reward formulation rather than an explicit reward function. However, these methods generally require implicit reward regularization to improve stability and often mistreat absorbing states. Previous works show that a squared norm regularization on the implicit reward function is effective, but do not provide a th… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

  2. arXiv:2209.11533  [pdf, other

    cs.LG cs.RO stat.ML

    A Unified Perspective on Natural Gradient Variational Inference with Gaussian Mixture Models

    Authors: Oleg Arenz, Philipp Dahlinger, Zihan Ye, Michael Volpp, Gerhard Neumann

    Abstract: Variational inference with Gaussian mixture models (GMMs) enables learning of highly tractable yet multi-modal approximations of intractable target distributions with up to a few hundred dimensions. The two currently most effective methods for GMM-based variational inference, VIPS and iBayes-GMM, both employ independent natural gradient updates for the individual components and their weights. We s… ▽ More

    Submitted 17 July, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: This version corresponds to the camera ready version published at Transactions of Machine Learning Research (TMLR). https://openreview.net/forum?id=tLBjsX4tjs

    Journal ref: Transactions on Machine Learning Research (2023) ISSN: 2835-8856

  3. arXiv:2209.05333  [pdf, other

    cs.LG cs.RO

    Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning

    Authors: Bang You, **gming Xie, You** Chen, Jan Peters, Oleg Arenz

    Abstract: Effective exploration is critical for reinforcement learning agents in environments with sparse rewards or high-dimensional state-action spaces. Recent works based on state-visitation counts, curiosity and entropy-maximization generate intrinsic reward signals to motivate the agent to visit novel states for exploration. However, the agent can get distracted by perturbations to sensor inputs that c… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 14 pages

  4. Integrating Contrastive Learning with Dynamic Models for Reinforcement Learning from Images

    Authors: Bang You, Oleg Arenz, You** Chen, Jan Peters

    Abstract: Recent methods for reinforcement learning from images use auxiliary tasks to learn image features that are used by the agent's policy or Q-function. In particular, methods based on contrastive learning that induce linearity of the latent dynamics or invariance to data augmentation have been shown to greatly improve the sample efficiency of the reinforcement learning algorithm and the generalizabil… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 28 pages, 11 figures, 5 tables

    Journal ref: Neurocomputing 476(2022)102-114

  5. Assisted Teleoperation in Changing Environments with a Mixture of Virtual Guides

    Authors: Marco Ewerton, Oleg Arenz, Jan Peters

    Abstract: Haptic guidance is a powerful technique to combine the strengths of humans and autonomous systems for teleoperation. The autonomous system can provide haptic cues to enable the operator to perform precise movements; the operator can interfere with the plan of the autonomous system leveraging his/her superior cognitive capabilities. However, providing haptic cues such that the individual strengths… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 19 pages, 9 figures

    Journal ref: Advanced Robotics, 2020

  6. arXiv:2008.03525  [pdf, other

    cs.LG cs.IT cs.RO stat.ML

    Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

    Authors: Oleg Arenz, Gerhard Neumann

    Abstract: Many modern methods for imitation learning and inverse reinforcement learning, such as GAIL or AIRL, are based on an adversarial formulation. These methods apply GANs to match the expert's distribution over states and actions with the implicit state-action distribution induced by the agent's policy. However, by framing imitation learning as a saddle point problem, adversarial methods can suffer fr… ▽ More

    Submitted 8 August, 2020; originally announced August 2020.

  7. arXiv:2003.03779  [pdf, other

    cs.RO cs.AI cs.LG

    Deep Adversarial Reinforcement Learning for Object Disentangling

    Authors: Melvin Laux, Oleg Arenz, Jan Peters, Joni Pajarinen

    Abstract: Deep learning in combination with improved training techniques and high computational power has led to recent advances in the field of reinforcement learning (RL) and to successful robotic RL applications such as in-hand manipulation. However, most robotic RL relies on a well known initial state distribution. In real-world tasks, this information is however often not available. For example, when d… ▽ More

    Submitted 17 March, 2021; v1 submitted 8 March, 2020; originally announced March 2020.

    Comments: 7 pages, IROS 2020

  8. arXiv:2002.11495  [pdf, other

    cs.RO

    Probabilistic approach to physical object disentangling

    Authors: Joni Pajarinen, Oleg Arenz, Jan Peters, Gerhard Neumann

    Abstract: Physically disentangling entangled objects from each other is a problem encountered in waste segregation or in any task that requires disassembly of structures. Often there are no object models, and, especially with cluttered irregularly shaped objects, the robot can not create a model of the scene due to occlusion. One of our key insights is that based on previous sensory input we are only intere… ▽ More

    Submitted 12 April, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

  9. arXiv:2001.08682  [pdf, other

    cs.LG stat.ML

    Expected Information Maximization: Using the I-Projection for Mixture Density Estimation

    Authors: Philipp Becker, Oleg Arenz, Gerhard Neumann

    Abstract: Modelling highly multi-modal data is a challenging problem in machine learning. Most algorithms are based on maximizing the likelihood, which corresponds to the M(oment)-projection of the data distribution to the model distribution. The M-projection forces the model to average over modes it cannot represent. In contrast, the I(information)-projection ignores such modes in the data and concentrates… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

  10. arXiv:1907.04710  [pdf, other

    cs.LG stat.ML

    Trust-Region Variational Inference with Gaussian Mixture Models

    Authors: Oleg Arenz, Mingjun Zhong, Gerhard Neumann

    Abstract: Many methods for machine learning rely on approximate inference from intractable probability distributions. Variational inference approximates such distributions by tractable models that can be subsequently used for approximate inference. Learning sufficiently accurate approximations requires a rich model family and careful exploration of the relevant modes of the target distribution. We propose a… ▽ More

    Submitted 4 August, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Journal ref: Journal of Machine Learning Research. 21(163):1-60, 2020