Showing 1–4 of 4 results for author: Schlaginhaufen, A

  1. arXiv:2406.01793  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards the Transferability of Rewards Recovered via Regularized Inverse Reinforcement Learning

    Authors: Andreas Schlaginhaufen, Maryam Kamgarpour

    Abstract: Inverse reinforcement learning (IRL) aims to infer a reward from expert demonstrations, motivated by the idea that the reward, rather than the policy, is the most succinct and transferable description of a task [Ng et al., 2000]. However, the reward corresponding to an optimal policy is not unique, making it unclear if an IRL-learned reward is transferable to new transition laws in the sense that…

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2403.16829  [pdf, ps, other]

    cs.LG cs.AI

    Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm

    Authors: Titouan Renard, Andreas Schlaginhaufen, Tingting Ni, Maryam Kamgarpour

    Abstract: Given a dataset of expert demonstrations, inverse reinforcement learning (IRL) aims to recover a reward for which the expert is optimal. This work proposes a model-free algorithm to solve the entropy-regularized IRL problem. In particular, we employ a stochastic gradient descent update for the reward and a stochastic soft policy iteration update for the policy. Assuming access to a generative model, w…

    Submitted 23 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  3. arXiv:2306.00629  [pdf, other]

    cs.LG cs.AI eess.SY math.OC

    Identifiability and Generalizability in Constrained Inverse Reinforcement Learning

    Authors: Andreas Schlaginhaufen, Maryam Kamgarpour

    Abstract: Two main challenges in Reinforcement Learning (RL) are designing appropriate reward functions and ensuring the safety of the learned policy. To address these challenges, we present a theoretical framework for Inverse Reinforcement Learning (IRL) in constrained Markov decision processes. From a convex-analytic perspective, we extend prior results on reward identifiability and generalizability to bo…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Published at ICML 2023

  4. arXiv:2110.14296  [pdf, other]

    cs.LG eess.SY math.DS stat.ML

    Learning Stable Deep Dynamics Models for Partially Observed or Delayed Dynamical Systems

    Authors: Andreas Schlaginhaufen, Philippe Wenk, Andreas Krause, Florian Dörfler

    Abstract: Learning how complex dynamical systems evolve over time is a key challenge in system identification. For safety critical systems, it is often crucial that the learned model is guaranteed to converge to some equilibrium point. To this end, neural ODEs regularized with neural Lyapunov functions are a promising approach when states are fully observed. For practical applications however, partial obser…

    Submitted 10 December, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems, 2021