Skip to main content

Showing 1–16 of 16 results for author: Abdulsamad, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.07868  [pdf, other

    stat.ML cs.LG stat.ME

    Nesting Particle Filters for Experimental Design in Dynamical Systems

    Authors: Sahel Iqbal, Adrien Corenflos, Simo Särkkä, Hany Abdulsamad

    Abstract: In this paper, we propose a novel approach to Bayesian experimental design for non-exchangeable data that formulates it as risk-sensitive policy optimization. We develop the Inside-Out SMC$^2$ algorithm, a nested sequential Monte Carlo technique to infer optimal designs, and embed it into a particle Markov chain Monte Carlo framework to perform gradient-based policy amortization. Our approach is d… ▽ More

    Submitted 29 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  2. arXiv:2312.14000  [pdf, ps, other

    cs.LG eess.SY

    Risk-Sensitive Stochastic Optimal Control as Rao-Blackwellized Markovian Score Climbing

    Authors: Hany Abdulsamad, Sahel Iqbal, Adrien Corenflos, Simo Särkkä

    Abstract: Stochastic optimal control of dynamical systems is a crucial challenge in sequential decision-making. Recently, control-as-inference approaches have had considerable success, providing a viable risk-sensitive framework to address the exploration-exploitation dilemma. Nonetheless, a majority of these techniques only invoke the inference-control duality to derive a modified risk objective that is th… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  3. arXiv:2303.06398  [pdf, ps, other

    stat.CO cs.CE eess.SY stat.ML

    Variational Gaussian filtering via Wasserstein gradient flows

    Authors: Adrien Corenflos, Hany Abdulsamad

    Abstract: We present a novel approach to approximate Gaussian and mixture-of-Gaussians filtering. Our method relies on a variational approximation via a gradient-flow representation. The gradient flow is derived from a Kullback--Leibler discrepancy minimization on the space of probability distributions equipped with the Wasserstein metric. We outline the general method and show its competitiveness in poster… ▽ More

    Submitted 19 June, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: 5 pages, 2 figures, double column, minor modifications compared to version 1 (more experiments + typos). Accepted as a conference paper to EUSIPCO 2023

  4. arXiv:2211.01120  [pdf, other

    cs.LG cs.AI cs.RO

    Variational Hierarchical Mixtures for Probabilistic Learning of Inverse Dynamics

    Authors: Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

    Abstract: Well-calibrated probabilistic regression models are a crucial learning component in robotics applications as datasets grow rapidly and tasks become more complex. Unfortunately, classical regression models are usually either probabilistic kernel machines with a flexible structure that does not scale gracefully with data or deterministic and vastly scalable automata, albeit with a restrictive parame… ▽ More

    Submitted 10 September, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2011.05217

  5. arXiv:2206.10313  [pdf, other

    cs.RO cs.LG

    Active Inference for Robotic Manipulation

    Authors: Tim Schneider, Boris Belousov, Hany Abdulsamad, Jan Peters

    Abstract: Robotic manipulation stands as a largely unsolved problem despite significant advances in robotics and machine learning in the last decades. One of the central challenges of manipulation is partial observability, as the agent usually does not know all physical properties of the environment and the objects it is manipulating in advance. A recently emerging theory that deals with partial observabili… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: Published at "The Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM)" 2022

  6. arXiv:2111.06211  [pdf, other

    eess.SY cs.LG

    Model-Based Reinforcement Learning via Stochastic Hybrid Models

    Authors: Hany Abdulsamad, Jan Peters

    Abstract: Optimal control of general nonlinear systems is a central challenge in automation. Enabled by powerful function approximators, data-driven approaches to control have recently successfully tackled challenging applications. However, such methods often obscure the structure of dynamics and control behind black-box over-parameterized representations, thus limiting our ability to understand closed-loop… ▽ More

    Submitted 20 June, 2023; v1 submitted 11 November, 2021; originally announced November 2021.

  7. arXiv:2105.07693  [pdf, other

    cs.LG cs.RO eess.SY

    Efficient Stochastic Optimal Control through Approximate Bayesian Input Inference

    Authors: Joe Watson, Hany Abdulsamad, Rolf Findeisen, Jan Peters

    Abstract: Optimal control under uncertainty is a prevailing challenge for many reasons. One of the critical difficulties lies in producing tractable solutions for the underlying stochastic optimization problem. We show how advanced approximate inference techniques can be used to handle the statistical approximations principled and practically by framing the control problem as a problem of input estimation.… ▽ More

    Submitted 13 March, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Submitted to Transactions on Automatic Control Special Issue: Learning and Control. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  8. arXiv:2103.15388  [pdf, ps, other

    eess.SY cs.RO

    Distributionally Robust Trajectory Optimization Under Uncertain Dynamics via Relative Entropy Trust-Regions

    Authors: Hany Abdulsamad, Tim Dorau, Boris Belousov, Jia-Jie Zhu, Jan Peters

    Abstract: Trajectory optimization and model predictive control are essential techniques underpinning advanced robotic applications, ranging from autonomous driving to full-body humanoid control. State-of-the-art algorithms have focused on data-driven approaches that infer the system dynamics online and incorporate posterior uncertainty during planning and control. Despite their success, such approaches are… ▽ More

    Submitted 11 November, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

  9. arXiv:2102.13176  [pdf, other

    cs.LG

    A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

    Authors: Pascal Klink, Hany Abdulsamad, Boris Belousov, Carlo D'Eramo, Jan Peters, Joni Pajarinen

    Abstract: Across machine learning, the use of curricula has shown strong empirical potential to improve learning from data by avoiding local optima of training objectives. For reinforcement learning (RL), curricula are especially interesting, as the underlying optimization has a strong tendency to get stuck in local optima due to the exploration-exploitation trade-off. Recently, a number of approaches for a… ▽ More

    Submitted 2 September, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Journal ref: Journal of Machine Learning Research 22 (182), Pages 1-52, 2021

  10. arXiv:2011.05217  [pdf, other

    cs.LG cs.RO

    A Variational Infinite Mixture for Probabilistic Inverse Dynamics Learning

    Authors: Hany Abdulsamad, Peter Nickl, Pascal Klink, Jan Peters

    Abstract: Probabilistic regression techniques in control and robotics applications have to fulfill different criteria of data-driven adaptability, computational efficiency, scalability to high dimensions, and the capacity to deal with different modalities in the data. Classical regressors usually fulfill only a subset of these properties. In this work, we extend seminal work on Bayesian nonparametric mixtur… ▽ More

    Submitted 30 March, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

  11. arXiv:2005.01432  [pdf, other

    cs.LG stat.ML

    Hierarchical Decomposition of Nonlinear Dynamics and Control for System Identification and Policy Distillation

    Authors: Hany Abdulsamad, Jan Peters

    Abstract: The control of nonlinear dynamical systems remains a major challenge for autonomous agents. Current trends in reinforcement learning (RL) focus on complex representations of dynamics and policies, which have yielded impressive results in solving a variety of hard control tasks. However, this new sophistication and extremely over-parameterized models have come with the cost of an overall reduction… ▽ More

    Submitted 12 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 2nd Annual Conference on Learning for Dynamics and Control

  12. arXiv:2001.02435  [pdf, other

    cs.LG stat.ML

    A Nonparametric Off-Policy Policy Gradient

    Authors: Samuele Tosatto, Joao Carvalho, Hany Abdulsamad, Jan Peters

    Abstract: Reinforcement learning (RL) algorithms still suffer from high sample complexity despite outstanding recent successes. The need for intensive interactions with the environment is especially observed in many widely popular policy gradient algorithms that perform updates using on-policy samples. The price of such inefficiency becomes evident in real-world scenarios such as interaction-driven robot le… ▽ More

    Submitted 3 August, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

  13. arXiv:1910.03620  [pdf, ps, other

    cs.LG cs.RO stat.ML

    Receding Horizon Curiosity

    Authors: Matthias Schultheis, Boris Belousov, Hany Abdulsamad, Jan Peters

    Abstract: Sample-efficient exploration is crucial not only for discovering rewarding experiences but also for adapting to environment changes in a task-agnostic fashion. A principled treatment of the problem of optimal input synthesis for system identification is provided within the framework of sequential Bayesian experimental design. In this paper, we present an effective trajectory-optimization-based app… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: Published at Conference on Robot Learning (CoRL 2019)

  14. arXiv:1910.03003  [pdf, ps, other

    cs.LG cs.RO eess.SY stat.ML

    Stochastic Optimal Control as Approximate Input Inference

    Authors: Joe Watson, Hany Abdulsamad, Jan Peters

    Abstract: Optimal control of stochastic nonlinear dynamical systems is a major challenge in the domain of robot learning. Given the intractability of the global control problem, state-of-the-art algorithms focus on approximate sequential optimization techniques, that heavily rely on heuristics for regularization in order to achieve stable convergence. By building upon the duality between inference and contr… ▽ More

    Submitted 22 April, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Conference on Robot Learning (CoRL 2019)

  15. arXiv:1910.02826  [pdf, other

    cs.LG stat.ML

    Self-Paced Contextual Reinforcement Learning

    Authors: Pascal Klink, Hany Abdulsamad, Boris Belousov, Jan Peters

    Abstract: Generalization and adaptation of learned skills to novel situations is a core requirement for intelligent autonomous robots. Although contextual reinforcement learning provides a principled framework for learning and generalization of behaviors across related tasks, it generally relies on uninformed sampling of environments from an unknown, uncontrolled context distribution, thus missing the benef… ▽ More

    Submitted 7 October, 2019; originally announced October 2019.

  16. arXiv:1606.09197  [pdf, other

    cs.LG cs.RO

    Model-Free Trajectory-based Policy Optimization with Monotonic Improvement

    Authors: Riad Akrour, Abbas Abdolmaleki, Hany Abdulsamad, Jan Peters, Gerhard Neumann

    Abstract: Many of the recent trajectory optimization algorithms alternate between linear approximation of the system dynamics around the mean trajectory and conservative policy update. One way of constraining the policy change is by bounding the Kullback-Leibler (KL) divergence between successive policies. These approaches already demonstrated great experimental success in challenging problems such as end-t… ▽ More

    Submitted 2 July, 2018; v1 submitted 29 June, 2016; originally announced June 2016.