
Showing 1–10 of 10 results for author: Blaes, S

  1. arXiv:2309.05582  [pdf, other]

    cs.LG cs.AI cs.RO

    Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning

    Authors: Marin Vlastelica, Sebastian Blaes, Cristina Pineri, Georg Martius

    Abstract: We introduce a simple but effective method for managing risk in model-based reinforcement learning with trajectory sampling that involves probabilistic safety constraints and balancing of optimism in the face of epistemic uncertainty and pessimism in the face of aleatoric uncertainty of an ensemble of stochastic neural networks. Various experiments indicate that the separation of uncertainties is e…

    Submitted 11 September, 2023; originally announced September 2023.

  2. arXiv:2308.07741  [pdf, other]

    cs.RO cs.LG

    Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World

    Authors: Nico Gürtler, Felix Widmaier, Cansu Sancaktar, Sebastian Blaes, Pavel Kolev, Stefan Bauer, Manuel Wüthrich, Markus Wulfmeier, Martin Riedmiller, Arthur Allshire, Qiang Wang, Robert McCarthy, Hangyeol Kim, Jongchan Baek, Wookyong Kwon, Shanliang Qian, Yasunori Toshimitsu, Mike Yan Michelis, Amirhossein Kazemipour, Arman Raayatsanati, Hehui Zheng, Barnabas Gavin Cangan, Bernhard Schölkopf, Georg Martius

    Abstract: Experimentation on real robots is demanding in terms of time and costs. For this reason, a large part of the reinforcement learning (RL) community uses simulators to develop and benchmark algorithms. However, insights gained in simulation do not necessarily translate to real robots, in particular for tasks involving complex interactions with the environment. The Real Robot Challenge 2022 therefore…

    Submitted 24 November, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Typo in author list fixed

  3. arXiv:2307.15690  [pdf, other]

    cs.LG cs.RO

    Benchmarking Offline Reinforcement Learning on Real-Robot Hardware

    Authors: Nico Gürtler, Sebastian Blaes, Pavel Kolev, Felix Widmaier, Manuel Wüthrich, Stefan Bauer, Bernhard Schölkopf, Georg Martius

    Abstract: Learning policies from previously recorded data is a promising direction for real-world robotics tasks, as online learning is often infeasible. Dexterous manipulation in particular remains an open problem in its general form. The combination of offline reinforcement learning with large diverse datasets, however, has the potential to lead to a breakthrough in this challenging domain analogously to…

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: The Eleventh International Conference on Learning Representations. 2022. Published at ICLR 2023. Datasets available at https://github.com/rr-learning/trifinger_rl_datasets

  4. arXiv:2306.12371  [pdf, other]

    cs.LG cs.RO eess.SY

    Optimistic Active Exploration of Dynamical Systems

    Authors: Bhavya Sukhija, Lenart Treven, Cansu Sancaktar, Sebastian Blaes, Stelian Coros, Andreas Krause

    Abstract: Reinforcement learning algorithms commonly seek to optimize policies for solving one particular task. How should we explore an unknown dynamical system such that the estimated model globally approximates the dynamics and allows us to solve multiple downstream tasks in a zero-shot manner? In this paper, we address this challenge, by developing an algorithm -- OPAX -- for active exploration. OPAX us…

    Submitted 30 October, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  5. arXiv:2302.00768  [pdf, other]

    cs.CL

    Leveraging Task Dependency and Contrastive Learning for Case Outcome Classification on European Court of Human Rights Cases

    Authors: T. Y. S. S Santosh, Marcel Perez San Blas, Phillip Kemper, Matthias Grabmair

    Abstract: We report on an experiment in case outcome classification on European Court of Human Rights cases where our model first learns to identify the convention articles allegedly violated by the state from case facts descriptions, and subsequently uses that information to classify whether the court finds a violation of those articles. We assess the dependency between these two tasks at the feature and o…

    Submitted 13 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted to EACL 2023

  6. arXiv:2209.07899  [pdf, other]

    cs.RO cs.AI cs.LG

    Versatile Skill Control via Self-supervised Adversarial Imitation of Unlabeled Mixed Motions

    Authors: Chenhao Li, Sebastian Blaes, Pavel Kolev, Marin Vlastelica, Jonas Frey, Georg Martius

    Abstract: Learning diverse skills is one of the main challenges in robotics. To this end, imitation learning approaches have achieved impressive results. These methods require explicitly labeled datasets or assume consistent skill execution to enable learning and active control of individual behaviors, which limits their applicability. In this work, we propose a cooperative adversarial method for obtaining…

    Submitted 11 February, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

  7. arXiv:2206.11693  [pdf, other]

    cs.RO cs.AI cs.LG

    Learning Agile Skills via Adversarial Imitation of Rough Partial Demonstrations

    Authors: Chenhao Li, Marin Vlastelica, Sebastian Blaes, Jonas Frey, Felix Grimminger, Georg Martius

    Abstract: Learning agile skills is one of the main challenges in robotics. To this end, reinforcement learning approaches have achieved impressive results. These methods require explicit task information in terms of a reward function or an expert that can be queried in simulation to provide a target control output, which limits their applicability. In this work, we propose a generative adversarial method fo…

    Submitted 21 November, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

  8. arXiv:2206.11403  [pdf, other]

    cs.LG cs.AI cs.RO

    Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

    Authors: Cansu Sancaktar, Sebastian Blaes, Georg Martius

    Abstract: It has been a long-standing dream to design artificial agents that explore their environment efficiently via intrinsic motivation, similar to how children perform curious free play. Despite recent advances in intrinsically motivated reinforcement learning (RL), sample-efficient exploration in object manipulation scenarios remains a significant challenge as most of the relevant information lies in…

    Submitted 26 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version

  9. arXiv:2008.06389  [pdf, other]

    cs.LG cs.RO stat.ML

    Sample-efficient Cross-Entropy Method for Real-time Planning

    Authors: Cristina Pinneri, Shambhuraj Sawant, Sebastian Blaes, Jan Achterhold, Joerg Stueckler, Michal Rolinek, Georg Martius

    Abstract: Trajectory optimizers for model-based reinforcement learning, such as the Cross-Entropy Method (CEM), can yield compelling results even in high-dimensional control tasks and sparse-reward environments. However, their sampling inefficiency prevents them from being used for real-time planning and control. We propose an improved version of the CEM algorithm for fast planning, with novel additions inc…

    Submitted 14 August, 2020; originally announced August 2020.

  10. arXiv:1906.08190  [pdf, other]

    cs.LG cs.AI cs.RO

    Control What You Can: Intrinsically Motivated Task-Planning Agent

    Authors: Sebastian Blaes, Marin Vlastelica Pogančić, Jia-Jie Zhu, Georg Martius

    Abstract: We present a novel intrinsically motivated agent that learns how to control the environment in the fastest possible manner by optimizing learning progress. It learns what can be controlled, how to allocate time and attention, and the relations between objects using surprise based motivation. The effectiveness of our method is demonstrated in a synthetic as well as a robotic manipulation environmen…

    Submitted 9 January, 2020; v1 submitted 19 June, 2019; originally announced June 2019.