Skip to main content

Showing 1–5 of 5 results for author: Sancaktar, C

.
  1. arXiv:2312.01473  [pdf, other

    cs.LG

    Regularity as Intrinsic Reward for Free Play

    Authors: Cansu Sancaktar, Justus Piater, Georg Martius

    Abstract: We propose regularity as a novel reward signal for intrinsically-motivated reinforcement learning. Taking inspiration from child development, we postulate that striving for structure and order helps guide exploration towards a subspace of tasks that are not favored by naive uncertainty-based intrinsic rewards. Our generalized formulation of Regularity as Intrinsic Reward (RaIR) allows us to operat… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 camera-ready version. Project webpage at http://sites.google.com/view/rair-project

  2. arXiv:2308.07741  [pdf, other

    cs.RO cs.LG

    Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World

    Authors: Nico Gürtler, Felix Widmaier, Cansu Sancaktar, Sebastian Blaes, Pavel Kolev, Stefan Bauer, Manuel Wüthrich, Markus Wulfmeier, Martin Riedmiller, Arthur Allshire, Qiang Wang, Robert McCarthy, Hangyeol Kim, Jongchan Baek, Wookyong Kwon, Shanliang Qian, Yasunori Toshimitsu, Mike Yan Michelis, Amirhossein Kazemipour, Arman Raayatsanati, Hehui Zheng, Barnabas Gavin Cangan, Bernhard Schölkopf, Georg Martius

    Abstract: Experimentation on real robots is demanding in terms of time and costs. For this reason, a large part of the reinforcement learning (RL) community uses simulators to develop and benchmark algorithms. However, insights gained in simulation do not necessarily translate to real robots, in particular for tasks involving complex interactions with the environment. The Real Robot Challenge 2022 therefore… ▽ More

    Submitted 24 November, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Typo in author list fixed

  3. arXiv:2306.12371  [pdf, other

    cs.LG cs.RO eess.SY

    Optimistic Active Exploration of Dynamical Systems

    Authors: Bhavya Sukhija, Lenart Treven, Cansu Sancaktar, Sebastian Blaes, Stelian Coros, Andreas Krause

    Abstract: Reinforcement learning algorithms commonly seek to optimize policies for solving one particular task. How should we explore an unknown dynamical system such that the estimated model globally approximates the dynamics and allows us to solve multiple downstream tasks in a zero-shot manner? In this paper, we address this challenge, by develo** an algorithm -- OPAX -- for active exploration. OPAX us… ▽ More

    Submitted 30 October, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  4. arXiv:2206.11403  [pdf, other

    cs.LG cs.AI cs.RO

    Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

    Authors: Cansu Sancaktar, Sebastian Blaes, Georg Martius

    Abstract: It has been a long-standing dream to design artificial agents that explore their environment efficiently via intrinsic motivation, similar to how children perform curious free play. Despite recent advances in intrinsically motivated reinforcement learning (RL), sample-efficient exploration in object manipulation scenarios remains a significant challenge as most of the relevant information lies in… ▽ More

    Submitted 26 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready version

  5. arXiv:2001.05847  [pdf, other

    cs.CV cs.AI cs.LG cs.RO q-bio.NC

    End-to-End Pixel-Based Deep Active Inference for Body Perception and Action

    Authors: Cansu Sancaktar, Marcel van Gerven, Pablo Lanillos

    Abstract: We present a pixel-based deep active inference algorithm (PixelAI) inspired by human body perception and action. Our algorithm combines the free-energy principle from neuroscience, rooted in variational inference, with deep convolutional decoders to scale the algorithm to directly deal with raw visual input and provide online adaptive inference. Our approach is validated by studying body perceptio… ▽ More

    Submitted 29 May, 2020; v1 submitted 28 December, 2019; originally announced January 2020.