Skip to main content

Showing 1–7 of 7 results for author: Amiranashvili, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.06681  [pdf, other

    cs.CV

    A Benchmark and a Baseline for Robust Multi-view Depth Estimation

    Authors: Philipp Schröppel, Jan Bechtold, Artemij Amiranashvili, Thomas Brox

    Abstract: Recent deep learning approaches for multi-view depth estimation are employed either in a depth-from-video or a multi-view stereo setting. Despite different settings, these approaches are technically similar: they correlate multiple source views with a keyview to estimate a depth map for the keyview. In this work, we introduce the Robust Multi-View Depth Benchmark that is built upon a set of public… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Accepted at 3DV 2022

  2. arXiv:2104.14386  [pdf, other

    cs.LG cs.AI cs.RO

    Pre-training of Deep RL Agents for Improved Learning under Domain Randomization

    Authors: Artemij Amiranashvili, Max Argus, Lukas Hermann, Wolfram Burgard, Thomas Brox

    Abstract: Visual domain randomization in simulated environments is a widely used method to transfer policies trained in simulation to real robots. However, domain randomization and augmentation hamper the training of a policy. As reinforcement learning struggles with a noisy training signal, this additional nuisance can drastically impede training. For difficult tasks it can even result in complete failure… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  3. arXiv:2007.02701  [pdf, other

    cs.LG cs.AI stat.ML

    Scaling Imitation Learning in Minecraft

    Authors: Artemij Amiranashvili, Nicolai Dorka, Wolfram Burgard, Vladlen Koltun, Thomas Brox

    Abstract: Imitation learning is a powerful family of techniques for learning sensorimotor coordination in immersive environments. We apply imitation learning to attain state-of-the-art performance on hard exploration problems in the Minecraft environment. We report experiments that highlight the influence of network architecture, loss function, and data augmentation. An early version of our approach reached… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

  4. arXiv:1910.07972  [pdf, other

    cs.RO cs.CV cs.LG

    Adaptive Curriculum Generation from Demonstrations for Sim-to-Real Visuomotor Control

    Authors: Lukas Hermann, Max Argus, Andreas Eitel, Artemij Amiranashvili, Wolfram Burgard, Thomas Brox

    Abstract: We propose Adaptive Curriculum Generation from Demonstrations (ACGD) for reinforcement learning in the presence of sparse rewards. Rather than designing shaped reward functions, ACGD adaptively sets the appropriate task difficulty for the learner by controlling where to sample from the demonstration trajectories and which set of simulation parameters to use. We show that training vision-based cont… ▽ More

    Submitted 8 July, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted at the 2020 IEEE International Conference on Robotics and Automation (ICRA). Project page see https://lmb.informatik.uni-freiburg.de/projects/curriculum/

  5. arXiv:1902.05605  [pdf, other

    cs.LG stat.ML

    CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity

    Authors: Aditya Bhatt, Daniel Palenicek, Boris Belousov, Max Argus, Artemij Amiranashvili, Thomas Brox, Jan Peters

    Abstract: Sample efficiency is a crucial problem in deep reinforcement learning. Recent algorithms, such as REDQ and DroQ, found a way to improve the sample efficiency by increasing the update-to-data (UTD) ratio to 20 gradient update steps on the critic per environment sample. However, this comes at the expense of a greatly increased computational cost. To reduce this computational burden, we introduce Cro… ▽ More

    Submitted 25 March, 2024; v1 submitted 14 February, 2019; originally announced February 2019.

    Comments: Published at ICLR 2024. Project page at http://aditya.bhatts.org/CrossQ and code release at https://github.com/adityab/CrossQ

  6. arXiv:1901.03162  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Motion Perception in Reinforcement Learning with Dynamic Objects

    Authors: Artemij Amiranashvili, Alexey Dosovitskiy, Vladlen Koltun, Thomas Brox

    Abstract: In dynamic environments, learned controllers are supposed to take motion into account when selecting the action to be taken. However, in existing reinforcement learning works motion is rarely treated explicitly; it is rather assumed that the controller learns the necessary motion representation from temporal stacks of frames implicitly. In this paper, we show that for continuous control tasks lear… ▽ More

    Submitted 1 February, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

  7. arXiv:1806.01175  [pdf, other

    cs.LG cs.AI stat.ML

    TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning

    Authors: Artemij Amiranashvili, Alexey Dosovitskiy, Vladlen Koltun, Thomas Brox

    Abstract: Our understanding of reinforcement learning (RL) has been shaped by theoretical and empirical results that were obtained decades ago using tabular representations and linear function approximators. These results suggest that RL methods that use temporal differencing (TD) are superior to direct Monte Carlo estimation (MC). How do these results hold up in deep RL, which deals with perceptually compl… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.