Skip to main content

Showing 1–12 of 12 results for author: Arndt, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.11439  [pdf, other

    cs.LG cs.AI stat.ML

    From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport

    Authors: Quentin Bouniot, Ievgen Redko, Anton Mallasto, Charlotte Laclau, Karol Arndt, Oliver Struckmeier, Markus Heinonen, Ville Kyrki, Samuel Kaski

    Abstract: In the last decade, we have witnessed the introduction of several novel deep neural network (DNN) architectures exhibiting ever-increasing performance across diverse tasks. Explaining the upward trend of their performance, however, remains difficult as different DNN architectures of comparable depth and width -- common factors associated with their expressive power -- may exhibit a drastically dif… ▽ More

    Submitted 1 July, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Code available at https://github.com/qbouniot/AffScoreDeep

  2. arXiv:2305.07500  [pdf, other

    cs.LG cs.AI

    Learning representations that are closed-form Monge map** optimal with application to domain adaptation

    Authors: Oliver Struckmeier, Ievgen Redko, Anton Mallasto, Karol Arndt, Markus Heinonen, Ville Kyrki

    Abstract: Optimal transport (OT) is a powerful geometric tool used to compare and align probability measures following the least effort principle. Despite its widespread use in machine learning (ML), OT problem still bears its computational burden, while at the same time suffering from the curse of dimensionality for measures supported on general high-dimensional spaces. In this paper, we propose to tackle… ▽ More

    Submitted 11 August, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

  3. arXiv:2209.01207  [pdf, other

    cs.LG cs.AI cs.RO

    Co-Imitation: Learning Design and Behaviour by Imitation

    Authors: Chang Rajani, Karol Arndt, David Blanco-Mulero, Kevin Sebastian Luck, Ville Kyrki

    Abstract: The co-adaptation of robots has been a long-standing research endeavour with the goal of adapting both body and behaviour of a system for a given task, inspired by the natural evolution of animals. Co-adaptation has the potential to eliminate costly manual hardware engineering as well as improve the performance of systems. The standard approach to co-adaptation is to use a reward function for opti… ▽ More

    Submitted 6 February, 2023; v1 submitted 2 September, 2022; originally announced September 2022.

    Comments: 14 pages, 11 figures, accepted for AAAI-23

  4. arXiv:2206.14661  [pdf, other

    cs.RO cs.LG

    Online vs. Offline Adaptive Domain Randomization Benchmark

    Authors: Gabriele Tiboni, Karol Arndt, Giuseppe Averta, Ville Kyrki, Tatiana Tommasi

    Abstract: Physics simulators have shown great promise for conveniently learning reinforcement learning policies in safe, unconstrained environments. However, transferring the acquired knowledge to the real world can be challenging due to the reality gap. To this end, several methods have been recently proposed to automatically tune simulator parameters with posterior distributions given real data, for use w… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 15 pages, 6 figures

  5. arXiv:2204.08573  [pdf, other

    cs.LG cs.RO

    Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

    Authors: Ali Ghadirzadeh, Petra Poklukar, Karol Arndt, Chelsea Finn, Ville Kyrki, Danica Kragic, Mårten Björkman

    Abstract: We present a data-efficient framework for solving sequential decision-making problems which exploits the combination of reinforcement learning (RL) and latent variable generative models. The framework, called GenRL, trains deep policies by introducing an action latent variable such that the feed-forward policy search can be divided into two parts: (i) training a sub-policy that outputs a distribut… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.13134

  6. arXiv:2201.13248  [pdf, other

    cs.RO cs.AI cs.LG cs.NE

    SafeAPT: Safe Simulation-to-Real Robot Learning using Diverse Policies Learned in Simulation

    Authors: Rituraj Kaushik, Karol Arndt, Ville Kyrki

    Abstract: The framework of Simulation-to-real learning, i.e, learning policies in simulation and transferring those policies to the real world is one of the most promising approaches towards data-efficient learning in robotics. However, due to the inevitable reality gap between the simulation and the real world, a policy learned in the simulation may not always generate a safe behaviour on the real robot. A… ▽ More

    Submitted 27 January, 2022; originally announced January 2022.

    Comments: Under review. For video of the paper http://tiny.cc/safeAPT

  7. arXiv:2201.08434  [pdf, other

    cs.RO cs.LG

    DROPO: Sim-to-Real Transfer with Offline Domain Randomization

    Authors: Gabriele Tiboni, Karol Arndt, Ville Kyrki

    Abstract: In recent years, domain randomization over dynamics parameters has gained a lot of traction as a method for sim-to-real transfer of reinforcement learning policies in robotic manipulation; however, finding optimal randomization distributions can be difficult. In this paper, we introduce DROPO, a novel method for estimating domain randomization distributions for safe sim-to-real transfer. Unlike pr… ▽ More

    Submitted 12 January, 2023; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: 16 pages, 21 figures

  8. arXiv:2105.11739  [pdf, other

    cs.RO cs.LG

    Affine Transport for Sim-to-Real Domain Adaptation

    Authors: Anton Mallasto, Karol Arndt, Markus Heinonen, Samuel Kaski, Ville Kyrki

    Abstract: Sample-efficient domain adaptation is an open problem in robotics. In this paper, we present affine transport -- a variant of optimal transport, which models the map** between state transition distributions between the source and target domains with an affine transformation. First, we derive the affine transport framework; then, we extend the basic framework with Procrustes alignment to model ar… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

  9. arXiv:2103.07223  [pdf, other

    cs.LG cs.RO

    Domain Curiosity: Learning Efficient Data Collection Strategies for Domain Adaptation

    Authors: Karol Arndt, Oliver Struckmeier, Ville Kyrki

    Abstract: Domain adaptation is a common problem in robotics, with applications such as transferring policies from simulation to real world and lifelong learning. Performing such adaptation, however, requires informative data about the environment to be available during the adaptation. In this paper, we present domain curiosity -- a method of training exploratory policies that are explicitly optimized to pro… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

  10. arXiv:2010.08397  [pdf, other

    cs.LG cs.RO

    Few-shot model-based adaptation in noisy conditions

    Authors: Karol Arndt, Ali Ghadirzadeh, Murtaza Hazara, Ville Kyrki

    Abstract: Few-shot adaptation is a challenging problem in the context of simulation-to-real transfer in robotics, requiring safe and informative data collection. In physical systems, additional challenge may be posed by domain noise, which is present in virtually all real-world applications. In this paper, we propose to perform few-shot adaptation of dynamics models in noisy conditions using an uncertainty-… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  11. arXiv:1909.12906  [pdf, other

    cs.CV cs.RO

    Meta Reinforcement Learning for Sim-to-real Domain Adaptation

    Authors: Karol Arndt, Murtaza Hazara, Ali Ghadirzadeh, Ville Kyrki

    Abstract: Modern reinforcement learning methods suffer from low sample efficiency and unsafe exploration, making it infeasible to train robotic policies entirely on real hardware. In this work, we propose to address the problem of sim-to-real domain transfer by using meta learning to train a policy that can adapt to a variety of dynamic conditions, and using a task-specific trajectory generation model to pr… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: Submitted to ICRA 2020

  12. arXiv:1903.04053  [pdf, other

    cs.RO cs.LG

    Affordance Learning for End-to-End Visuomotor Robot Control

    Authors: Aleksi Hämäläinen, Karol Arndt, Ali Ghadirzadeh, Ville Kyrki

    Abstract: Training end-to-end deep robot policies requires a lot of domain-, task-, and hardware-specific data, which is often costly to provide. In this work, we propose to tackle this issue by employing a deep neural network with a modular architecture, consisting of separate perception, policy, and trajectory parts. Each part of the system is trained fully on synthetic data or in simulation. The data is… ▽ More

    Submitted 10 March, 2019; originally announced March 2019.

    Comments: 7 pages, 7 figures. Submitted to 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)