Skip to main content

Showing 1–36 of 36 results for author: Piater, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09120  [pdf, other

    cs.RO

    Direct Imitation Learning-based Visual Servoing using the Large Projection Formulation

    Authors: Sayantan Auddy, Antonio Paolillo, Justus Piater, Matteo Saveriano

    Abstract: Today robots must be safe, versatile, and user-friendly to operate in unstructured and human-populated environments. Dynamical system-based imitation learning enables robots to perform complex tasks stably and without explicit programming, greatly simplifying their real-world deployment. To exploit the full potential of these systems it is crucial to implement closed loops that use visual feedback… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: First two authors contributed equally

  2. arXiv:2404.02728  [pdf, other

    cs.RO cs.AI cs.LG

    Unsupervised Learning of Effective Actions in Robotics

    Authors: Marko Zaric, Jakob Hollenstein, Justus Piater, Erwan Renaudo

    Abstract: Learning actions that are relevant to decision-making and can be executed effectively is a key problem in autonomous robotics. Current state-of-the-art action representations in robotics lack proper effect-driven learning of the robot's actions. Although successful in solving manipulation tasks, deep learning methods also lack this ability, in addition to their high cost in terms of memory or trai… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted at The First Austrian Symposium on AI, Robotics, and Vision (AIROV24)

  3. arXiv:2403.12193  [pdf, other

    cs.RO

    Continual Domain Randomization

    Authors: Josip Josifovski, Sayantan Auddy, Mohammadhossein Malmir, Justus Piater, Alois Knoll, Nicolás Navarro-Guerrero

    Abstract: Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Under peer review

  4. arXiv:2401.00524  [pdf, other

    cs.RO cs.LG

    Effect of Optimizer, Initializer, and Architecture of Hypernetworks on Continual Learning from Demonstration

    Authors: Sayantan Auddy, Sebastian Bergner, Justus Piater

    Abstract: In continual learning from demonstration (CLfD), a robot learns a sequence of real-world motion skills continually from human demonstrations. Recently, hypernetworks have been successful in solving this problem. In this paper, we perform an exploratory study of the effects of different optimizers, initializers, and network architectures on the continual learning performance of hypernetworks for CL… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

  5. arXiv:2312.17605  [pdf, other

    cs.RO cs.AI

    Unified Task and Motion Planning using Object-centric Abstractions of Motion Constraints

    Authors: Alejandro Agostini, Justus Piater

    Abstract: In task and motion planning (TAMP), the ambiguity and underdetermination of abstract descriptions used by task planning methods make it difficult to characterize physical constraints needed to successfully execute a task. The usual approach is to overlook such constraints at task planning level and to implement expensive sub-symbolic geometric reasoning techniques that perform multiple calls on un… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  6. Colored Noise in PPO: Improved Exploration and Performance through Correlated Action Sampling

    Authors: Jakob Hollenstein, Georg Martius, Justus Piater

    Abstract: Proximal Policy Optimization (PPO), a popular on-policy deep reinforcement learning method, employs a stochastic policy for exploration. In this paper, we propose a colored noise-based stochastic policy variant of PPO. Previous research highlighted the importance of temporal correlation in action noise for effective exploration in off-policy reinforcement learning. Building on this, we investigate… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Journal ref: (2024) Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12466-12472

  7. arXiv:2312.01473  [pdf, other

    cs.LG

    Regularity as Intrinsic Reward for Free Play

    Authors: Cansu Sancaktar, Justus Piater, Georg Martius

    Abstract: We propose regularity as a novel reward signal for intrinsically-motivated reinforcement learning. Taking inspiration from child development, we postulate that striving for structure and order helps guide exploration towards a subspace of tasks that are not favored by naive uncertainty-based intrinsic rewards. Our generalized formulation of Regularity as Intrinsic Reward (RaIR) allows us to operat… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2023 camera-ready version. Project webpage at http://sites.google.com/view/rair-project

  8. arXiv:2311.03600  [pdf, other

    cs.RO

    Scalable and Efficient Continual Learning from Demonstration via a Hypernetwork-generated Stable Dynamics Model

    Authors: Sayantan Auddy, Jakob Hollenstein, Matteo Saveriano, Antonio Rodríguez-Sánchez, Justus Piater

    Abstract: Learning from demonstration (LfD) provides an efficient way to train robots. The learned motions should be convergent and stable, but to be truly effective in the real world, LfD-capable robots should also be able to remember multiple motion skills. Existing stable-LfD approaches lack the capability of multi-skill retention. Although recent work on continual-LfD has shown that hypernetwork-generat… ▽ More

    Submitted 9 January, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: This paper is currently under peer review

  9. arXiv:2311.02475  [pdf, other

    cs.RO cs.AI cs.LG

    Constrained Equation Learner Networks for Precision-Preserving Extrapolation of Robotic Skills

    Authors: Hector Perez-Villeda, Justus Piater, Matteo Saveriano

    Abstract: In Programming by Demonstration, the robot learns novel skills from human demonstrations. After learning, the robot should be able not only to reproduce the skill, but also to generalize it to shifted domains without collecting new training data. Adaptation to similar domains has been investigated in the literature; however, an open problem is how to adapt learned skills to different conditions th… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 18 pages, 10 figures. To be submitted to IEEE Transactions on Robotics (T-RO)

  10. arXiv:2301.09954  [pdf, other

    cs.RO cs.SE

    Differentiable Forward Kinematics for TensorFlow 2

    Authors: Lukas Mölschl, Jakob J. Hollenstein, Justus Piater

    Abstract: Robotic systems are often complex and depend on the integration of a large number of software components. One important component in robotic systems provides the calculation of forward kinematics, which is required by both motion-planning and perception related components. End-to-end learning systems based on deep learning require passing gradients across component boundaries.Typical software impl… ▽ More

    Submitted 10 March, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

  11. Learning and Extrapolation of Robotic Skills using Task-Parameterized Equation Learner Networks

    Authors: Hector Villeda, Justus Piater, Matteo Saveriano

    Abstract: Imitation learning approaches achieve good generalization within the range of the training data, but tend to generate unpredictable motions when querying outside this range. We present a novel approach to imitation learning with enhanced extrapolation capabilities that exploits the so-called Equation Learner Network (EQLN). Unlike conventional approaches, EQLNs use supervised learning to fit a set… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Journal ref: Robotics and Autonomous Systems 2022

  12. arXiv:2208.01134  [pdf, other

    cs.LG

    Improving the Trainability of Deep Neural Networks through Layerwise Batch-Entropy Regularization

    Authors: David Peer, Bart Keulen, Sebastian Stabinger, Justus Piater, Antonio Rodríguez-Sánchez

    Abstract: Training deep neural networks is a very demanding task, especially challenging is how to adapt architectures to improve the performance of trained models. We can find that sometimes, shallow networks generalize better than deep networks, and the addition of more layers results in higher training and test errors. The deep residual learning framework addresses this degradation problem by adding skip… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted at TMLR (07/2022): https://openreview.net/forum?id=LJohl5DnZf

  13. arXiv:2206.03787  [pdf, other

    cs.LG cs.AI

    Action Noise in Off-Policy Deep Reinforcement Learning: Impact on Exploration and Performance

    Authors: Jakob Hollenstein, Sayantan Auddy, Matteo Saveriano, Erwan Renaudo, Justus Piater

    Abstract: Many Deep Reinforcement Learning (D-RL) algorithms rely on simple forms of exploration such as the additive action noise often used in continuous control domains. Typically, the scaling factor of this action noise is chosen as a hyper-parameter and is kept constant during training. In this paper, we focus on action noise in off-policy deep reinforcement learning for continuous control. We analyze… ▽ More

    Submitted 5 June, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Published in Transactions on Machine Learning Research (11/2022) https://openreview.net/forum?id=NljBlZ6hmG

  14. arXiv:2202.06843  [pdf, other

    cs.RO cs.LG

    Continual Learning from Demonstration of Robotics Skills

    Authors: Sayantan Auddy, Jakob Hollenstein, Matteo Saveriano, Antonio Rodríguez-Sánchez, Justus Piater

    Abstract: Methods for teaching motion skills to robots focus on training for a single skill at a time. Robots capable of learning from demonstration can considerably benefit from the added ability to learn new movement skills without forgetting what was learned in the past. To this end, we propose an approach for continual learning from demonstration using hypernetworks and neural ordinary differential equa… ▽ More

    Submitted 12 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: To appear in Robotics and Autonomous Systems

  15. arXiv:2103.09465  [pdf, other

    cs.RO

    Learning Descriptor of Constrained Task from Demonstration

    Authors: Xiang Zhang, Matteo Saveriano, Justus Piater

    Abstract: Constrained objects, such as doors and drawers are often complex and share a similar structure in the human environment. A robot needs to interact accurately with constrained objects to safely and successfully complete a task. Learning from Demonstration offers an appropriate path to learn the object structure of the demonstration for unknown objects for unknown tasks. There is work that extracts… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

  16. DeepSym: Deep Symbol Generation and Rule Learning from Unsupervised Continuous Robot Interaction for Planning

    Authors: Alper Ahmetoglu, M. Yunus Seker, Justus Piater, Erhan Oztop, Emre Ugur

    Abstract: We propose a novel general method that finds action-grounded, discrete object and effect categories and builds probabilistic rules over them for non-trivial action planning. Our robot interacts with objects using an initial action repertoire that is assumed to be acquired earlier and observes the effects it can create in the environment. To form action-grounded object, effect, and relational categ… ▽ More

    Submitted 27 September, 2022; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: To appear in JAIR

  17. arXiv:2010.15533  [pdf, other

    cs.LG

    How do Offline Measures for Exploration in Reinforcement Learning behave?

    Authors: Jakob J. Hollenstein, Sayantan Auddy, Matteo Saveriano, Erwan Renaudo, Justus Piater

    Abstract: Sufficient exploration is paramount for the success of a reinforcement learning agent. Yet, exploration is rarely assessed in an algorithm-independent way. We compare the behavior of three data-based, offline exploration metrics described in the literature on intuitive simple distributions and highlight problems to be aware of when using them. We propose a fourth metric,uniform relative entropy, a… ▽ More

    Submitted 29 October, 2020; originally announced October 2020.

    Comments: KBRL Workshop at IJCAI-PRICAI 2020, Yokohama, Japan

  18. arXiv:2010.12974  [pdf, other

    cs.LG cs.RO

    Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search

    Authors: Jakob J. Hollenstein, Erwan Renaudo, Matteo Saveriano, Justus Piater

    Abstract: Local policy search is performed by most Deep Reinforcement Learning (D-RL) methods, which increases the risk of getting trapped in a local minimum. Furthermore, the availability of a simulation model is not fully exploited in D-RL even in simulation-based training, which potentially decreases efficiency. To better exploit simulation models in policy search, we propose to integrate a kinodynamic p… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

  19. arXiv:2007.10663  [pdf, other

    cs.RO

    Reconfigurable Behavior Trees: Towards an Executive Framework Meeting High-level Decision Making and Control Layer Features

    Authors: Pilar de la Cruz, Justus Piater, Matteo Saveriano

    Abstract: Behavior Trees constitute a widespread AI tool which has been successfully spun out in robotics. Their advantages include simplicity, modularity, and reusability of code. However, Behavior Trees remain a high-level decision making engine; control features cannot be easily integrated. This paper proposes the Reconfigurable Behavior Trees (RBTs), an extension of the traditional BTs that considers ph… ▽ More

    Submitted 31 August, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: Accepted to System, Man and Cybernetics (SMC2020)

  20. arXiv:2001.10857  [pdf, other

    cs.CV

    Evaluating the Progress of Deep Learning for Visual Relational Concepts

    Authors: Sebastian Stabinger, Peer David, Justus Piater, Antonio Rodríguez-Sánchez

    Abstract: Convolutional Neural Networks (CNNs) have become the state of the art method for image classification in the last ten years. Despite the fact that they achieve superhuman classification accuracy on many popular datasets, they often perform much worse on more abstract image classification tasks. We will show that these difficult tasks are linked to relational concepts from cognitive psychology and… ▽ More

    Submitted 13 September, 2021; v1 submitted 29 January, 2020; originally announced January 2020.

    Comments: Accepted for publication at Journal of Vision

  21. arXiv:1809.04317  [pdf, other

    cs.RO

    Action Representations in Robotics: A Taxonomy and Systematic Classification

    Authors: Philipp Zech, Erwan Renaudo, Simon Haller, Xiang Zhang, Justus Piater

    Abstract: Understanding and defining the meaning of "action" is substantial for robotics research. This becomes utterly evident when aiming at equip** autonomous robots with robust manipulation skills for action execution. Unfortunately, to this day we still lack both a clear understanding of the concept of an action and a set of established criteria that ultimately characterize an action. In this survey… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: 36 pages, 4 figures, 7 tables, submitted to the International Journal of Robotics Research (IJRR)

  22. arXiv:1805.04354  [pdf, other

    cs.RO

    Learning Movement Assessment Primitives for Force Interaction Skills

    Authors: Xiang Zhang, Athanasios S. Polydoros, Justus Piater

    Abstract: We present a novel, reusable and task-agnostic primitive for assessing the outcome of a force-interaction robotic skill, useful e.g.\ for applications such as quality control in industrial manufacturing. The proposed method is easily programmed by kinesthetic teaching, and the desired adaptability and reusability are achieved by machine learning models. The primitive records sensory data during bo… ▽ More

    Submitted 11 May, 2018; originally announced May 2018.

  23. Symbol Emergence in Cognitive Developmental Systems: a Survey

    Authors: Tadahiro Taniguchi, Emre Ugur, Matej Hoffmann, Lorenzo Jamone, Takayuki Nagai, Benjamin Rosman, Toshihiko Matsuka, Naoto Iwahashi, Erhan Oztop, Justus Piater, Florentin Wörgötter

    Abstract: Humans use signs, e.g., sentences in a spoken language, for communication and thought. Hence, symbol systems like language are crucial for our communication with other agents and adaptation to our real-world environment. The symbol systems we use in our human society adaptively and dynamically change over time. In the context of artificial intelligence (AI) and cognitive systems, the symbol ground… ▽ More

    Submitted 10 July, 2018; v1 submitted 26 January, 2018; originally announced January 2018.

    Comments: 23 pages, 6 figures. Submitted to IEEE Transactions on Cognitive and Developmental Systems

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems, vol. 11, no. 4, pp. 494-516, 2019

  24. arXiv:1709.06049  [pdf, other

    cs.RO

    A novel Skill-based Programming Paradigm based on Autonomous Playing and Skill-centric Testing

    Authors: Simon Hangl, Andreas Mennel, Justus Piater

    Abstract: We introduce a novel paradigm for robot pro- gramming with which we aim to make robot programming more accessible for unexperienced users. In order to do so we incorporate two major components in one single framework: autonomous skill acquisition by robotic playing and visual programming. Simple robot program skeletons solving a task for one specific situation, so-called basic behaviours, are prov… ▽ More

    Submitted 18 September, 2017; originally announced September 2017.

  25. arXiv:1706.08560  [pdf, other

    cs.RO

    Skill Learning by Autonomous Robotic Playing using Active Learning and Creativity

    Authors: Simon Hangl, Vedran Dunjko, Hans J. Briegel, Justus Piater

    Abstract: We treat the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous playing with the object. By applying previously-trained skills and behaviours, the robot learns how to prepare situations for which a successful strategy is already k… ▽ More

    Submitted 26 June, 2017; originally announced June 2017.

  26. arXiv:1703.00835  [pdf, other

    cs.RO

    Autonomous Skill-centric Testing using Deep Learning

    Authors: Simon Hangl, Sebastian Stabinger, Justus Piater

    Abstract: Software testing is an important tool to ensure software quality. This is a hard task in robotics due to dynamic environments and the expensive development and time-consuming execution of test cases. Most testing approaches use model-based and / or simulation-based testing to overcome these problems. We propose model-free skill-centric testing in which a robot autonomously executes skills in the r… ▽ More

    Submitted 13 August, 2017; v1 submitted 2 March, 2017; originally announced March 2017.

  27. arXiv:1611.06368  [pdf, other

    cs.RO

    Active and Transfer Learning of Grasps by Kernel Adaptive MCMC

    Authors: Philipp Zech, Hanchen Xiong, Justus Piater

    Abstract: Human ability of both versatile gras** of given objects and gras** of novel (as of yet unseen) objects is truly remarkable. This probably arises from the experience infants gather by actively playing around with diverse objects. Moreover, knowledge acquired during this process is reused during learning of how to grasp novel objects. We conjecture that this combined process of active and transf… ▽ More

    Submitted 19 November, 2016; originally announced November 2016.

    Comments: 6 pages, 4 figures, 3 tables, technical report

  28. arXiv:1611.06367  [pdf, other

    cs.RO

    Active and Transfer Learning of Grasps by Sampling from Demonstration

    Authors: Philipp Zech, Justus Piater

    Abstract: We guess humans start acquiring gras** skills as early as at the infant stage by virtue of two key processes. First, infants attempt to learn grasps for known objects by imitating humans. Secondly, knowledge acquired during this process is reused in learning to grasp novel objects. We argue that these processes of active and transfer learning boil down to a random search of grasps on an object,… ▽ More

    Submitted 19 November, 2016; originally announced November 2016.

    Comments: 6 pages, 3 figures, 3 tables, technical report. arXiv admin note: substantial text overlap with arXiv:1611.06366

  29. arXiv:1611.06366  [pdf, other

    cs.RO

    Grasp Learning by Sampling from Demonstration

    Authors: Philipp Zech, Justus Piater

    Abstract: Robotic gras** traditionally relies on object features or shape information for learning new or applying already learned grasps. We argue however that such a strong reliance on object geometric information renders gras** and grasp learning a difficult task in the event of cluttered environments with high uncertainty where reasonable object models are not available. This being so, in this paper… ▽ More

    Submitted 19 November, 2016; originally announced November 2016.

    Comments: 6 pages, 3 figures, 2 tables, technical report. arXiv admin note: substantial text overlap with arXiv:1611.06367

  30. arXiv:1607.08366  [pdf, other

    cs.CV

    25 years of CNNs: Can we compare to human abstraction capabilities?

    Authors: Sebastian Stabinger, Antonio Rodríguez-Sánchez, Justus Piater

    Abstract: We try to determine the progress made by convolutional neural networks over the past 25 years in classifying images into abstractc lasses. For this purpose we compare the performance of LeNet to that of GoogLeNet at classifying randomly generated images which are differentiated by an abstract property (e.g., one class contains two objects of the same size, the other class two objects of different… ▽ More

    Submitted 28 July, 2016; originally announced July 2016.

    Comments: To appear in the proceedings of ICANN 2016, Springer

  31. Learning Abstract Classes using Deep Learning

    Authors: Sebastian Stabinger, Antonio Rodriguez-Sanchez, Justus Piater

    Abstract: Humans are generally good at learning abstract concepts about objects and scenes (e.g.\ spatial orientation, relative sizes, etc.). Over the last years convolutional neural networks have achieved almost human performance in recognizing concrete classes (i.e.\ specific object categories). This paper tests the performance of a current CNN (GoogLeNet) on the task of differentiating between abstract c… ▽ More

    Submitted 17 June, 2016; originally announced June 2016.

    Comments: To be published in the proceedings of the International Conference on Bio-inspired Information and Communications Technologies 2015

  32. Robotic Playing for Hierarchical Complex Skill Learning

    Authors: Simon Hangl, Emre Ugur, Sandor Szedmak, Justus Piater

    Abstract: In complex manipulation scenarios (e.g. tasks requiring complex interaction of two hands or in-hand manipulation), generalization is a hard problem. Current methods still either require a substantial amount of (supervised) training data and / or strong assumptions on both the environment and the task. In this paradigm, controllers solving these tasks tend to be complex. We propose a paradigm of ma… ▽ More

    Submitted 13 August, 2017; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: Published at IROS2016 (http://www.iros2016.org/)

  33. arXiv:1306.3551   

    cs.RO

    Proceedings of the 2nd Workshop on Robots in Clutter: Preparing robots for the real world (Berlin, 2013)

    Authors: Michael Zillich, Maren Bennewitz, Maria Fox, Justus Piater, Dejan Pangercic

    Abstract: This volume represents the proceedings of the 2nd Workshop on Robots in Clutter: Preparing robots for the real world, held June 27, 2013, at the Robotics: Science and Systems conference in Berlin, Germany.

    Submitted 15 June, 2013; originally announced June 2013.

    ACM Class: I.2.9

  34. arXiv:1305.5905  [pdf, other

    cs.CV

    ÖAGM/AAPR 2013 - The 37th Annual Workshop of the Austrian Association for Pattern Recognition

    Authors: Justus Piater, Antonio J. Rodríguez Sánchez

    Abstract: In this editorial, the organizers summarize facts and background about the event.

    Submitted 25 May, 2013; originally announced May 2013.

    Comments: Part of the OAGM/AAPR 2013 proceedings (arXiv:1304.1876)

    Report number: OAGM-AAPR/2013/00

  35. arXiv:1304.1876   

    cs.CV

    Proceedings of the 37th Annual Workshop of the Austrian Association for Pattern Recognition (ÖAGM/AAPR), 2013

    Authors: Justus Piater, Antonio Rodríguez-Sánchez

    Abstract: This volume represents the proceedings of the 37th Annual Workshop of the Austrian Association for Pattern Recognition (ÖAGM/AAPR), held May 23-24, 2013, in Innsbruck, Austria.

    Submitted 28 May, 2013; v1 submitted 6 April, 2013; originally announced April 2013.

    Comments: Contributed papers presented at ÖAGM/AAPR 2013

    ACM Class: I.4; I.5; I.2.10

  36. Closed-Loop Learning of Visual Control Policies

    Authors: S. R. Jodogne, J. H. Piater

    Abstract: In this paper we present a general, flexible framework for learning map**s from images to actions by interacting with the environment. The basic idea is to introduce a feature-based image classifier in front of a reinforcement learning algorithm. The classifier partitions the visual space according to the presence or absence of few highly informative local descriptors that are incrementally sele… ▽ More

    Submitted 10 October, 2011; originally announced October 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 28, pages 349-391, 2007