Skip to main content

Showing 1–31 of 31 results for author: Stork, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03082  [pdf, other

    cs.LG

    Learning Solutions of Stochastic Optimization Problems with Bayesian Neural Networks

    Authors: Alan A. Lahoud, Erik Schaffernicht, Johannes A. Stork

    Abstract: Mathematical solvers use parametrized Optimization Problems (OPs) as inputs to yield optimal decisions. In many real-world settings, some of these parameters are unknown or uncertain. Recent research focuses on predicting the value of these unknown parameters using available contextual features, aiming to decrease decision regret by adopting end-to-end learning approaches. However, these approache… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2405.04923  [pdf, other

    cs.LG cs.AI

    DataSP: A Differential All-to-All Shortest Path Algorithm for Learning Costs and Predicting Paths with Context

    Authors: Alan A. Lahoud, Erik Schaffernicht, Johannes A. Stork

    Abstract: Learning latent costs of transitions on graphs from trajectories demonstrations under various contextual features is challenging but useful for path planning. Yet, existing methods either oversimplify cost assumptions or scale poorly with the number of observed trajectories. This paper introduces DataSP, a differentiable all-to-all shortest path algorithm to facilitate learning latent costs from t… ▽ More

    Submitted 30 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  3. arXiv:2405.01198  [pdf, other

    cs.LG cs.AI

    Towards Interpretable Reinforcement Learning with Constrained Normalizing Flow Policies

    Authors: Finn Rietz, Erik Schaffernicht, Stefan Heinrich, Johannes A. Stork

    Abstract: Reinforcement learning policies are typically represented by black-box neural networks, which are non-interpretable and not well-suited for safety-critical domains. To address both of these issues, we propose constrained normalizing flow policies as interpretable and safe-by-construction policy models. We achieve safety for reinforcement learning problems with instantaneous safety constraints, for… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2310.17785  [pdf, other

    cs.RO cs.LG

    Learning Extrinsic Dexterity with Parameterized Manipulation Primitives

    Authors: Shih-Min Yang, Martin Magnusson, Johannes A. Stork, Todor Stoyanov

    Abstract: Many practically relevant robot gras** problems feature a target object for which all grasps are occluded, e.g., by the environment. Single-shot grasp planning invariably fails in such scenarios. Instead, it is necessary to first manipulate the object into a configuration that affords a grasp. We solve this problem by learning a sequence of actions that utilize the environment to change the obje… ▽ More

    Submitted 9 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  5. arXiv:2310.07493  [pdf, other

    cs.AI

    Diversity for Contingency: Learning Diverse Behaviors for Efficient Adaptation and Transfer

    Authors: Finn Rietz, Johannes Andreas Stork

    Abstract: Discovering all useful solutions for a given task is crucial for transferable RL agents, to account for changes in the task or transition dynamics. This is not considered by classical RL algorithms that are only concerned with finding the optimal policy, given the current task and dynamics. We propose a simple method for discovering all possible solutions of a given task, to obtain an agent that p… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Presented at the third RL-Conform workshop at IROS 2023

  6. arXiv:2310.02360  [pdf, other

    cs.AI

    Prioritized Soft Q-Decomposition for Lexicographic Reinforcement Learning

    Authors: Finn Rietz, Erik Schaffernicht, Stefan Heinrich, Johannes Andreas Stork

    Abstract: Reinforcement learning (RL) for complex tasks remains a challenge, primarily due to the difficulties of engineering scalar reward functions and the inherent inefficiency of training models from scratch. Instead, it would be better to specify complex tasks in terms of elementary subtasks and to reuse subtask solutions whenever possible. In this work, we address continuous space lexicographic multi-… ▽ More

    Submitted 2 May, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Camera ready version

  7. arXiv:2210.08600  [pdf, other

    cs.RO

    Heterogeneous Full-body Control of a Mobile Manipulator with Behavior Trees

    Authors: Marco Iannotta, David Cáceres Domínguez, Johannes A. Stork, Erik Schaffernicht, Todor Stoyanov

    Abstract: Integrating the heterogeneous controllers of a complex mechanical system, such as a mobile manipulator, within the same structure and in a modular way is still challenging. In this work we extend our framework based on Behavior Trees for the control of a redundant mechanical system to the problem of commanding more complex systems that involve multiple low-level controllers. This allows the integr… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2209.08619

  8. arXiv:2210.02891  [pdf, other

    cs.RO cs.AI cs.LG

    Transferring Knowledge for Reinforcement Learning in Contact-Rich Manipulation

    Authors: Quantao Yang, Johannes A. Stork, Todor Stoyanov

    Abstract: In manufacturing, assembly tasks have been a challenge for learning algorithms due to variant dynamics of different environments. Reinforcement learning (RL) is a promising framework to automatically learn these tasks, yet it is still not easy to apply a learned policy or skill, that is the ability of solving a task, to a similar environment even if the deployment conditions are only slightly diff… ▽ More

    Submitted 19 September, 2022; originally announced October 2022.

  9. arXiv:2209.09536  [pdf, other

    cs.LG cs.AI

    Towards Task-Prioritized Policy Composition

    Authors: Finn Rietz, Erik Schaffernicht, Todor Stoyanov, Johannes A. Stork

    Abstract: Combining learned policies in a prioritized, ordered manner is desirable because it allows for modular design and facilitates data reuse through knowledge transfer. In control theory, prioritized composition is realized by null-space control, where low-priority control actions are projected into the null-space of high-priority control actions. Such a method is currently unavailable for Reinforceme… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  10. A Stack-of-Tasks Approach Combined with Behavior Trees: a New Framework for Robot Control

    Authors: David Cáceres Domínguez, Marco Iannotta, Johannes A. Stork, Erik Schaffernicht, Todor Stoyanov

    Abstract: Stack-of-Tasks (SoT) control allows a robot to simultaneously fulfill a number of prioritized goals formulated in terms of (in)equality constraints in error space. Since this approach solves a sequence of Quadratic Programs (QP) at each time-step, without taking into account any temporal state evolution, it is suitable for dealing with local disturbances. However, its limitation lies in the handli… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

  11. arXiv:2107.13977  [pdf, other

    cs.AI

    Underwater Acoustic Networks for Security Risk Assessment in Public Drinking Water Reservoirs

    Authors: Jörg Stork, Philip Wenzel, Severin Landwein, Maria-Elena Algorri, Martin Zaefferer, Wolfgang Kusch, Martin Staubach, Thomas Bartz-Beielstein, Hartmut Köhn, Hermann Dejager, Christian Wolf

    Abstract: We have built a novel system for the surveillance of drinking water reservoirs using underwater sensor networks. We implement an innovative AI-based approach to detect, classify and localize underwater events. In this paper, we describe the technology and cognitive AI architecture of the system based on one of the sensor networks, the hydrophone network. We discuss the challenges of installing and… ▽ More

    Submitted 29 July, 2021; originally announced July 2021.

  12. Behavior-based Neuroevolutionary Training in Reinforcement Learning

    Authors: Jörg Stork, Martin Zaefferer, Nils Eisler, Patrick Tichelmann, Thomas Bartz-Beielstein, A. E. Eiben

    Abstract: In addition to their undisputed success in solving classical optimization problems, neuroevolutionary and population-based algorithms have become an alternative to standard reinforcement learning methods. However, evolutionary methods often lack the sample efficiency of standard value-based methods that leverage gathered state and value experience. If reinforcement learning for real-world problems… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

  13. arXiv:2005.06195  [pdf, other

    cs.LG stat.ML

    The effect of Target Normalization and Momentum on Dying ReLU

    Authors: Isac Arnekvist, J. Frederico Carvalho, Danica Kragic, Johannes A. Stork

    Abstract: Optimizing parameters with momentum, normalizing data values, and using rectified linear units (ReLUs) are popular choices in neural network (NN) regression. Although ReLUs are popular, they can collapse to a constant function and "die", effectively removing their contribution from the model. While some mitigations are known, the underlying reasons of ReLUs dying during optimization are currently… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  14. CAAI -- A Cognitive Architecture to Introduce Artificial Intelligence in Cyber-Physical Production Systems

    Authors: Andreas Fischbach, Jan Strohschein, Andreas Bunte, Jörg Stork, Heide Faeskorn-Woyke, Natalia Moriz, Thomas Bartz-Beielstein

    Abstract: This paper introduces CAAI, a novel cognitive architecture for artificial intelligence in cyber-physical production systems. The goal of the architecture is to reduce the implementation effort for the usage of artificial intelligence algorithms. The core of the CAAI is a cognitive module that processes declarative goals of the user, selects suitable models and algorithms, and creates a configurati… ▽ More

    Submitted 26 February, 2020; originally announced March 2020.

  15. arXiv:2002.04911  [pdf, other

    cs.LG cs.RO stat.ML

    Ensemble of Sparse Gaussian Process Experts for Implicit Surface Map** with Streaming Data

    Authors: Johannes A. Stork, Todor Stoyanov

    Abstract: Creating maps is an essential task in robotics and provides the basis for effective planning and navigation. In this paper, we learn a compact and continuous implicit surface map of an environment from a stream of range data with known poses. For this, we create and incrementally adjust an ensemble of approximate Gaussian process (GP) experts which are each responsible for a different part of the… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

  16. arXiv:1912.07024  [pdf, other

    cs.RO

    Multi-Object Rearrangement with Monte Carlo Tree Search:A Case Study on Planar Nonprehensile Sorting

    Authors: Haoran Song, Joshua A. Haustein, Weihao Yuan, Kaiyu Hang, Michael Yu Wang, Danica Kragic, Johannes A. Stork

    Abstract: In this work, we address a planar non-prehensile sorting task. Here, a robot needs to push many densely packed objects belonging to different classes into a configuration where these classes are clearly separated from each other. To achieve this, we propose to employ Monte Carlo tree search equipped with a task-specific heuristic function. We evaluate the algorithm on various simulated and real-wo… ▽ More

    Submitted 18 January, 2021; v1 submitted 15 December, 2019; originally announced December 2019.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2020; Project page at http://haoran-song.github.io/mcts-sorting/

  17. Surrogate Models for Enhancing the Efficiency of Neuroevolution in Reinforcement Learning

    Authors: Jörg Stork, Martin Zaefferer, Thomas Bartz-Beielstein, A. E. Eiben

    Abstract: In the last years, reinforcement learning received a lot of attention. One method to solve reinforcement learning tasks is Neuroevolution, where neural networks are optimized by evolutionary algorithms. A disadvantage of Neuroevolution is that it can require numerous function evaluations, while not fully utilizing the available information from each fitness evaluation. This is especially problemat… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: This is the authors version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in Genetic and Evolutionary Computation Conference (GECCO 2019)

    Journal ref: 2019, Genetic and Evolutionary Computation Conference (GECCO 2019), Prague, Czech Republic. ACM, New York, NY, USA

  18. Prediction of neural network performance by phenotypic modeling

    Authors: Alexander Hagg, Martin Zaefferer, Jörg Stork, Adam Gaier

    Abstract: Surrogate models are used to reduce the burden of expensive-to-evaluate objective functions in optimization. By creating models which map genomes to objective values, these models can estimate the performance of unknown inputs, and so be used in place of expensive objective functions. Evolutionary techniques such as genetic programming or neuroevolution commonly alter the structure of the genome i… ▽ More

    Submitted 16 July, 2019; originally announced July 2019.

  19. arXiv:1907.02555  [pdf, other

    cs.RO

    Object Placement Planning and Optimization for Robot Manipulators

    Authors: Joshua A. Haustein, Kaiyu Hang, Johannes Stork, Danica Kragic

    Abstract: We address the problem of motion planning for a robotic manipulator with the task to place a grasped object in a cluttered environment. In this task, we need to locate a collision-free pose for the object that a) facilitates the stable placement of the object, b) is reachable by the robot manipulator and c) optimizes a user-given placement objective. Because of the placement objective, this proble… ▽ More

    Submitted 4 July, 2019; originally announced July 2019.

    Comments: 8 pages

  20. arXiv:1903.03831  [pdf, other

    cs.RO

    Data-Driven Model Predictive Control for Food-Cutting

    Authors: Ioanna Mitsioni, Yiannis Karayiannidis, Johannes A. Stork, Danica Kragic

    Abstract: Modelling of contact-rich tasks is challenging and cannot be entirely solved using classical control approaches due to the difficulty of constructing an analytic description of the contact dynamics. Additionally, in a manipulation task like food-cutting, purely learning-based methods such as Reinforcement Learning, require either a vast amount of data that is expensive to collect on a real robot,… ▽ More

    Submitted 26 September, 2019; v1 submitted 9 March, 2019; originally announced March 2019.

  21. arXiv:1902.03419  [pdf, other

    cs.NE

    Improving NeuroEvolution Efficiency by Surrogate Model-based Optimization with Phenotypic Distance Kernels

    Authors: Jörg Stork, Martin Zaefferer, Thomas Bartz-Beielstein

    Abstract: In NeuroEvolution, the topologies of artificial neural networks are optimized with evolutionary algorithms to solve tasks in data regression, data classification, or reinforcement learning. One downside of NeuroEvolution is the large amount of necessary fitness evaluations, which might render it inefficient for tasks with expensive evaluations, such as real-time learning. For these expensive optim… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

    Comments: The final authenticated version of this publication will appear in the proceedings of the Applications of Evolutionary Computation - 22nd International Conference EvoApplications 2019 in the LNCS by Springer

  22. arXiv:1901.03557  [pdf, other

    cs.RO

    Learning Manipulation States and Actions for Efficient Non-prehensile Rearrangement Planning

    Authors: Joshua A. Haustein, Isac Arnekvist, Johannes Stork, Kaiyu Hang, Danica Kragic

    Abstract: This paper addresses non-prehensile rearrangement planning problems where a robot is tasked to rearrange objects among obstacles on a planar surface. We present an efficient planning algorithm that is designed to impose few assumptions on the robot's non-prehensile manipulation abilities and is simple to adapt to different robot embodiments. For this, we combine sampling-based motion planning with… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

  23. arXiv:1810.04438  [pdf, other

    cs.RO cs.LG

    Global Search with Bernoulli Alternation Kernel for Task-oriented Gras** Informed by Simulation

    Authors: Rika Antonova, Mia Kokic, Johannes A. Stork, Danica Kragic

    Abstract: We develop an approach that benefits from large simulated datasets and takes full advantage of the limited online data that is most relevant. We propose a variant of Bayesian optimization that alternates between using informed and uninformed kernels. With this Bernoulli Alternation Kernel we ensure that discrepancies between simulation and reality do not hinder adapting robot control policies onli… ▽ More

    Submitted 10 October, 2018; originally announced October 2018.

    Comments: To appear in 2nd Conference on Robot Learning (CoRL) 2018

  24. arXiv:1809.04322  [pdf, other

    cs.RO cs.AI cs.LG

    Reinforcement Learning in Topology-based Representation for Human Body Movement with Whole Arm Manipulation

    Authors: Weihao Yuan, Kaiyu Hang, Haoran Song, Danica Kragic, Michael Y. Wang, Johannes A. Stork

    Abstract: Moving a human body or a large and bulky object can require the strength of whole arm manipulation (WAM). This type of manipulation places the load on the robot's arms and relies on global properties of the interaction to succeed---rather than local contacts such as gras** or non-prehensile pushing. In this paper, we learn to generate motions that enable WAM for holding and transporting of human… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Submitted to RA-L with ICRA 2019

  25. arXiv:1809.03548  [pdf, other

    cs.LG stat.ML

    VPE: Variational Policy Embedding for Transfer Reinforcement Learning

    Authors: Isac Arnekvist, Danica Kragic, Johannes A. Stork

    Abstract: Reinforcement Learning methods are capable of solving complex problems, but resulting policies might perform poorly in environments that are even slightly different. In robotics especially, training and deployment conditions often vary and data collection is expensive, making retraining undesirable. Simulation training allows for feasible training times, but on the other hand suffers from a realit… ▽ More

    Submitted 14 September, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

  26. A new Taxonomy of Continuous Global Optimization Algorithms

    Authors: Jörg Stork, A. E. Eiben, Thomas Bartz-Beielstein

    Abstract: Surrogate-based optimization, nature-inspired metaheuristics, and hybrid combinations have become state of the art in algorithm design for solving real-world optimization problems. Still, it is difficult for practitioners to get an overview that explains their advantages in comparison to a large number of available methods in the scope of optimization. Available taxonomies lack the embedding of cu… ▽ More

    Submitted 6 May, 2020; v1 submitted 27 August, 2018; originally announced August 2018.

    Comments: 35 pages total, 28 written pages, 4 figures, 2019 Reworked Version

    Journal ref: Natural Computing, 2020, 1-24

  27. arXiv:1807.07839  [pdf, other

    cs.NE

    Distance-based Kernels for Surrogate Model-based Neuroevolution

    Authors: Jörg Stork, Martin Zaefferer, Thomas Bartz-Beielstein

    Abstract: The topology optimization of artificial neural networks can be particularly difficult if the fitness evaluations require expensive experiments or simulations. For that reason, the optimization methods may need to be supported by surrogate models. We propose different distances for a suitable surrogate model, and compare them in a simple numerical test scenario.

    Submitted 20 July, 2018; originally announced July 2018.

    Comments: 4 pages, 1 figure. This publication was accepted to the Developmental Neural Networks Workshop of the Parallel Problem Solving from Nature 2018 (PPSN XV) conference

  28. arXiv:1807.01019  [pdf, other

    cs.NE

    Linear Combination of Distance Measures for Surrogate Models in Genetic Programming

    Authors: Martin Zaefferer, Jörg Stork, Oliver Flasch, Thomas Bartz-Beielstein

    Abstract: Surrogate models are a well established approach to reduce the number of expensive function evaluations in continuous optimization. In the context of genetic programming, surrogate modeling still poses a challenge, due to the complex genotype-phenotype relationships. We investigate how different genotypic and phenotypic distance measures can be used to learn Kriging models as surrogates. We compar… ▽ More

    Submitted 3 July, 2018; originally announced July 2018.

    Comments: The final authenticated version of this publication will appear in the proceedings of the 15th International Conference on Parallel Problem Solving from Nature 2018 (PPSN XV), published in the LNCS by Springer

  29. Rearrangement with Nonprehensile Manipulation Using Deep Reinforcement Learning

    Authors: Weihao Yuan, Johannes A. Stork, Danica Kragic, Michael Y. Wang, Kaiyu Hang

    Abstract: Rearranging objects on a tabletop surface by means of nonprehensile manipulation is a task which requires skillful interaction with the physical world. Usually, this is achieved by precisely modeling physical properties of the objects, robot, and the environment for explicit planning. In contrast, as explicitly modeling the physical environment is not always feasible and involves various uncertain… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

    Comments: 2018 International Conference on Robotics and Automation

  30. arXiv:1611.06070  [pdf, other

    cs.RO

    Rope through Loop Insertion for Robotic Knotting: A Virtual Magnetic Field Formulation

    Authors: Alejandro Marzinotto, Johannes A. Stork

    Abstract: Inserting an end of a rope through a loop is a common and important action that is required for creating most types of knots. To perform this action, we need to pass the end of the rope through an area that is enclosed by another segment of rope. As for all knotting actions, the robot must for this exercise control over a semi-compliant and flexible body whose complex 3d shape is difficult to perc… ▽ More

    Submitted 18 November, 2016; originally announced November 2016.

    Comments: 8 pages

    Report number: 978-91-7729-218-0

  31. arXiv:1510.03924  [pdf

    stat.AP cs.OH

    Comparison of different Methods for Univariate Time Series Imputation in R

    Authors: Steffen Moritz, Alexis Sardá, Thomas Bartz-Beielstein, Martin Zaefferer, Jörg Stork

    Abstract: Missing values in datasets are a well-known problem and there are quite a lot of R packages offering imputation functions. But while imputation in general is well covered within R, it is hard to find functions for imputation of univariate time series. The problem is, most standard imputation techniques can not be applied directly. Most algorithms rely on inter-attribute correlations, while univari… ▽ More

    Submitted 13 October, 2015; originally announced October 2015.