-
Limited Information Shared Control: A Potential Game Approach
Authors:
Balint Varga,
Jairo Inga,
Soeren Hohmann
Abstract:
This paper presents a systematic method for the design of a limited information shared control (LISC). LISC is used in applications where not all system states or reference trajectories are measurable by the automation. Typical examples are partially human-controlled systems, in which some subsystems are fully controlled by automation while others are controlled by a human. The proposed systematic design method uses a novel class of games to model human-machine interaction: the near potential differential games (NPDG). We provide a necessary and sufficient condition for the existence of an NPDG and derive an algorithm for finding an NPDG that completely describes a given differential game. The proposed design method is applied to the control of a large vehicle-manipulator system, in which the manipulator is controlled by a human operator and the vehicle is fully automated. The suitability of the NPDG to model differential games is verified in simulations, leading to a faster and more accurate controller design compared to manual tuning. Furthermore, the overall design process is validated in a study with sixteen test subjects, indicating the applicability of the proposed concept in real applications.
Submitted 1 June, 2022; v1 submitted 17 January, 2022;
originally announced January 2022.
-
Adaptive Optimal Trajectory Tracking Control Applied to a Large-Scale Ball-on-Plate System
Authors:
Florian Köpf,
Sean Kille,
Jairo Inga,
Sören Hohmann
Abstract:
While many theoretical works concerning Adaptive Dynamic Programming (ADP) have been proposed, application results are scarce. Therefore, we design an ADP-based optimal trajectory tracking controller and apply it to a large-scale ball-on-plate system. Our proposed method incorporates an approximated reference trajectory instead of using setpoint tracking and automatically compensates for constant offset terms. Due to the off-policy characteristics of the algorithm, the method requires only a small amount of measured data to train the controller. Our experimental results show that this tracking mechanism significantly reduces the control cost compared to setpoint controllers. Furthermore, a comparison with a model-based optimal controller highlights the benefits of our model-free data-based ADP tracking controller, which requires neither a system model nor manual tuning; instead, the controller is tuned automatically using measured data.
Submitted 25 January, 2021; v1 submitted 26 October, 2020;
originally announced October 2020.
-
Multi-Robot Task Allocation and Scheduling Considering Cooperative Tasks and Precedence Constraints
Authors:
Esther Bischoff,
Fabian Meyer,
Jairo Inga,
Sören Hohmann
Abstract:
In order to fully exploit the advantages inherent to cooperating heterogeneous multi-robot teams, sophisticated coordination algorithms are essential. Time-extended multi-robot task allocation approaches assign and schedule a set of tasks to a group of robots such that certain objectives are optimized and operational constraints are met. This is particularly challenging if cooperative tasks, i.e. tasks that require two or more robots to work directly together, are considered. In this paper, we present an easy-to-implement criterion to validate the feasibility, i.e. executability, of solutions to time-extended multi-robot task allocation problems with cross-schedule dependencies arising from the consideration of cooperative tasks and precedence constraints. Using the introduced feasibility criterion, we propose a local improvement heuristic based on a neighborhood operator for the problem class under consideration. The initial solution is obtained by a greedy constructive heuristic. Both methods use a generalized cost structure and are therefore able to handle various objective function instances. We evaluate the proposed approach using test scenarios of different problem sizes, all comprising the complexity aspects of the regarded problem. The simulation results illustrate the improvement potential arising from the application of the local improvement heuristic.
Submitted 8 May, 2020;
originally announced May 2020.
-
Inverse Dynamic Games Based on Maximum Entropy Inverse Reinforcement Learning
Authors:
Jairo Inga,
Esther Bischoff,
Florian Köpf,
Sören Hohmann
Abstract:
We consider the inverse problem of dynamic games, where cost function parameters are sought which explain observed behavior of interacting players. Maximum entropy inverse reinforcement learning is extended to the N-player case in order to solve inverse dynamic games with continuous-valued state and control spaces. We present methods for identification of cost function parameters from observed data which correspond to (i) a Pareto efficient solution, (ii) an open-loop Nash equilibrium or (iii) a feedback Nash equilibrium. Furthermore, we give results on the unbiasedness of the estimation of cost function parameters for each arising class of inverse dynamic game. The applicability of the methods is demonstrated with simulation examples of a nonlinear and a linear-quadratic dynamic game.
Submitted 24 July, 2020; v1 submitted 18 November, 2019;
originally announced November 2019.