-
ARMCHAIR: integrated inverse reinforcement learning and model predictive control for human-robot collaboration
Authors:
Angelo Caregnato-Neto,
Luciano Cavalcante Siebert,
Arkady Zgonnikov,
Marcos Ricardo Omena de Albuquerque Maximo,
Rubens Junqueira Magalhães Afonso
Abstract:
One of the key issues in human-robot collaboration is the development of computational models that allow robots to predict and adapt to human behavior. Much progress has been achieved in develo** such models, as well as control techniques that address the autonomy problems of motion planning and decision-making in robotics. However, the integration of computational models of human behavior with…
▽ More
One of the key issues in human-robot collaboration is the development of computational models that allow robots to predict and adapt to human behavior. Much progress has been achieved in develo** such models, as well as control techniques that address the autonomy problems of motion planning and decision-making in robotics. However, the integration of computational models of human behavior with such control techniques still poses a major challenge, resulting in a bottleneck for efficient collaborative human-robot teams. In this context, we present a novel architecture for human-robot collaboration: Adaptive Robot Motion for Collaboration with Humans using Adversarial Inverse Reinforcement learning (ARMCHAIR). Our solution leverages adversarial inverse reinforcement learning and model predictive control to compute optimal trajectories and decisions for a mobile multi-robot system that collaborates with a human in an exploration task. During the mission, ARMCHAIR operates without human intervention, autonomously identifying the necessity to support and acting accordingly. Our approach also explicitly addresses the network connectivity requirement of the human-robot team. Extensive simulation-based evaluations demonstrate that ARMCHAIR allows a group of robots to safely support a simulated human in an exploration scenario, preventing collisions and network disconnections, and improving the overall performance of the task.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Data-driven Semi-supervised Machine Learning with Surrogate Safety Measures for Abnormal Driving Behavior Detection
Authors:
Yongqi Dong,
Lanxin Zhang,
Haneen Farah,
Arkady Zgonnikov,
Bart van Arem
Abstract:
Detecting abnormal driving behavior is critical for road traffic safety and the evaluation of drivers' behavior. With the advancement of machine learning (ML) algorithms and the accumulation of naturalistic driving data, many ML models have been adopted for abnormal driving behavior detection. Most existing ML-based detectors rely on (fully) supervised ML methods, which require substantial labeled…
▽ More
Detecting abnormal driving behavior is critical for road traffic safety and the evaluation of drivers' behavior. With the advancement of machine learning (ML) algorithms and the accumulation of naturalistic driving data, many ML models have been adopted for abnormal driving behavior detection. Most existing ML-based detectors rely on (fully) supervised ML methods, which require substantial labeled data. However, ground truth labels are not always available in the real world, and labeling large amounts of data is tedious. Thus, there is a need to explore unsupervised or semi-supervised methods to make the anomaly detection process more feasible and efficient. To fill this research gap, this study analyzes large-scale real-world data revealing several abnormal driving behaviors (e.g., sudden acceleration, rapid lane-changing) and develops a Hierarchical Extreme Learning Machines (HELM) based semi-supervised ML method using partly labeled data to accurately detect the identified abnormal driving behaviors. Moreover, previous ML-based approaches predominantly utilize basic vehicle motion features (such as velocity and acceleration) to label and detect abnormal driving behaviors, while this study seeks to introduce Surrogate Safety Measures (SSMs) as the input features for ML models to improve the detection performance. Results from extensive experiments demonstrate the effectiveness of the proposed semi-supervised ML model with the introduced SSMs serving as important features. The proposed semi-supervised ML method outperforms other baseline semi-supervised or unsupervised methods regarding various metrics, e.g., delivering the best accuracy at 99.58% and the best F-1 measure at 0.9913. The ablation study further highlights the significance of SSMs for advancing detection performance.
△ Less
Submitted 24 May, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Bistable dynamics of control activation in human intermittent control
Authors:
Arkady Zgonnikov,
Ihor Lubashevsky
Abstract:
When facing a task of balancing a dynamic system near an unstable equilibrium, humans often adopt intermittent control strategy: instead of continuously controlling the system, they repeatedly switch the control on and off. Paradigmatic example of such a task is stick balancing. Despite the simplicity of the task itself, the complexity of human intermittent control dynamics in stick balancing stil…
▽ More
When facing a task of balancing a dynamic system near an unstable equilibrium, humans often adopt intermittent control strategy: instead of continuously controlling the system, they repeatedly switch the control on and off. Paradigmatic example of such a task is stick balancing. Despite the simplicity of the task itself, the complexity of human intermittent control dynamics in stick balancing still puzzles researchers in motor control. Here we attempt to model one of the key mechanisms of human intermittent control, control activation, using as an example the task of overdamped stick balancing. In so doing, we focus on the concept of noise-driven activation, a more general alternative to the conventional threshold-driven activation. We describe control activation as a random walk in an energy potential, which changes in response to the state of the controlled system. By way of numerical simulations, we show that the developed model captures the core properties of human control activation observed previously in the experiments on overdamped stick balancing. Our results demonstrate that the double-well potential model provides tractable mathematical description of human control activation at least in the considered task, and suggest that the adopted approach can potentially aid in understanding human intermittent control in more complex processes.
△ Less
Submitted 26 March, 2015; v1 submitted 19 November, 2014;
originally announced November 2014.
-
To react or not to react? Intrinsic stochasticity of human control in virtual stick balancing
Authors:
Arkady Zgonnikov,
Ihor Lubashevsky,
Shigeru Kanemoto,
Toru Miyazawa,
Takashi Suzuki
Abstract:
Understanding how humans control unstable systems is central to many research problems, with applications ranging from quiet standing to aircraft landing. Increasingly much evidence appears in favor of event-driven control hypothesis: human operators only start actively controlling the system when the discrepancy between the current and desired system states becomes large enough. The event-driven…
▽ More
Understanding how humans control unstable systems is central to many research problems, with applications ranging from quiet standing to aircraft landing. Increasingly much evidence appears in favor of event-driven control hypothesis: human operators only start actively controlling the system when the discrepancy between the current and desired system states becomes large enough. The event-driven models based on the concept of threshold can explain many features of the experimentally observed dynamics. However, much still remains unclear about the dynamics of human-controlled systems, which likely indicates that humans employ more intricate control mechanisms. The present paper argues that control activation in humans may be not threshold-driven, but instead intrinsically stochastic, noise-driven. Specifically, we suggest that control activation stems from stochastic interplay between the operator's need to keep the controlled system near the goal state on one hand and the tendency to postpone interrupting the system dynamics on the other hand. We propose a model capturing this interplay and show that it matches the experimental data on human balancing of virtual overdamped stick. Our results illuminate that the noise-driven activation mechanism plays a crucial role at least in the considered task, and, hypothetically, in a broad range of human-controlled processes.
△ Less
Submitted 15 June, 2014; v1 submitted 12 February, 2014;
originally announced February 2014.