-
Generation of Geodesics with Actor-Critic Reinforcement Learning to Predict Midpoints
Authors:
Kazumi Kasaura
Abstract:
To find the shortest paths for all pairs on continuous manifolds with infinitesimally defined metrics, we propose to generate them by predicting midpoints recursively and an actor-critic method to learn midpoint prediction. We prove the soundness of our approach and show experimentally that the proposed method outperforms existing methods on both local and global path planning tasks.
To find the shortest paths for all pairs on continuous manifolds with infinitesimally defined metrics, we propose to generate them by predicting midpoints recursively and an actor-critic method to learn midpoint prediction. We prove the soundness of our approach and show experimentally that the proposed method outperforms existing methods on both local and global path planning tasks.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Swarm Body: Embodied Swarm Robots
Authors:
Sosuke Ichihashi,
So Kuroki,
Mai Nishimura,
Kazumi Kasaura,
Takefumi Hiraki,
Kazutoshi Tanaka,
Shigeo Yoshida
Abstract:
The human brain's plasticity allows for the integration of artificial body parts into the human body. Leveraging this, embodied systems realize intuitive interactions with the environment. We introduce a novel concept: embodied swarm robots. Swarm robots constitute a collective of robots working in harmony to achieve a common objective, in our case, serving as functional body parts. Embodied swarm…
▽ More
The human brain's plasticity allows for the integration of artificial body parts into the human body. Leveraging this, embodied systems realize intuitive interactions with the environment. We introduce a novel concept: embodied swarm robots. Swarm robots constitute a collective of robots working in harmony to achieve a common objective, in our case, serving as functional body parts. Embodied swarm robots can dynamically alter their shape, density, and the correspondences between body parts and individual robots. We contribute an investigation of the influence on embodiment of swarm robot-specific factors derived from these characteristics, focusing on a hand. Our paper is the first to examine these factors through virtual reality (VR) and real-world robot studies to provide essential design considerations and applications of embodied swarm robots. Through quantitative and qualitative analysis, we identified a system configuration to achieve the embodiment of swarm robots.
△ Less
Submitted 29 February, 2024; v1 submitted 24 February, 2024;
originally announced February 2024.
-
Homotopy-Aware Multi-Agent Path Planning in Plane
Authors:
Kazumi Kasaura
Abstract:
We propose an efficient framework using the Dynnikov coordinates for homotopy-aware multi-agent path planning in the plane. We developed a method to generate multiple homotopically distinct solutions of multi-agent path planning problem in the plane by combining our framework with revised prioritized planning and proved its completeness in the grid world under specific assumptions. Experimentally,…
▽ More
We propose an efficient framework using the Dynnikov coordinates for homotopy-aware multi-agent path planning in the plane. We developed a method to generate multiple homotopically distinct solutions of multi-agent path planning problem in the plane by combining our framework with revised prioritized planning and proved its completeness in the grid world under specific assumptions. Experimentally, we demonstrated the scalability of our method for the number of agents. We also confirmed experimentally that homotopy-aware planning contributes to avoiding locally optimal solutions when searching for low-cost trajectories for a swarm of agents in a continuous environment.
△ Less
Submitted 30 May, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints
Authors:
Kazumi Kasaura,
Shuwa Miura,
Tadashi Kozuno,
Ryo Yonetani,
Kenta Hoshino,
Yohei Hosoe
Abstract:
This study presents a benchmark for evaluating action-constrained reinforcement learning (RL) algorithms. In action-constrained RL, each action taken by the learning system must comply with certain constraints. These constraints are crucial for ensuring the feasibility and safety of actions in real-world systems. We evaluate existing algorithms and their novel variants across multiple robotics con…
▽ More
This study presents a benchmark for evaluating action-constrained reinforcement learning (RL) algorithms. In action-constrained RL, each action taken by the learning system must comply with certain constraints. These constraints are crucial for ensuring the feasibility and safety of actions in real-world systems. We evaluate existing algorithms and their novel variants across multiple robotics control environments, encompassing multiple action constraint types. Our evaluation provides the first in-depth perspective of the field, revealing surprising insights, including the effectiveness of a straightforward baseline approach. The benchmark problems and associated code utilized in our experiments are made available online at github.com/omron-sinicx/action-constrained-RL-benchmark for further research and development.
△ Less
Submitted 29 May, 2023; v1 submitted 18 April, 2023;
originally announced April 2023.
-
Periodic Multi-Agent Path Planning
Authors:
Kazumi Kasaura,
Ryo Yonetani,
Mai Nishimura
Abstract:
Multi-agent path planning (MAPP) is the problem of planning collision-free trajectories from start to goal locations for a team of agents. This work explores a relatively unexplored setting of MAPP where streams of agents have to go through the starts and goals with high throughput. We tackle this problem by formulating a new variant of MAPP called periodic MAPP in which the timing of agent appear…
▽ More
Multi-agent path planning (MAPP) is the problem of planning collision-free trajectories from start to goal locations for a team of agents. This work explores a relatively unexplored setting of MAPP where streams of agents have to go through the starts and goals with high throughput. We tackle this problem by formulating a new variant of MAPP called periodic MAPP in which the timing of agent appearances is periodic. The objective with periodic MAPP is to find a periodic plan, a set of collision-free trajectories that the agent streams can use repeatedly over periods, with periods that are as small as possible. To meet this objective, we propose a solution method that is based on constraint relaxation and optimization. We show that the periodic plans once found can be used for a more practical case in which agents in a stream can appear at random times. We confirm the effectiveness of our method compared with baseline methods in terms of throughput in several scenarios that abstract autonomous intersection management tasks.
△ Less
Submitted 29 May, 2023; v1 submitted 25 January, 2023;
originally announced January 2023.
-
On extension of overconvergent log isocrystals on log smooth varieties
Authors:
Kazumi Kasaura
Abstract:
By works of Kedlaya and Shiho, it is known that, for a smooth variety $\overline{X}$ over a field of positive characteristic and its simple normal crossing divisor $Z$, an overconvergent isocrystal on the compliment of $Z$ satisfying a certain monodromy condition can be extended to a convergent log isocrystal on $\left(\overline{X}, \mathcal{M}_Z\right)$, where $\mathcal{M}_Z$ is the log structure…
▽ More
By works of Kedlaya and Shiho, it is known that, for a smooth variety $\overline{X}$ over a field of positive characteristic and its simple normal crossing divisor $Z$, an overconvergent isocrystal on the compliment of $Z$ satisfying a certain monodromy condition can be extended to a convergent log isocrystal on $\left(\overline{X}, \mathcal{M}_Z\right)$, where $\mathcal{M}_Z$ is the log structure associated to $Z$. We prove a generalization of this result: for a log smooth variety $\left(\overline{X},\mathcal{M}\right)$ satisfying some conditions, an overconvergent log isocrystal on the trivial locus of a direct summand of $\mathcal{M}$ satisfying a certain monodromy condition can be extended to a convergent log isocrystal on $\left(\overline{X}, \mathcal{M}\right)$.
△ Less
Submitted 25 June, 2020;
originally announced June 2020.