Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control

Qiu, Yunbo; **, Yue; Wang, Jian; Zhang, Xudong

Computer Science > Machine Learning

arXiv:2209.08347 (cs)

[Submitted on 17 Sep 2022]

Title:Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control

Authors:Yunbo Qiu, Yue **, Jian Wang, Xudong Zhang

View PDF

Abstract:Flocking control is a challenging problem, where multiple agents, such as drones or vehicles, need to reach a target position while maintaining the flock and avoiding collisions with obstacles and collisions among agents in the environment. Multi-agent reinforcement learning has achieved promising performance in flocking control. However, methods based on traditional reinforcement learning require a considerable number of interactions between agents and the environment. This paper proposes a sub-optimal policy aided multi-agent reinforcement learning algorithm (SPA-MARL) to boost sample efficiency. SPA-MARL directly leverages a prior policy that can be manually designed or solved with a non-learning method to aid agents in learning, where the performance of the policy can be sub-optimal. SPA-MARL recognizes the difference in performance between the sub-optimal policy and itself, and then imitates the sub-optimal policy if the sub-optimal policy is better. We leverage SPA-MARL to solve the flocking control problem. A traditional control method based on artificial potential fields is used to generate a sub-optimal policy. Experiments demonstrate that SPA-MARL can speed up the training process and outperform both the MARL baseline and the used sub-optimal policy.

Comments:	Accepted by IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2022
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Robotics (cs.RO)
Cite as:	arXiv:2209.08347 [cs.LG]
	(or arXiv:2209.08347v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.08347

Submission history

From: Yunbo Qiu [view email]
[v1] Sat, 17 Sep 2022 15:10:49 UTC (349 KB)

Computer Science > Machine Learning

Title:Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators