Computer Science > Information Theory
[Submitted on 28 May 2020 (v1), revised 16 Aug 2021 (this version, v3), latest version 4 Jan 2023 (v5)]
Title:Task-Based Information Compression for Multi-Agent Communication Problems with Channel Rate Constraints
View PDFAbstract:A collaborative task is assigned to a multiagent system (MAS) in which agents are allowed to communicate. The MAS runs over an underlying Markov decision process and its task is to maximize the averaged sum of discounted one-stage rewards. Although knowing the global state of the environment is necessary for the optimal action selection of the MAS, agents are limited to individual observations. Inter-agent communication can tackle the issue of local observability, however, the limited rate of inter-agent communication prevents the agents from acquiring the precise global state information. To overcome this challenge, agents need to communicate an abstract version of their observations to each other such that the MAS compromises the minimum possible sum of rewards. We show that this problem is equivalent to a form of rate-distortion problem, which we call task-based information compression (TBIC). We introduce state aggregation for information compression (SAIC) to solve the TBIC problem. SAIC is shown to achieve near-optimal performance in terms of the achieved sum of discounted rewards. The proposed algorithm is applied to a rendezvous problem and its performance is compared with several benchmarks. Numerical experiments confirm the superiority of the proposed algorithm.
Submission history
From: Arsham Mostaani [view email][v1] Thu, 28 May 2020 18:29:21 UTC (1,149 KB)
[v2] Tue, 27 Jul 2021 14:27:26 UTC (5,075 KB)
[v3] Mon, 16 Aug 2021 12:14:53 UTC (5,075 KB)
[v4] Mon, 31 Jan 2022 11:31:07 UTC (5,305 KB)
[v5] Wed, 4 Jan 2023 14:13:00 UTC (5,597 KB)
Bibliographic and Citation Tools
Bibliographic Explorer (What is the Explorer?)
Litmaps (What is Litmaps?)
scite Smart Citations (What are Smart Citations?)
Code, Data and Media Associated with this Article
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub (What is DagsHub?)
Gotit.pub (What is GotitPub?)
Papers with Code (What is Papers with Code?)
ScienceCast (What is ScienceCast?)
Demos
Recommenders and Search Tools
Influence Flower (What are Influence Flowers?)
Connected Papers (What is Connected Papers?)
CORE Recommender (What is CORE?)
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.