Satisficing Paths and Independent Multi-Agent Reinforcement Learning in Stochastic Games

Yongacoglu, Bora; Arslan, Gürdal; Yüksel, Serdar

doi:10.1137/22M1515112

arXiv:2110.04638 (cs)

[Submitted on 9 Oct 2021 (v1), last revised 20 Feb 2023 (this version, v4)]

Title: Satisficing Paths and Independent Multi-Agent Reinforcement Learning in Stochastic Games

Authors: Bora Yongacoglu, Gürdal Arslan, Serdar Yüksel

Abstract: In multi-agent reinforcement learning (MARL), independent learners are those that do not observe the actions of other agents in the system. Due to the decentralization of information, it is challenging to design independent learners that drive play to equilibrium. This paper investigates the feasibility of using satisficing dynamics to guide independent learners to approximate equilibrium in stochastic games. For $\epsilon \geq 0$, an $\epsilon$-satisficing policy update rule is any rule that instructs the agent to not change its policy when it is $\epsilon$-best-responding to the policies of the remaining players; $\epsilon$-satisficing paths are defined to be sequences of joint policies obtained when each agent uses some $\epsilon$-satisficing policy update rule to select its next policy. We establish structural results on the existence of $\epsilon$-satisficing paths into $\epsilon$-equilibrium in both symmetric $N$-player games and general stochastic games with two players. We then present an independent learning algorithm for $N$-player symmetric games and give high probability guarantees of convergence to $\epsilon$-equilibrium under self-play. This guarantee is made using symmetry alone, leveraging the previously unexploited structure of $\epsilon$-satisficing paths.
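Illustration (not from the paper): the definition of an $\epsilon$-satisficing update rule can be made concrete with a minimal sketch. It assumes a much simpler setting than the abstract's stochastic games: a stateless two-player matrix game with pure strategies, in which each player knows its own payoff table. The names `payoff_gap`, `is_satisfied`, `satisficing_step`, `satisficing_path`, and `COORDINATION` are hypothetical, and resampling uniformly at random when unsatisfied is only one of many rules the definition permits. This is not the paper's learning algorithm.

```python
import random

# Assumed toy game: payoffs[player][a0][a1] for a 2x2 coordination game.
COORDINATION = (
    [[1.0, 0.0], [0.0, 1.0]],  # player 0
    [[1.0, 0.0], [0.0, 1.0]],  # player 1
)


def payoff_gap(payoffs, player, joint):
    """Gain available to `player` from a unilateral deviation at `joint`."""
    a0, a1 = joint
    table = payoffs[player]
    current = table[a0][a1]
    if player == 0:
        best = max(table[a][a1] for a in range(len(table)))
    else:
        best = max(table[a0][a] for a in range(len(table[0])))
    return best - current


def is_satisfied(payoffs, player, joint, eps):
    """True when `player` is eps-best-responding to the other player."""
    return payoff_gap(payoffs, player, joint) <= eps


def satisficing_step(payoffs, joint, eps, rng):
    """Satisfied players keep their action; unsatisfied ones resample.

    Uniform resampling is an assumption made for illustration only.
    """
    n_actions = (len(payoffs[0]), len(payoffs[0][0]))
    nxt = list(joint)
    for player in (0, 1):
        if not is_satisfied(payoffs, player, joint, eps):
            nxt[player] = rng.randrange(n_actions[player])
    return tuple(nxt)


def satisficing_path(payoffs, start, eps, rng, max_steps=1000):
    """Follow satisficing steps until an eps-equilibrium or the step cap."""
    path = [start]
    for _ in range(max_steps):
        if all(is_satisfied(payoffs, p, path[-1], eps) for p in (0, 1)):
            break
        path.append(satisficing_step(payoffs, path[-1], eps, rng))
    return path
```

Calling `satisficing_path(COORDINATION, (0, 1), 0.0, random.Random(1))` starts from a mis-coordinated joint action and returns a sequence in which no satisfied player ever changes its action, which is the defining property of a satisficing path in this toy setting.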

Subjects:	Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:2110.04638 [cs.GT]
	(or arXiv:2110.04638v4 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2110.04638
Journal reference:	SIAM Journal on Mathematics of Data Science, vol 5, no. 3, pp. 745-773, Aug 2023
Related DOI:	https://doi.org/10.1137/22M1515112

Submission history

From: Bora Yongacoglu
[v1] Sat, 9 Oct 2021 19:57:21 UTC (73 KB)
[v2] Fri, 21 Jan 2022 16:43:02 UTC (72 KB)
[v3] Tue, 9 Aug 2022 04:04:49 UTC (89 KB)
[v4] Mon, 20 Feb 2023 00:01:29 UTC (95 KB)
