Best-Response Dynamics and Fictitious Play in Identical-Interest and Zero-Sum Stochastic Games

Baudin, Lucas; Laraki, Rida

Computer Science > Computer Science and Game Theory

arXiv:2111.04317 (cs)

[Submitted on 8 Nov 2021 (v1), last revised 16 May 2022 (this version, v2)]

Title:Best-Response Dynamics and Fictitious Play in Identical-Interest and Zero-Sum Stochastic Games

Authors:Lucas Baudin, Rida Laraki

View PDF

Abstract:This paper combines ideas from Q-learning and fictitious play to define three reinforcement learning procedures which converge to the set of stationary mixed Nash equilibria in identical interest discounted stochastic games. First, we analyse three continuous-time systems that generalize the best-response dynamics defined by Leslie et al. for zero-sum discounted stochastic games. Under some assumptions depending on the system, the dynamics are shown to converge to the set of stationary equilibria in identical interest discounted stochastic games. Then, we introduce three analog discrete-time procedures in the spirit of Sayin et al. and demonstrate their convergence to the set of stationary equilibria using our results in continuous time together with stochastic approximation techniques. Some numerical experiments complement our theoretical findings.

Comments:	Preprint, accepted at ICML 2022
Subjects:	Computer Science and Game Theory (cs.GT)
Cite as:	arXiv:2111.04317 [cs.GT]
	(or arXiv:2111.04317v2 [cs.GT] for this version)
	https://doi.org/10.48550/arXiv.2111.04317

Submission history

From: Lucas Baudin [view email]
[v1] Mon, 8 Nov 2021 08:06:57 UTC (318 KB)
[v2] Mon, 16 May 2022 11:37:12 UTC (644 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.GT

< prev | next >

new | recent | 2021-11

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

export BibTeX citation

Computer Science > Computer Science and Game Theory

Title:Best-Response Dynamics and Fictitious Play in Identical-Interest and Zero-Sum Stochastic Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Science and Game Theory

Title:Best-Response Dynamics and Fictitious Play in Identical-Interest and Zero-Sum Stochastic Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators