Last-iterate Convergence in Extensive-Form Games

Lee, Chung-Wei; Kroer, Christian; Luo, Haipeng

Computer Science > Machine Learning

arXiv:2106.14326 (cs)

[Submitted on 27 Jun 2021 (v1), last revised 27 Oct 2021 (this version, v2)]

Title:Last-iterate Convergence in Extensive-Form Games

Authors:Chung-Wei Lee, Christian Kroer, Haipeng Luo

View PDF

Abstract:Regret-based algorithms are highly efficient at finding approximate Nash equilibria in sequential games such as poker games. However, most regret-based algorithms, including counterfactual regret minimization (CFR) and its variants, rely on iterate averaging to achieve convergence. Inspired by recent advances on last-iterate convergence of optimistic algorithms in zero-sum normal-form games, we study this phenomenon in sequential games, and provide a comprehensive study of last-iterate convergence for zero-sum extensive-form games with perfect recall (EFGs), using various optimistic regret-minimization algorithms over treeplexes. This includes algorithms using the vanilla entropy or squared Euclidean norm regularizers, as well as their dilated versions which admit more efficient implementation. In contrast to CFR, we show that all of these algorithms enjoy last-iterate convergence, with some of them even converging exponentially fast. We also provide experiments to further support our theoretical results.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2106.14326 [cs.LG]
	(or arXiv:2106.14326v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.14326

Submission history

From: Chung-Wei Lee [view email]
[v1] Sun, 27 Jun 2021 22:02:26 UTC (373 KB)
[v2] Wed, 27 Oct 2021 07:00:17 UTC (527 KB)

Full-text links:

Access Paper:

view license

Current browse context:

< prev | next >

new | recent | 2021-06

Change to browse by:

cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Chung-Wei Lee
Christian Kroer
Haipeng Luo

export BibTeX citation

Computer Science > Machine Learning

Title:Last-iterate Convergence in Extensive-Form Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Last-iterate Convergence in Extensive-Form Games

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators