An $\alpha$-No-Regret Algorithm For Graphical Bilinear Bandits

Rizk, Geovani; Colin, Igor; Thomas, Albert; Laraki, Rida; Chevaleyre, Yann

Computer Science > Machine Learning

arXiv:2206.00466 (cs)

[Submitted on 1 Jun 2022 (v1), last revised 12 Oct 2022 (this version, v2)]

Title:An $α$-No-Regret Algorithm For Graphical Bilinear Bandits

Authors:Geovani Rizk, Igor Colin, Albert Thomas, Rida Laraki, Yann Chevaleyre

View PDF

Abstract:We propose the first regret-based approach to the Graphical Bilinear Bandits problem, where $n$ agents in a graph play a stochastic bilinear bandit game with each of their neighbors. This setting reveals a combinatorial NP-hard problem that prevents the use of any existing regret-based algorithm in the (bi-)linear bandit literature. In this paper, we fill this gap and present the first regret-based algorithm for graphical bilinear bandits using the principle of optimism in the face of uncertainty. Theoretical analysis of this new method yields an upper bound of $\tilde{O}(\sqrt{T})$ on the $\alpha$-regret and evidences the impact of the graph structure on the rate of convergence. Finally, we show through various experiments the validity of our approach.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2206.00466 [cs.LG]
	(or arXiv:2206.00466v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.00466

Submission history

From: Geovani Rizk [view email]
[v1] Wed, 1 Jun 2022 12:55:17 UTC (399 KB)
[v2] Wed, 12 Oct 2022 13:26:59 UTC (399 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2022-06

Change to browse by:

cs
stat
stat.ML

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:An $α$-No-Regret Algorithm For Graphical Bilinear Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:An $α$-No-Regret Algorithm For Graphical Bilinear Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators