Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Amani, Sanae; Lattimore, Tor; György, András; Yang, Lin F.

Computer Science > Machine Learning

arXiv:2205.13170 (cs)

[Submitted on 26 May 2022 (v1), last revised 8 Dec 2022 (this version, v2)]

Title:Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Authors:Sanae Amani, Tor Lattimore, András György, Lin F. Yang

View PDF

Abstract:We study distributed contextual linear bandits with stochastic contexts, where $N$ agents act cooperatively to solve a linear bandit-optimization problem with $d$-dimensional features over the course of $T$ rounds. For this problem, we derive the first ever information-theoretic lower bound $\Omega(dN)$ on the communication cost of any algorithm that performs optimally in a regret minimization setup. We then propose a distributed batch elimination version of the LinUCB algorithm, DisBE-LUCB, where the agents share information among each other through a central server. We prove that the communication cost of DisBE-LUCB matches our lower bound up to logarithmic factors. In particular, for scenarios with known context distribution, the communication cost of DisBE-LUCB is only $\tilde{\mathcal{O}}(dN)$ and its regret is ${\tilde{\mathcal{O}}}(\sqrt{dNT})$, which is of the same order as that incurred by an optimal single-agent algorithm for $NT$ rounds. We also provide similar bounds for practical settings where the context distribution can only be estimated. Therefore, our proposed algorithm is nearly minimax optimal in terms of \emph{both regret and communication cost}. Finally, we propose DecBE-LUCB, a fully decentralized version of DisBE-LUCB, which operates without a central server, where agents share information with their \emph{immediate neighbors} through a carefully designed consensus procedure.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2205.13170 [cs.LG]
	(or arXiv:2205.13170v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.13170

Submission history

From: Sanae Amani [view email]
[v1] Thu, 26 May 2022 05:56:23 UTC (95 KB)
[v2] Thu, 8 Dec 2022 04:25:12 UTC (123 KB)

Computer Science > Machine Learning

Title:Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators