Stochastic Bandits with Vector Losses: Minimizing $\ell^\infty$-Norm of Relative Losses

Shang, Xuedong; Shao, Han; Qian, Jian

Computer Science > Machine Learning

arXiv:2010.08061 (cs)

[Submitted on 15 Oct 2020]

Title:Stochastic Bandits with Vector Losses: Minimizing $\ell^\infty$-Norm of Relative Losses

Authors:Xuedong Shang, Han Shao, Jian Qian

View PDF

Abstract:Multi-armed bandits are widely applied in scenarios like recommender systems, for which the goal is to maximize the click rate. However, more factors should be considered, e.g., user stickiness, user growth rate, user experience assessment, etc. In this paper, we model this situation as a problem of $K$-armed bandit with multiple losses. We define relative loss vector of an arm where the $i$-th entry compares the arm and the optimal arm with respect to the $i$-th loss. We study two goals: (a) finding the arm with the minimum $\ell^\infty$-norm of relative losses with a given confidence level (which refers to fixed-confidence best-arm identification); (b) minimizing the $\ell^\infty$-norm of cumulative relative losses (which refers to regret minimization). For goal (a), we derive a problem-dependent sample complexity lower bound and discuss how to achieve matching algorithms. For goal (b), we provide a regret lower bound of $\Omega(T^{2/3})$ and provide a matching algorithm.

Comments:	14 pages
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2010.08061 [cs.LG]
	(or arXiv:2010.08061v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2010.08061

Submission history

From: Xuedong Shang [view email]
[v1] Thu, 15 Oct 2020 23:03:35 UTC (242 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-10

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Han Shao
Jian Qian

export BibTeX citation

Computer Science > Machine Learning

Title:Stochastic Bandits with Vector Losses: Minimizing $\ell^\infty$-Norm of Relative Losses

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Stochastic Bandits with Vector Losses: Minimizing $\ell^\infty$-Norm of Relative Losses

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators