Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

Komiyama, Junpei

Statistics > Machine Learning

arXiv:2202.05193 (stat)

[Submitted on 10 Feb 2022 (v1), last revised 15 Apr 2024 (this version, v3)]

Title:Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

Authors:Junpei Komiyama

View PDF HTML (experimental)

Abstract:We consider the fixed-budget best arm identification problem with rewards following normal distributions. In this problem, the forecaster is given $K$ arms (or treatments) and $T$ time steps. The forecaster attempts to find the arm with the largest mean, via an adaptive experiment conducted using an algorithm. The algorithm's performance is evaluated by simple regret, reflecting the quality of the estimated best arm. While frequentist simple regret can decrease exponentially with respect to $T$, Bayesian simple regret decreases polynomially. This paper demonstrates that the Bayes optimal algorithm, which minimizes the Bayesian simple regret, does not yield an exponential decrease in simple regret under certain parameter settings. This contrasts with the numerous findings that suggest the asymptotic equivalence of Bayesian and frequentist approaches in fixed sampling regimes. Although the Bayes optimal algorithm is formulated as a recursive equation that is virtually impossible to compute exactly, we lay the groundwork for future research by introducing a novel concept termed the expected Bellman improvement.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
Cite as:	arXiv:2202.05193 [stat.ML]
	(or arXiv:2202.05193v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2202.05193

Submission history

From: Junpei Komiyama [view email]
[v1] Thu, 10 Feb 2022 17:50:26 UTC (70 KB)
[v2] Fri, 19 Aug 2022 15:15:33 UTC (94 KB)
[v3] Mon, 15 Apr 2024 02:46:34 UTC (80 KB)

Statistics > Machine Learning

Title:Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators