Showing 1–1 of 1 results for author: Lim, E Y Y

Search v0.5.6 released 2020-02-24

arXiv:2105.06960

cs.LG stat.ML

Thompson Sampling for Gaussian Entropic Risk Bandits

Authors: Ming Liang Ang, Eloise Y. Y. Lim, Joel Q. L. Chang

Abstract: The multi-armed bandit (MAB) problem is a ubiquitous decision-making problem that exemplifies the exploration-exploitation tradeoff. Standard formulations exclude risk in decision making. Risk notably complicates the basic reward-maximising objectives, in part because there is no universally agreed definition of it. In this paper, we consider an entropic risk (ER) measure and explore the performance of a Thompson sampling-based algorithm, ERTS, under this risk measure by providing regret bounds for ERTS and corresponding instance-dependent lower bounds.

Submitted 14 May, 2021; originally announced May 2021.

Comments: arXiv admin note: text overlap with arXiv:2011.08046
