Skip to main content

Showing 1–1 of 1 results for author: Claeys, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2005.02209  [pdf, ps, other

    cs.LG stat.ML

    Hyper-parameter Tuning for the Contextual Bandit

    Authors: Djallel Bouneffouf, Emmanuelle Claeys

    Abstract: We study here the problem of learning the exploration exploitation trade-off in the contextual bandit problem with linear reward function setting. In the traditional algorithms that solve the contextual bandit problem, the exploration is a parameter that is tuned by the user. However, our proposed algorithm learn to choose the right exploration parameters in an online manner based on the observed… ▽ More

    Submitted 4 May, 2020; originally announced May 2020.

    Comments: arXiv admin note: text overlap with arXiv:1705.03821