Skip to main content

Showing 1–1 of 1 results for author: Munos, R e

.
  1. arXiv:1506.04782  [pdf, other

    cs.LG

    Cheap Bandits

    Authors: Manjesh Kumar Hanawal, Venkatesh Saligrama, Michal Valko, R\' emi Munos

    Abstract: We consider stochastic sequential learning problems where the learner can observe the \textit{average reward of several actions}. Such a setting is interesting in many applications involving monitoring and surveillance, where the set of the actions to observe represent some (geographical) area. The importance of this setting is that in these applications, it is actually \textit{cheaper} to observe… ▽ More

    Submitted 18 June, 2015; v1 submitted 15 June, 2015; originally announced June 2015.

    Comments: To be presented at ICML 2015