Skip to main content

Showing 1–1 of 1 results for author: Sankararama, K A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2011.01488  [pdf, other

    cs.LG cs.AI

    Multi-armed Bandits with Cost Subsidy

    Authors: Deeksha Sinha, Karthik Abinav Sankararama, Abbas Kazerouni, Vashist Avadhanula

    Abstract: In this paper, we consider a novel variant of the multi-armed bandit (MAB) problem, MAB with cost subsidy, which models many real-life applications where the learning agent has to pay to select an arm and is concerned about optimizing cumulative costs and rewards. We present two applications, intelligent SMS routing problem and ad audience optimization problem faced by several businesses (especial… ▽ More

    Submitted 15 March, 2021; v1 submitted 3 November, 2020; originally announced November 2020.