Skip to main content

Showing 1–2 of 2 results for author: Osband, I

Searching in archive eess. Search in all archives.
.
  1. arXiv:1602.04621  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Deep Exploration via Bootstrapped DQN

    Authors: Ian Osband, Charles Blundell, Alexander Pritzel, Benjamin Van Roy

    Abstract: Efficient exploration in complex environments remains a major challenge for reinforcement learning. We propose bootstrapped DQN, a simple algorithm that explores in a computationally and statistically efficient manner through use of randomized value functions. Unlike dithering strategies such as epsilon-greedy exploration, bootstrapped DQN carries out temporally-extended (or deep) exploration; thi… ▽ More

    Submitted 4 July, 2016; v1 submitted 15 February, 2016; originally announced February 2016.

  2. arXiv:1402.0635  [pdf, other

    stat.ML cs.AI cs.LG eess.SY

    Generalization and Exploration via Randomized Value Functions

    Authors: Ian Osband, Benjamin Van Roy, Zheng Wen

    Abstract: We propose randomized least-squares value iteration (RLSVI) -- a new reinforcement learning algorithm designed to explore and generalize efficiently via linearly parameterized value functions. We explain why versions of least-squares value iteration that use Boltzmann or epsilon-greedy exploration can be highly inefficient, and we present computational results that demonstrate dramatic efficiency… ▽ More

    Submitted 15 February, 2016; v1 submitted 4 February, 2014; originally announced February 2014.

    Comments: arXiv admin note: text overlap with arXiv:1307.4847