Skip to main content

Showing 1–1 of 1 results for author: Moazehi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:1401.0843  [pdf, other

    math.OC cs.LG

    Least Squares Policy Iteration with Instrumental Variables vs. Direct Policy Search: Comparison Against Optimal Benchmarks Using Energy Storage

    Authors: Warren R. Scott, Warren B. Powell, Somayeh Moazehi

    Abstract: This paper studies approximate policy iteration (API) methods which use least-squares Bellman error minimization for policy evaluation. We address several of its enhancements, namely, Bellman error minimization using instrumental variables, least-squares projected Bellman error minimization, and projected Bellman error minimization using instrumental variables. We prove that for a general discrete… ▽ More

    Submitted 4 January, 2014; originally announced January 2014.

    Comments: 37 pages, 9 figures