Skip to main content

Showing 1–1 of 1 results for author: Kaledin, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2002.01268  [pdf, other

    stat.ML cs.LG

    Finite Time Analysis of Linear Two-timescale Stochastic Approximation with Markovian Noise

    Authors: Maxim Kaledin, Eric Moulines, Alexey Naumov, Vladislav Tadic, Hoi-To Wai

    Abstract: Linear two-timescale stochastic approximation (SA) scheme is an important class of algorithms which has become popular in reinforcement learning (RL), particularly for the policy evaluation problem. Recently, a number of works have been devoted to establishing the finite time analysis of the scheme, especially under the Markovian (non-i.i.d.) noise settings that are ubiquitous in practice. In this… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.