Skip to main content

Showing 1–1 of 1 results for author: Kobeissi, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.07960  [pdf, other

    cs.LG cs.AI math.AP math.OC

    Temporal Difference Learning with Continuous Time and State in the Stochastic Setting

    Authors: Ziad Kobeissi, Francis Bach

    Abstract: We consider the problem of continuous-time policy evaluation. This consists in learning through observations the value function associated with an uncontrolled continuous-time stochastic dynamic and a reward function. We propose two original variants of the well-known TD(0) method using vanishing time steps. One is model-free and the other is model-based. For both methods, we prove theoretical con… ▽ More

    Submitted 7 June, 2023; v1 submitted 16 February, 2022; originally announced February 2022.