Skip to main content

Showing 1–1 of 1 results for author: Pagare, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2304.03729  [pdf, other

    eess.SY cs.LG

    Full Gradient Deep Reinforcement Learning for Average-Reward Criterion

    Authors: Tejas Pagare, Vivek Borkar, Konstantin Avrachenkov

    Abstract: We extend the provably convergent Full Gradient DQN algorithm for discounted reward Markov decision processes from Avrachenkov et al. (2021) to average reward problems. We experimentally compare widely used RVI Q-Learning with recently proposed Differential Q-Learning in the neural function approximation setting with Full Gradient DQN and DQN. We also extend this to learn Whittle indices for Marko… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 13 pages, 4 figures; Accepted by 5th Annual Learning for Dynamics & Control Conference (L4DC) 2023

    MSC Class: 93-06