Showing 1–1 of 1 results for author: Pagare, T

Search v0.5.6 released 2020-02-24

arXiv:2304.03729

eess.SY cs.LG

Full Gradient Deep Reinforcement Learning for Average-Reward Criterion

Authors: Tejas Pagare, Vivek Borkar, Konstantin Avrachenkov

Abstract: We extend the provably convergent Full Gradient DQN algorithm for discounted reward Markov decision processes from Avrachenkov et al. (2021) to average reward problems. We experimentally compare widely used RVI Q-Learning with recently proposed Differential Q-Learning in the neural function approximation setting with Full Gradient DQN and DQN. We also extend this to learn Whittle indices for Markovian restless multi-armed bandits. We observe a better convergence rate of the proposed Full Gradient variant across different tasks.

Submitted 7 April, 2023; originally announced April 2023.

Comments: 13 pages, 4 figures; Accepted by 5th Annual Learning for Dynamics & Control Conference (L4DC) 2023

MSC Class: 93-06
