Boosting Long-Delayed Reinforcement Learning with Auxiliary Short-Delayed Task

Wu, Qingyuan; Zhan, Simon Sinong; Wang, Yixuan; Lin, Chung-Wei; Lv, Chen; Zhu, Qi; Huang, Chao

arXiv:2402.03141v1 (cs)

[Submitted on 5 Feb 2024 (this version), latest version 5 Jun 2024 (v2)]

Abstract: Reinforcement learning is challenging in delayed scenarios, a common real-world situation where observations and interactions occur with delays. State-of-the-art (SOTA) state-augmentation techniques suffer from either state-space explosion as the number of delayed steps grows or performance degeneration in stochastic environments. To address these challenges, our novel Auxiliary-Delayed Reinforcement Learning (AD-RL) leverages an auxiliary short-delayed task to accelerate learning on a long-delayed task without compromising performance in stochastic environments. Specifically, AD-RL learns the value function in the short-delayed task and then employs it with bootstrapping and policy improvement techniques in the long-delayed task. We theoretically show that this can greatly reduce the sample complexity compared to learning directly on the original long-delayed task. On deterministic and stochastic benchmarks, our method remarkably outperforms the SOTAs in both sample efficiency and policy performance.
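
The two ideas in the abstract, state augmentation and bootstrapping from a short-delayed value function, can be illustrated with a minimal tabular sketch. This is not the authors' implementation. It assumes discrete states and actions, an augmented state made of the last observed state plus the actions taken since, and a Q-learning-style target that stands in for whatever update AD-RL actually uses. All function names and parameters are hypothetical.

```python
from collections import defaultdict


def augmented_state_count(num_states: int, num_actions: int, delay: int) -> int:
    """Number of augmented states: one observed state plus `delay` pending actions."""
    return num_states * num_actions ** delay


def augmented_state(states, actions, t, delay):
    """Augmented state at step t: the state from `delay` steps ago and the actions since."""
    return (states[t - delay],) + tuple(actions[t - delay:t])


def bootstrapped_update(q_long, q_short, x_long, action, reward, x_short_next,
                        action_set, alpha=0.1, gamma=0.99):
    """One TD step on the long-delay Q-table (assumed form, not the paper's exact rule).

    The bootstrap target is read from the short-delay Q-table, which lives in a
    much smaller augmented state space.
    """
    target = reward + gamma * max(q_short[(x_short_next, a)] for a in action_set)
    q_long[(x_long, action)] += alpha * (target - q_long[(x_long, action)])
    return q_long[(x_long, action)]


# Toy trajectory with integer states and binary actions.
states = [0, 1, 2, 3, 4, 5]
actions = [1, 0, 1, 1, 0]

x_long = augmented_state(states, actions, t=4, delay=3)   # long-delayed view
x_short = augmented_state(states, actions, t=4, delay=1)  # short-delayed view

q_long, q_short = defaultdict(float), defaultdict(float)
q_short[(x_short, 0)] = 2.0  # pretend the short-delayed task has already been learned
new_value = bootstrapped_update(q_long, q_short, x_long, action=0, reward=1.0,
                                x_short_next=x_short, action_set=[0, 1],
                                alpha=0.5, gamma=0.5)
```

With 10 states and 4 actions, `augmented_state_count` gives 40 augmented states for a delay of 1 and 10,485,760 for a delay of 10. This is the growth that the abstract calls state-space explosion, and it is why a target taken from the short-delayed table can be cheaper to learn.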

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
Cite as:	arXiv:2402.03141 [cs.LG]
	(or arXiv:2402.03141v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.03141

Submission history

From: Qingyuan Wu
[v1] Mon, 5 Feb 2024 16:11:03 UTC (3,943 KB)
[v2] Wed, 5 Jun 2024 19:12:37 UTC (4,131 KB)
