Bridging the Gap Between Target Networks and Functional Regularization

Piche, Alexandre; Thomas, Valentin; Marino, Joseph; Pardinas, Rafael; Marconi, Gian Maria; Pal, Christopher; Khan, Mohammad Emtiyaz

Computer Science > Machine Learning

arXiv:2210.12282 (cs)

This paper has been withdrawn by Valentin Thomas

[Submitted on 21 Oct 2022 (v1), last revised 3 Jan 2024 (this version, v2)]

Title:Bridging the Gap Between Target Networks and Functional Regularization

Authors:Alexandre Piche, Valentin Thomas, Joseph Marino, Rafael Pardinas, Gian Maria Marconi, Christopher Pal, Mohammad Emtiyaz Khan

No PDF available, click to view other formats

Abstract:Bootstrap** is behind much of the successes of Deep Reinforcement Learning. However, learning the value function via bootstrap** often leads to unstable training due to fast-changing target values. Target Networks are employed to stabilize training by using an additional set of lagging parameters to estimate the target values. Despite the popularity of Target Networks, their effect on the optimization is still misunderstood. In this work, we show that they act as an implicit regularizer. This regularizer has disadvantages such as being inflexible and non convex. To overcome these issues, we propose an explicit Functional Regularization that is a convex regularizer in function space and can easily be tuned. We analyze the convergence of our method theoretically and empirically demonstrate that replacing Target Networks with the more theoretically grounded Functional Regularization approach leads to better sample efficiency and performance improvements.

Comments:	The published version of this paper (TMLR 2023) is available at arXiv:2106.02613 and this https URL
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2210.12282 [cs.LG]
	(or arXiv:2210.12282v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2210.12282

Submission history

From: Valentin Thomas [view email]
[v1] Fri, 21 Oct 2022 22:27:07 UTC (1,633 KB)
[v2] Wed, 3 Jan 2024 17:02:21 UTC (1 KB) (withdrawn)

Computer Science > Machine Learning

Title:Bridging the Gap Between Target Networks and Functional Regularization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Bridging the Gap Between Target Networks and Functional Regularization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators