DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning

Sheikh, Hassam; Frisbee, Kizza; Phielipp, Mariano


arXiv:2201.13357 (cs)

[Submitted on 31 Jan 2022 (v1), last revised 17 May 2022 (this version, v3)]



Abstract: Ensembles of neural networks are becoming an important tool for advancing the state of the art in deep reinforcement learning algorithms. However, training the large number of neural networks in an ensemble has an exceedingly high computational cost, which may become a hindrance when training large-scale systems. In this paper, we propose DNS: a Determinantal Point Process based Neural Network Sampler that uses a k-DPP to sample a subset of neural networks for backpropagation at every training step, thus significantly reducing training time and computational cost. We integrated DNS into REDQ for continuous control tasks and evaluated it on MuJoCo environments. Our experiments show that DNS-augmented REDQ outperforms baseline REDQ in terms of average cumulative reward while using less than 50% of the computation when measured in FLOPs.
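To make the sampling step concrete, here is a minimal sketch of drawing a size-k subset of ensemble members from a k-DPP, where a subset S is chosen with probability proportional to det(L_S). It is not the authors' implementation: the abstract does not say how the kernel L is built, so the per-network feature vectors, the cosine-similarity kernel, and the function name `sample_k_dpp` are all assumptions made for illustration. The sampler enumerates every k-subset by brute force, which is exact but only practical for small ensembles.

```python
import itertools

import numpy as np


def sample_k_dpp(L, k, rng):
    """Draw one size-k subset S with P(S) proportional to det(L[S, S]).

    Brute-force enumeration of all k-subsets: exact, but only feasible
    for small ensembles. Hypothetical helper, not taken from the paper.
    """
    n = L.shape[0]
    subsets = list(itertools.combinations(range(n), k))
    # Unnormalized k-DPP weights are the principal minors of the kernel.
    weights = np.array([np.linalg.det(L[np.ix_(s, s)]) for s in subsets])
    weights = np.clip(weights, 0.0, None)  # guard against negative round-off
    probs = weights / weights.sum()
    return subsets[rng.choice(len(subsets), p=probs)]


rng = np.random.default_rng(0)

# Assumption: each of 10 ensemble members is summarized by a feature vector
# (for example, its outputs on a shared batch). Random data stands in here.
features = rng.normal(size=(10, 32))
features /= np.linalg.norm(features, axis=1, keepdims=True)

# Assumption: a cosine-similarity kernel, which is positive semidefinite.
L = features @ features.T

# Indices of the networks that would receive a gradient update this step.
subset = sample_k_dpp(L, k=3, rng=rng)
```

Because the determinant shrinks when the selected rows of the kernel are similar, subsets of near-duplicate networks are unlikely to be chosen, and two members with identical features are never drawn together.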

Comments:	Accepted for Publication at ICML 2022
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2201.13357 [cs.LG]
	(or arXiv:2201.13357v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2201.13357

Submission history

From: Hassam Sheikh
[v1] Mon, 31 Jan 2022 17:08:39 UTC (1,568 KB)
[v2] Sun, 6 Feb 2022 09:20:30 UTC (1,568 KB)
[v3] Tue, 17 May 2022 15:48:35 UTC (1,575 KB)
