Showing 1–1 of 1 results for author: Panneershelvam, V

Search v0.5.6 released 2020-02-24

arXiv:1507.04296 [pdf, other]

cs.LG cs.AI cs.DC cs.NE

Massively Parallel Methods for Deep Reinforcement Learning

Authors: Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, Alessandro De Maria, Vedavyas Panneershelvam, Mustafa Suleyman, Charles Beattie, Stig Petersen, Shane Legg, Volodymyr Mnih, Koray Kavukcuoglu, David Silver

Abstract: We present the first massively distributed architecture for deep reinforcement learning. This architecture uses four main components: parallel actors that generate new behaviour; parallel learners that are trained from stored experience; a distributed neural network to represent the value function or behaviour policy; and a distributed store of experience. We used our architecture to implement the… ▽ More We present the first massively distributed architecture for deep reinforcement learning. This architecture uses four main components: parallel actors that generate new behaviour; parallel learners that are trained from stored experience; a distributed neural network to represent the value function or behaviour policy; and a distributed store of experience. We used our architecture to implement the Deep Q-Network algorithm (DQN). Our distributed algorithm was applied to 49 games from Atari 2600 games from the Arcade Learning Environment, using identical hyperparameters. Our performance surpassed non-distributed DQN in 41 of the 49 games and also reduced the wall-time required to achieve these results by an order of magnitude on most games. △ Less

Submitted 16 July, 2015; v1 submitted 15 July, 2015; originally announced July 2015.

Comments: Presented at the Deep Learning Workshop, International Conference on Machine Learning, Lille, France, 2015

Search v0.5.6 released 2020-02-24