Skip to main content

Showing 1–4 of 4 results for author: Parr, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.13398  [pdf, other

    cs.LG

    Paused Agent Replay Refresh

    Authors: Benjamin Parr

    Abstract: Reinforcement learning algorithms have become more complex since the invention of target networks. Unfortunately, target networks have not kept up with this increased complexity, instead requiring approximate solutions to be computationally feasible. These approximations increase noise in the Q-value targets and in the replay sampling distribution. Paused Agent Replay Refresh (PARR) is a drop-in r… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  2. arXiv:1801.03138  [pdf, other

    cs.AI

    Deep In-GPU Experience Replay

    Authors: Ben Parr

    Abstract: Experience replay allows a reinforcement learning agent to train on samples from a large amount of the most recent experiences. A simple in-RAM experience replay stores these most recent experiences in a list in RAM, and then copies sampled batches to the GPU for training. I moved this list to the GPU, thus creating an in-GPU experience replay, and a training step that no longer has inputs copied… ▽ More

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: Source code (uses TensorFlow): https://github.com/bparr/gpu-experience-replay

  3. arXiv:1801.03137  [pdf, other

    cs.LG cs.AI stat.ML

    Convergence Analysis of Gradient Descent Algorithms with Proportional Updates

    Authors: Igor Gitman, Deepak Dilipkumar, Ben Parr

    Abstract: The rise of deep learning in recent years has brought with it increasingly clever optimization methods to deal with complex, non-linear loss functions. These methods are often designed with convex optimization in mind, but have been shown to work well in practice even for the highly non-convex optimization associated with neural networks. However, one significant drawback of these methods when the… ▽ More

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: Source code (uses TensorFlow): https://github.com/bparr/lars

  4. arXiv:1712.03280  [pdf, other

    cs.AI

    Nintendo Super Smash Bros. Melee: An "Untouchable" Agent

    Authors: Ben Parr, Deepak Dilipkumar, Yuan Liu

    Abstract: Nintendo's Super Smash Bros. Melee fighting game can be emulated on modern hardware allowing us to inspect internal memory states, such as character positions. We created an AI that avoids being hit by training using these internal memory states and outputting controller button presses. After training on a month's worth of Melee matches, our best agent learned to avoid the toughest AI built into t… ▽ More

    Submitted 8 December, 2017; originally announced December 2017.