Skip to main content

Showing 1–1 of 1 results for author: Jabri, M K

Searching in archive cs. Search in all archives.
.
  1. arXiv:1807.01672  [pdf, other

    cs.LG cs.AI stat.ML

    Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization

    Authors: Alexandre Laterre, Yunguan Fu, Mohamed Khalil Jabri, Alain-Sam Cohen, David Kas, Karl Hajjar, Torbjorn S. Dahl, Amine Kerkeni, Karim Beguir

    Abstract: Adversarial self-play in two-player games has delivered impressive results when used with reinforcement learning algorithms that combine deep neural networks and tree search. Algorithms like AlphaZero and Expert Iteration learn tabula-rasa, producing highly informative training data on the fly. However, the self-play training strategy is not directly applicable to single-player games. Recently, se… ▽ More

    Submitted 6 December, 2018; v1 submitted 4 July, 2018; originally announced July 2018.

    Journal ref: Presented at the Thirty-second Conference on Neural Information Processing Systems (NeurIPS 2018), Deep Reinforcement Learning Workshop, Montreal, Canada, December 3-8, 2018