Skip to main content

Showing 1–2 of 2 results for author: Kas, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:1905.12941  [pdf, other

    cs.AI

    Learning Compositional Neural Programs with Recursive Tree Search and Planning

    Authors: Thomas Pierrot, Guillaume Ligner, Scott Reed, Olivier Sigaud, Nicolas Perrin, Alexandre Laterre, David Kas, Karim Beguir, Nando de Freitas

    Abstract: We propose a novel reinforcement learning algorithm, AlphaNPI, that incorporates the strengths of Neural Programmer-Interpreters (NPI) and AlphaZero. NPI contributes structural biases in the form of modularity, hierarchy and recursion, which are helpful to reduce sample complexity, improve generalization and increase interpretability. AlphaZero contributes powerful neural network guided search alg… ▽ More

    Submitted 13 April, 2021; v1 submitted 30 May, 2019; originally announced May 2019.

  2. arXiv:1807.01672  [pdf, other

    cs.LG cs.AI stat.ML

    Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization

    Authors: Alexandre Laterre, Yunguan Fu, Mohamed Khalil Jabri, Alain-Sam Cohen, David Kas, Karl Hajjar, Torbjorn S. Dahl, Amine Kerkeni, Karim Beguir

    Abstract: Adversarial self-play in two-player games has delivered impressive results when used with reinforcement learning algorithms that combine deep neural networks and tree search. Algorithms like AlphaZero and Expert Iteration learn tabula-rasa, producing highly informative training data on the fly. However, the self-play training strategy is not directly applicable to single-player games. Recently, se… ▽ More

    Submitted 6 December, 2018; v1 submitted 4 July, 2018; originally announced July 2018.

    Journal ref: Presented at the Thirty-second Conference on Neural Information Processing Systems (NeurIPS 2018), Deep Reinforcement Learning Workshop, Montreal, Canada, December 3-8, 2018