Showing 1–35 of 35 results for author: Cazenave, T

  1. Mixture of Public and Private Distributions in Imperfect Information Games

    Authors: Jérôme Arjonilla, Abdallah Saffidine, Tristan Cazenave

    Abstract: In imperfect information games (e.g. Bridge, Skat, Poker), one of the fundamental considerations is to infer the missing information while at the same time avoiding the disclosure of private information. Disregarding the issue of protecting private information can lead to a highly exploitable performance. Yet, excessive attention to it leads to hesitations that are no longer consistent with our pr…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted in CoG 2023

    Journal ref: 2023 IEEE Conference on Games (CoG)

  2. Deep Reinforcement Learning for 5*5 Multiplayer Go

    Authors: Brahim Driss, Jérôme Arjonilla, Hui Wang, Abdallah Saffidine, Tristan Cazenave

    Abstract: In recent years, much progress has been made in computer Go and most of the results have been obtained thanks to search algorithms (Monte Carlo Tree Search) and Deep Reinforcement Learning (DRL). In this paper, we propose to use and analyze the latest algorithms that use search and DRL (AlphaZero and Descent algorithms) to automatically learn to play an extended version of the game of Go with more…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted in EvoApps at Evostar2023

    Journal ref: International Conference on the Applications of Evolutionary Computation (Part of EvoStar), 2023, 753--764

  3. arXiv:2404.09304  [pdf, other]

    cs.AI

    Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms

    Authors: Tristan Cazenave

    Abstract: Monte Carlo Tree Search and Monte Carlo Search have good results for many combinatorial problems. In this paper we propose to use Monte Carlo Search to design mathematical expressions that are used as exploration terms for Monte Carlo Tree Search algorithms. The optimized Monte Carlo Tree Search algorithms are PUCT and SHUSS. We automatically design the PUCT and the SHUSS root exploration terms. F…

    Submitted 14 April, 2024; originally announced April 2024.

  4. arXiv:2401.10431  [pdf, other]

    cs.AI

    Learning a Prior for Monte Carlo Search by Replaying Solutions to Combinatorial Problems

    Authors: Tristan Cazenave

    Abstract: Monte Carlo Search gives excellent results in multiple difficult combinatorial problems. Using a prior to perform non uniform playouts during the search improves a lot the results compared to uniform playouts. Handmade heuristics tailored to the combinatorial problem are often used as priors. We propose a method to automatically compute a prior. It uses statistics on solved problems. It is a simpl…

    Submitted 18 January, 2024; originally announced January 2024.

  5. arXiv:2401.10420  [pdf, other]

    cs.AI

    Generalized Nested Rollout Policy Adaptation with Limited Repetitions

    Authors: Tristan Cazenave

    Abstract: Generalized Nested Rollout Policy Adaptation (GNRPA) is a Monte Carlo search algorithm for optimizing a sequence of choices. We propose to improve on GNRPA by avoiding too deterministic policies that find again and again the same sequence of choices. We do so by limiting the number of repetitions of the best sequence found at a given level. Experiments show that it improves the algorithm for three…

    Submitted 18 January, 2024; originally announced January 2024.

  6. arXiv:2309.12711  [pdf, other]

    cs.AI

    The Mathematical Game

    Authors: Marc Pierre, Quentin Cohen-Solal, Tristan Cazenave

    Abstract: Monte Carlo Tree Search can be used for automated theorem proving. Holophrasm is a neural theorem prover using MCTS combined with neural networks for the policy and the evaluation. In this paper we propose to improve the performance of the Holophrasm theorem prover using other game tree search algorithms.

    Submitted 22 September, 2023; originally announced September 2023.

  7. arXiv:2309.12675  [pdf, other]

    cs.AI cs.CV

    Vision Transformers for Computer Go

    Authors: Amani Sagri, Tristan Cazenave, Jérôme Arjonilla, Abdallah Saffidine

    Abstract: Motivated by the success of transformers in various fields, such as language understanding and image analysis, this investigation explores their application in the context of the game of Go. In particular, our study focuses on the analysis of the Transformer in Vision. Through a detailed analysis of numerous points such as prediction accuracy, win rates, memory, speed, size, or even learning rate,…

    Submitted 22 September, 2023; originally announced September 2023.

  8. arXiv:2308.12767  [pdf, other]

    cs.IR cs.LG stat.ML

    On the Consistency of Average Embeddings for Item Recommendation

    Authors: Walid Bendada, Guillaume Salha-Galvan, Romain Hennequin, Thomas Bouabça, Tristan Cazenave

    Abstract: A prevalent practice in recommender systems consists in averaging item embeddings to represent users or higher-level concepts in the same embedding space. This paper investigates the relevance of such a practice. For this purpose, we propose an expected precision score, designed to measure the consistency of an average embedding relative to the items used for its construction. We subsequently anal…

    Submitted 30 August, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 17th ACM Conference on Recommender Systems (RecSys 2023)

  9. arXiv:2304.09061  [pdf, other]

    cs.IR cs.LG cs.SD eess.AS

    A Scalable Framework for Automatic Playlist Continuation on Music Streaming Services

    Authors: Walid Bendada, Guillaume Salha-Galvan, Thomas Bouabça, Tristan Cazenave

    Abstract: Music streaming services often aim to recommend songs for users to extend the playlists they have created on these services. However, extending playlists while preserving their musical characteristics and matching user preferences remains a challenging task, commonly referred to as Automatic Playlist Continuation (APC). Besides, while these services often need to select the best songs to recommend…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted as a Full Paper at the SIGIR 2023 conference

  10. arXiv:2302.13225  [pdf, ps, other]

    cs.AI

    Towards Tackling MaxSAT by Combining Nested Monte Carlo with Local Search

    Authors: Hui Wang, Abdallah Saffidine, Tristan Cazenave

    Abstract: Recent work proposed the UCTMAXSAT algorithm to address Maximum Satisfiability Problems (MaxSAT) and showed improved performance over pure Stochastic Local Search algorithms (SLS). UCTMAXSAT is based on Monte Carlo Tree Search but it uses SLS instead of purely random playouts. In this work, we introduce two algorithmic variations over UCTMAXSAT. We carry out an empirical analysis on MaxSAT benchmarks f…

    Submitted 25 February, 2023; originally announced February 2023.

  11. arXiv:2302.04318  [pdf, other]

    cs.AI

    Learning to Play Stochastic Two-player Perfect-Information Games without Knowledge

    Authors: Quentin Cohen-Solal, Tristan Cazenave

    Abstract: In this paper, we extend the Descent framework, which enables learning and planning in the context of two-player games with perfect information, to the framework of stochastic games. We propose two ways of doing this, the first way generalizes the search algorithm, i.e. Descent, to stochastic games and the second way approximates stochastic games by deterministic games. We then evaluate them o…

    Submitted 8 February, 2023; originally announced February 2023.

  12. arXiv:2301.09533  [pdf, ps, other]

    cs.AI cs.CE

    Solving the HP model with Nested Monte Carlo Search

    Authors: Milo Roucairol, Tristan Cazenave

    Abstract: In this paper we present a new Monte Carlo Search (MCS) algorithm for finding the ground state energy of proteins in the HP-model. We also compare it briefly to other MCS algorithms not usually used on the HP-model and provide an overview of the algorithms used on HP-model. The algorithm presented in this paper does not beat state of the art algorithms, see PERM (Hsu and Grassberger 2011), REMC (T…

    Submitted 25 January, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI's workshop AI2ASE 2023: 2nd Annual AAAI Workshop on AI to Accelerate Science and Engineering. 6 pages, 1 for references

  13. arXiv:2210.08844  [pdf, other]

    cs.GT

    Sequential Elimination Voting Games

    Authors: Ulysse Pavloff, Tristan Cazenave, Jérôme Lang

    Abstract: Voting by sequential elimination is a low-communication voting protocol: voters play in sequence and eliminate one or more of the remaining candidates, until only one remains. While the fairness and efficiency of such protocols have been explored, the impact of strategic behaviour has not been addressed. We model voting by sequential elimination as a game. Given a fixed elimination sequence, we sh…

    Submitted 17 October, 2022; originally announced October 2022.

  14. arXiv:2210.00216  [pdf, other]

    cs.AI

    Nested Search versus Limited Discrepancy Search

    Authors: Tristan Cazenave

    Abstract: Limited Discrepancy Search (LDS) is a popular algorithm to search a state space with a heuristic to order the possible actions. Nested Search (NS) is another algorithm to search a state space with the same heuristic. NS spends more time on the move associated to the best heuristic playout while LDS spends more time on the best heuristic move. They both use similar times for the same level of searc…

    Submitted 1 October, 2022; originally announced October 2022.

  15. arXiv:2207.13181  [pdf, other]

    cs.AI

    Planning and Learning: Path-Planning for Autonomous Vehicles, a Review of the Literature

    Authors: Kevin Osanlou, Christophe Guettier, Tristan Cazenave, Eric Jacopin

    Abstract: This short review aims to make the reader familiar with state-of-the-art works relating to planning, scheduling and learning. First, we study state-of-the-art planning algorithms. We give a brief introduction of neural networks. Then we explore in more detail graph neural networks, a recent variant of neural networks suited for processing graph-structured inputs. We describe briefly the concept of…

    Submitted 17 October, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: AAAI-format & updated

  16. arXiv:2207.03343  [pdf, other]

    cs.AI math.CO

    Refutation of Spectral Graph Theory Conjectures with Monte Carlo Search

    Authors: Milo Roucairol, Tristan Cazenave

    Abstract: We demonstrate how Monte Carlo Search (MCS) algorithms, namely Nested Monte Carlo Search (NMCS) and Nested Rollout Policy Adaptation (NRPA), can be used to build graphs and find counter-examples to spectral graph theory conjectures in minutes.

    Submitted 3 August, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: 11 pages, figures and pseudocode in appendix

  17. arXiv:2203.15030  [pdf, other]

    cs.AI cs.LG cs.MA cs.RO eess.SY

    Solving Disjunctive Temporal Networks with Uncertainty under Restricted Time-Based Controllability using Tree Search and Graph Neural Networks

    Authors: Kevin Osanlou, Jeremy Frank, Andrei Bursuc, Tristan Cazenave, Eric Jacopin, Christophe Guettier, J. Benton

    Abstract: Planning under uncertainty is an area of interest in artificial intelligence. We present a novel approach based on tree search and graph machine learning for the scheduling problem known as Disjunctive Temporal Networks with Uncertainty (DTNU). Dynamic Controllability (DC) of DTNUs seeks a reactive scheduling strategy to satisfy temporal constraints in response to uncontrollable action durations.…

    Submitted 30 March, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: Thirty-Sixth AAAI Conference on Artificial Intelligence. This version includes the technical appendix. arXiv admin note: substantial text overlap with arXiv:2108.01068

    Journal ref: Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

  18. arXiv:2111.06928  [pdf, other]

    cs.AI

    Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

    Authors: Julien Sentuc, Tristan Cazenave, Jean-Yves Lucas

    Abstract: In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem. We detail some results obtained on the Solomon instances set which is a conventional benchmark for the Vehicle Routing Problem (VRP). We show that on all instanc…

    Submitted 29 December, 2021; v1 submitted 12 November, 2021; originally announced November 2021.

  19. arXiv:2108.01080  [pdf, other]

    cs.AI cs.LG cs.RO

    Learning-based Preference Prediction for Constrained Multi-Criteria Path-Planning

    Authors: Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin

    Abstract: Learning-based methods are increasingly popular for search algorithms in single-criterion optimization problems. In contrast, for multiple-criteria optimization there are significantly fewer approaches despite the existence of numerous applications. Constrained path-planning for Autonomous Ground Vehicles (AGV) is one such application, where an AGV is typically deployed in disaster relief or searc…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2108.00978

    Journal ref: International Conference on Automated Planning and Scheduling 2019, Workshop SPARK

  20. arXiv:2108.01068  [pdf, other]

    cs.AI cs.LG cs.RO

    Time-based Dynamic Controllability of Disjunctive Temporal Networks with Uncertainty: A Tree Search Approach with Graph Neural Network Guidance

    Authors: Kevin Osanlou, Jeremy Frank, J. Benton, Andrei Bursuc, Christophe Guettier, Eric Jacopin, Tristan Cazenave

    Abstract: Scheduling in the presence of uncertainty is an area of interest in artificial intelligence due to the large number of applications. We study the problem of dynamic controllability (DC) of disjunctive temporal networks with uncertainty (DTNU), which seeks a strategy to satisfy all constraints in response to uncontrollable action durations. We introduce a more restricted, stronger form of controlla…

    Submitted 2 August, 2021; originally announced August 2021.

    Journal ref: International Conference on Automated Planning and Scheduling 2020. Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning (PRL)

  21. Optimal Solving of Constrained Path-Planning Problems with Graph Convolutional Networks and Optimized Tree Search

    Authors: Kevin Osanlou, Andrei Bursuc, Christophe Guettier, Tristan Cazenave, Eric Jacopin

    Abstract: Deep learning-based methods are growing in prominence for planning purposes. In this paper, we present a hybrid planner that combines a graph machine learning model and an optimal solver based on branch and bound tree search for path-planning tasks. More specifically, a graph neural network is used to assist the branch and bound algorithm in handling constraints associated with a desired solution pat…

    Submitted 3 April, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: Published as a conference paper at IROS 2019

    Journal ref: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019, pp. 3519-3525

  22. arXiv:2108.00978  [pdf, other]

    cs.AI cs.LG cs.RO

    Constrained Shortest Path Search with Graph Convolutional Neural Networks

    Authors: Kevin Osanlou, Christophe Guettier, Andrei Bursuc, Tristan Cazenave, Eric Jacopin

    Abstract: Planning for Autonomous Unmanned Ground Vehicles (AUGV) is still a challenge, especially in difficult, off-road, critical situations. Automatic planning can be used to reach mission objectives, to perform navigation or maneuvers. Most of the time, the problem consists in finding a path from a source to a destination, while satisfying some operational constraints. In a graph without negative cycles…

    Submitted 2 August, 2021; originally announced August 2021.

    Journal ref: AAAI - ICML / IJCAI / AAMAS 2018 Workshop on Planning and Learning (PAL-18). Stockholm, Sweden 2018

  23. arXiv:2104.04278  [pdf, ps, other]

    cs.AI

    Batch Monte Carlo Tree Search

    Authors: Tristan Cazenave

    Abstract: Making inferences with a deep neural network on a batch of states is much faster with a GPU than making inferences on one state after another. We build on this property to propose Monte Carlo Tree Search algorithms using batched inferences. Instead of using either a search tree or a transposition table we propose to use both in the same algorithm. The transposition table contains the results of th…

    Submitted 9 April, 2021; originally announced April 2021.

  24. arXiv:2102.03467  [pdf, other]

    cs.AI

    Improving Model and Search for Computer Go

    Authors: Tristan Cazenave

    Abstract: The standard for Deep Reinforcement Learning in games, following Alpha Zero, is to use residual networks and to increase the depth of the network to get better results. We propose to improve mobile networks as an alternative to residual networks and experimentally show the playing strength of the networks according to both their width and their depth. We also propose a generalization of the PUCT s…

    Submitted 9 April, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

  25. arXiv:2101.12639  [pdf, other]

    cs.AI

    Optimizing $αμ$

    Authors: Tristan Cazenave, Swann Legras, Véronique Ventos

    Abstract: $αμ$ is a search algorithm which repairs two defaults of Perfect Information Monte Carlo search: strategy fusion and non locality. In this paper we optimize $αμ$…

    Submitted 29 January, 2021; originally announced January 2021.

  26. arXiv:2101.03563  [pdf, other]

    cs.AI

    Stabilized Nested Rollout Policy Adaptation

    Authors: Tristan Cazenave, Jean-Baptiste Sevestre, Matthieu Toulemont

    Abstract: Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to modify NRPA in order to improve the stability of the algorithm. Experiments show it improves the algorithm for different application domains: SameGame, Traveling Salesman with Time Windows and Expression Discovery.

    Submitted 10 January, 2021; originally announced January 2021.

    Comments: arXiv admin note: text overlap with arXiv:2003.10024

  27. arXiv:2012.10700  [pdf, other]

    cs.AI

    Minimax Strikes Back

    Authors: Quentin Cohen-Solal, Tristan Cazenave

    Abstract: Deep Reinforcement Learning (DRL) reaches a superhuman level of play in many complete information games. The state of the art search algorithm used in combination with DRL is Monte Carlo Tree Search (MCTS). We take another approach to DRL using a Minimax algorithm instead of MCTS and learning only the evaluation of states, not the policy. We show that for multiple games it is competitive with the…

    Submitted 19 December, 2020; originally announced December 2020.

  28. arXiv:2008.10080  [pdf, other]

    cs.AI

    Mobile Networks for Computer Go

    Authors: Tristan Cazenave

    Abstract: The architecture of the neural networks used in Deep Reinforcement Learning programs such as Alpha Zero or Polygames has been shown to have a great impact on the performances of the resulting playing engines. For example the use of residual networks gave a 600 ELO increase in the strength of Alpha Go. This paper proposes to evaluate the interest of Mobile Network for the game of Go using supervise…

    Submitted 23 August, 2020; originally announced August 2020.

  29. arXiv:2005.09961  [pdf, other]

    cs.AI

    Monte Carlo Inverse Folding

    Authors: Tristan Cazenave, Thomas Fournier

    Abstract: The RNA Inverse Folding problem comes from computational biology. The goal is to find a molecule that has a given folding. It is important for scientific fields such as bioengineering, pharmaceutical research, biochemistry, synthetic biology and RNA nanostructures. Nested Monte Carlo Search has given excellent results for this problem. We propose to adapt and evaluate different Monte Carlo Search…

    Submitted 20 May, 2020; originally announced May 2020.

  30. arXiv:2003.10024  [pdf, other]

    cs.AI

    Generalized Nested Rollout Policy Adaptation

    Authors: Tristan Cazenave

    Abstract: Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to generalize NRPA with a temperature and a bias and to analyze theoretically the algorithms. The generalized algorithm is named GNRPA. Experiments show it improves on NRPA for different application domains: SameGame and the Traveling Salesman Problem with Time Windows.

    Submitted 22 March, 2020; originally announced March 2020.

  31. arXiv:2001.09832  [pdf, other]

    cs.LG stat.ML

    Polygames: Improved Zero Learning

    Authors: Tristan Cazenave, Yen-Chi Chen, Guan-Wei Chen, Shi-Yu Chen, Xian-Dong Chiu, Julien Dehos, Maria Elsa, Qucheng Gong, Hengyuan Hu, Vasil Khalidov, Cheng-Ling Li, Hsin-I Lin, Yu-** Lin, Xavier Martinet, Vegard Mella, Jeremy Rapin, Baptiste Roziere, Gabriel Synnaeve, Fabien Teytaud, Olivier Teytaud, Shi-Cheng Ye, Yi-Jun Ye, Shi-Jim Yen, Sergey Zagoruyko

    Abstract: Since DeepMind's AlphaZero, Zero learning quickly became the state-of-the-art method for many board games. It can be improved using a fully convolutional structure (no fully connected layer). Using such an architecture plus global pooling, we can create bots independent of the board size. The training can be made more robust by keeping track of the best checkpoints during the training and by train…

    Submitted 27 January, 2020; originally announced January 2020.

  32. arXiv:2001.05087  [pdf, ps, other]

    cs.AI

    Monte Carlo Game Solver

    Authors: Tristan Cazenave

    Abstract: We present a general algorithm to order moves so as to speedup exact game solvers. It uses online learning of playout policies and Monte Carlo Tree Search. The learned policy and the information in the Monte Carlo tree are used to order moves in game solvers. They improve greatly the solving time for multiple games.

    Submitted 14 January, 2020; originally announced January 2020.

  33. arXiv:1911.07960  [pdf, ps, other]

    cs.AI

    The αμ Search Algorithm for the Game of Bridge

    Authors: Tristan Cazenave, Véronique Ventos

    Abstract: αμ is an anytime heuristic search algorithm for incomplete information games that assumes perfect information for the opponents. αμ addresses the strategy fusion and non-locality problems encountered by Perfect Information Monte Carlo sampling. In this paper αμ is applied to the game of Bridge.

    Submitted 18 November, 2019; originally announced November 2019.

  34. arXiv:1607.02431  [pdf, other]

    cs.AI cs.GT

    Learning opening books in partially observable games: using random seeds in Phantom Go

    Authors: Tristan Cazenave, Jialin Liu, Fabien Teytaud, Olivier Teytaud

    Abstract: Many artificial intelligences (AIs) are randomized. One can be lucky or unlucky with the random seed; we quantify this effect and show that, maybe contrarily to intuition, this is far from being negligible. Then, we apply two different existing algorithms for selecting good seeds and good probability distributions over seeds. This mainly leads to learning an opening book. We apply this to Phantom…

    Submitted 8 July, 2016; originally announced July 2016.

    Comments: 7 pages, 15 figures. Accepted by CIG2016

    MSC Class: 91A05; 91A10

  35. arXiv:1511.02006  [pdf, other]

    cs.GT

    Depth, balancing, and limits of the Elo model

    Authors: Marie-Liesse Cauwet, Olivier Teytaud, Hua-Min Liang, Shi-Jim Yen, Hung-Hsuan Lin, I-Chen Wu, Tristan Cazenave, Abdallah Saffidine

    Abstract: Much work has been devoted to the computational complexity of games. However, they are not necessarily relevant for estimating the complexity in human terms. Therefore, human-centered measures have been proposed, e.g. the depth. This paper discusses the depth of various games, extends it to a continuous measure. We provide new depth results and present tool (given-first-move, pie rule, size exten…

    Submitted 6 November, 2015; originally announced November 2015.

    Journal ref: IEEE Conference on Computational Intelligence and Games 2015, Aug 2015, Tainan, Taiwan. 2015