Skip to main content

Showing 1–2 of 2 results for author: Habara, K

.
  1. arXiv:2303.17503  [pdf, other

    cs.AI cs.LG

    Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning

    Authors: Sotetsu Koyamada, Shinri Okano, Soichiro Nishimori, Yu Murata, Keigo Habara, Haruka Kita, Shin Ishii

    Abstract: We propose Pgx, a suite of board game reinforcement learning (RL) environments written in JAX and optimized for GPU/TPU accelerators. By leveraging JAX's auto-vectorization and parallelization over accelerators, Pgx can efficiently scale to thousands of simultaneous simulations over accelerators. In our experiments on a DGX-A100 workstation, we discovered that Pgx can simulate RL environments 10-1… ▽ More

    Submitted 15 January, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

  2. arXiv:2303.11046  [pdf, other

    cs.GT cs.AI

    Convergence analysis and acceleration of the smoothing methods for solving extensive-form games

    Authors: Keigo Habara, Ellen Hidemi Fukuda, Nobuo Yamashita

    Abstract: The extensive-form game has been studied considerably in recent years. It can represent games with multiple decision points and incomplete information, and hence it is helpful in formulating games with uncertain inputs, such as poker. We consider an extended-form game with two players and zero-sum, i.e., the sum of their payoffs is always zero. In such games, the problem of finding the optimal str… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 23 pages, 6 figures

    MSC Class: 91A05; 91A10; 91A18; 91A27 ACM Class: G.1.6