Showing 1–1 of 1 results for author: Burdick, R

Search v0.5.6 released 2020-02-24

arXiv:2109.02797 [pdf]

cs.LG cs.AI cs.CL

Puzzle Solving without Search or Human Knowledge: An Unnatural Language Approach

Authors: David Noever, Ryerson Burdick

Abstract: The application of Generative Pre-trained Transformer (GPT-2) to learn text-archived game notation provides a model environment for exploring sparse reward gameplay. The transformer architecture proves amenable to training on solved text archives describing mazes, Rubik's Cube, and Sudoku solvers. The method benefits from fine-tuning the transformer architecture to visualize plausible strategies d… ▽ More The application of Generative Pre-trained Transformer (GPT-2) to learn text-archived game notation provides a model environment for exploring sparse reward gameplay. The transformer architecture proves amenable to training on solved text archives describing mazes, Rubik's Cube, and Sudoku solvers. The method benefits from fine-tuning the transformer architecture to visualize plausible strategies derived outside any guidance from human heuristics or domain expertise. The large search space ($>10^{19}$) for the games provides a puzzle environment in which the solution has few intermediate rewards and a final move that solves the challenge. △ Less

Submitted 6 September, 2021; originally announced September 2021.

Search v0.5.6 released 2020-02-24