-
[Ne v] emission from a faint epoch of reionization-era galaxy: evidence for a narrow-line intermediate mass black hole
Authors:
J. Chisholm,
D. A. Berg,
R. Endsley,
S. Gazagnes,
C. T. Richardson,
E. Lambrides,
J. Greene,
S. Finkelstein,
S. Flury,
N. G. Guseva,
A. Henry,
T. A. Hutchison,
Y. I. Izotov,
R. Marques-Chaves,
P. Oesch,
C. Papovich,
A. Saldana-Lopez,
D. Schaerer,
M. G. Stephenson
Abstract:
Here we present high spectral resolution $\textit{JWST}$ NIRSpec observations of GN42437, a low-mass (log(M$_\ast/M_\odot)=7.9$), compact ($r_e < 500$pc), extreme starburst galaxy at $z=5.59$ with 13 emission line detections. GN42437 has a low-metallicity (5-10% Z$_\odot$) and its rest-frame H$α$ equivalent width suggests nearly all of the observed stellar mass formed within the last 3 Myr. GN4243…
▽ More
Here we present high spectral resolution $\textit{JWST}$ NIRSpec observations of GN42437, a low-mass (log(M$_\ast/M_\odot)=7.9$), compact ($r_e < 500$pc), extreme starburst galaxy at $z=5.59$ with 13 emission line detections. GN42437 has a low-metallicity (5-10% Z$_\odot$) and its rest-frame H$α$ equivalent width suggests nearly all of the observed stellar mass formed within the last 3 Myr. GN42437 has an extraordinary 7$σ$ significant [Ne V] 3427 $\mathring{\rm A}$ detection. The [Ne V] line has a rest-frame equivalent width of $11\pm2\mathring{\rm A}$, [Ne V]/H$α=0.04\pm0.007$, [Ne V]/[Ne III] 3870$\mathring{\rm A} = 0.26\pm0.04$, and [Ne V]/He II 4687 $\mathring{\rm A} = 1.2\pm0.5$. Ionization from massive stars, shocks, or high-mass X-ray binaries cannot simultaneously produce these [Ne V] and low-ionization line ratios. Reproducing the complete nebular structure requires both massive stars and accretion onto a black hole. We do not detect broad lines nor do the traditional diagnostics indicate that GN42437 has an accreting black hole. Thus, the very-high-ionization emission lines powerfully diagnose faint narrow-line black holes at high-redshift. We approximate the black hole mass in a variety of ways as log(M$_{\rm BH}/M_\odot) \sim 5-7$. This black hole mass is consistent with local relations between the black hole mass and the observed velocity dispersion, but significantly more massive than the stellar mass would predict. Very-high-ionization emission lines may reveal samples to probe the formation and growth of the first black holes in the universe.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Cumulant Tensors in Partitioned Independent Component Analysis
Authors:
Marina Garrote-López,
Monroe Stephenson
Abstract:
In this work, we explore Partitioned Independent Component Analysis (PICA), an extension of the well-established Independent Component Analysis (ICA) framework. Traditionally, ICA focuses on extracting a vector of independent source signals from a linear combination of them defined by a mixing matrix. We aim to provide a comprehensive understanding of the identifiability of this mixing matrix in I…
▽ More
In this work, we explore Partitioned Independent Component Analysis (PICA), an extension of the well-established Independent Component Analysis (ICA) framework. Traditionally, ICA focuses on extracting a vector of independent source signals from a linear combination of them defined by a mixing matrix. We aim to provide a comprehensive understanding of the identifiability of this mixing matrix in ICA. Significant to our investigation, recent developments by Mesters and Zwiernik relax these strict independence requirements, studying the identifiability of the mixing matrix from zero restrictions on cumulant tensors. In this paper, we assume alternative independence conditions, in particular, the PICA case, where only partitions of the sources are mutually independent. We study this case from an algebraic perspective, and our primary result generalizes previous results on the identifiability of the mixing matrix.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Quasar Sightline and Galaxy Evolution (QSAGE) -- III. The mass-metallicity and fundamental metallicity relation of $z \approx$ 2.2 galaxies
Authors:
H. M. O. Stephenson,
J. P. Stott,
F. Cullen,
R. M. Bielby,
N. Amos,
R. Dutta,
M. Fumagalli,
N. Tejos,
J. N. Burchett,
R. A. Crain,
J. X. Prochaska
Abstract:
We present analysis of the mass-metallicity relation (MZR) for a sample of 67 [OIII]-selected star-forming galaxies at a redshift range of $z=1.99 - 2.32$ ($z_{\text{med}} = 2.16$) using \emph{Hubble Space Telescope} Wide Field Camera 3 grism spectroscopy from the Quasar Sightline and Galaxy Evolution (QSAGE) survey. Metallicities were determined using empirical gas-phase metallicity calibrations…
▽ More
We present analysis of the mass-metallicity relation (MZR) for a sample of 67 [OIII]-selected star-forming galaxies at a redshift range of $z=1.99 - 2.32$ ($z_{\text{med}} = 2.16$) using \emph{Hubble Space Telescope} Wide Field Camera 3 grism spectroscopy from the Quasar Sightline and Galaxy Evolution (QSAGE) survey. Metallicities were determined using empirical gas-phase metallicity calibrations based on the strong emission lines [OII]3727,3729, [OIII]4959,5007 and H$β$. Star-forming galaxies were identified, and distinguished from active-galactic nuclei, via Mass-Excitation diagrams. Using $z\sim0$ metallicity calibrations, we observe a negative offset in the $z=2.2$ MZR of $\approx -0.51$ dex in metallicity when compared to locally derived relationships, in agreement with previous literature analysis. A similar offset of $\approx -0.46$ dex in metallicity is found when using empirical metallicity calibrations that are suitable out to $z\sim5$, though our $z=2.2$ MZR, in this case, has a shallower slope. We find agreement between our MZR and those predicted from various galaxy evolution models and simulations. Additionally, we explore the extended fundamental metallicity relation (FMR) which includes an additional dependence on star formation rate (SFR). Our results consistently support the existence of the FMR, as well as revealing an offset of $0.28\pm0.04$ dex in metallicity compared to locally-derived relationships, consistent with previous studies at similar redshifts. We interpret the negative correlation with SFR at fixed mass, inferred from an FMR existing for our sample, as being caused by the efficient accretion of metal-poor gas fuelling SFR at cosmic noon.
△ Less
Submitted 12 December, 2023; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Cooperative AI via Decentralized Commitment Devices
Authors:
Xinyuan Sun,
Davide Crapis,
Matt Stephenson,
Barnabé Monnot,
Thomas Thiery,
Jonathan Passerat-Palmbach
Abstract:
Credible commitment devices have been a popular approach for robust multi-agent coordination. However, existing commitment mechanisms face limitations like privacy, integrity, and susceptibility to mediator or user strategic behavior. It is unclear if the cooperative AI techniques we study are robust to real-world incentives and attack vectors. However, decentralized commitment devices that utiliz…
▽ More
Credible commitment devices have been a popular approach for robust multi-agent coordination. However, existing commitment mechanisms face limitations like privacy, integrity, and susceptibility to mediator or user strategic behavior. It is unclear if the cooperative AI techniques we study are robust to real-world incentives and attack vectors. However, decentralized commitment devices that utilize cryptography have been deployed in the wild, and numerous studies have shown their ability to coordinate algorithmic agents facing adversarial opponents with significant economic incentives, currently in the order of several million to billions of dollars. In this paper, we use examples in the decentralization and, in particular, Maximal Extractable Value (MEV) (arXiv:1904.05234) literature to illustrate the potential security issues in cooperative AI. We call for expanded research into decentralized commitments to advance cooperative AI capabilities for secure coordination in open environments and empirical testing frameworks to evaluate multi-agent coordination ability given real-world commitment constraints.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Utilizing Generative Adversarial Networks for Stable Structure Generation in Angry Birds
Authors:
Frederic Abraham,
Matthew Stephenson
Abstract:
This paper investigates the suitability of using Generative Adversarial Networks (GANs) to generate stable structures for the physics-based puzzle game Angry Birds. While previous applications of GANs for level generation have been mostly limited to tile-based representations, this paper explores their suitability for creating stable structures made from multiple smaller blocks. This includes a de…
▽ More
This paper investigates the suitability of using Generative Adversarial Networks (GANs) to generate stable structures for the physics-based puzzle game Angry Birds. While previous applications of GANs for level generation have been mostly limited to tile-based representations, this paper explores their suitability for creating stable structures made from multiple smaller blocks. This includes a detailed encoding/decoding process for converting between Angry Birds level descriptions and a suitable grid-based representation, as well as utilizing state-of-the-art GAN architectures and training methods to produce new structure designs. Our results show that GANs can be successfully applied to generate a varied range of complex and stable Angry Birds structures.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Physics-Based Task Generation through Causal Sequence of Physical Interactions
Authors:
Chathura Gamage,
Vimukthini Pinto,
Matthew Stephenson,
Jochen Renz
Abstract:
Performing tasks in a physical environment is a crucial yet challenging problem for AI systems operating in the real world. Physics simulation-based tasks are often employed to facilitate research that addresses this challenge. In this paper, first, we present a systematic approach for defining a physical scenario using a causal sequence of physical interactions between objects. Then, we propose a…
▽ More
Performing tasks in a physical environment is a crucial yet challenging problem for AI systems operating in the real world. Physics simulation-based tasks are often employed to facilitate research that addresses this challenge. In this paper, first, we present a systematic approach for defining a physical scenario using a causal sequence of physical interactions between objects. Then, we propose a methodology for generating tasks in a physics-simulating environment using these defined scenarios as inputs. Our approach enables a better understanding of the granular mechanics required for solving physics-based tasks, thereby facilitating accurate evaluation of AI systems' physical reasoning capabilities. We demonstrate our proposed task generation methodology using the physics-based puzzle game Angry Birds and evaluate the generated tasks using a range of metrics, including physical stability, solvability using intended physical interactions, and accidental solvability using unintended solutions. We believe that the tasks generated using our proposed methodology can facilitate a nuanced evaluation of physical reasoning agents, thus paving the way for the development of agents for more sophisticated real-world applications.
△ Less
Submitted 16 August, 2023; v1 submitted 5 August, 2023;
originally announced August 2023.
-
A Differential Datalog Interpreter
Authors:
Matthew Stephenson
Abstract:
Redacted by arXiv admins
Redacted by arXiv admins
△ Less
Submitted 27 July, 2023;
originally announced July 2023.
-
NovPhy: A Testbed for Physical Reasoning in Open-world Environments
Authors:
Chathura Gamage,
Vimukthini Pinto,
Cheng Xue,
Peng Zhang,
Ekaterina Nikonova,
Matthew Stephenson,
Jochen Renz
Abstract:
Due to the emergence of AI systems that interact with the physical environment, there is an increased interest in incorporating physical reasoning capabilities into those AI systems. But is it enough to only have physical reasoning capabilities to operate in a real physical environment? In the real world, we constantly face novel situations we have not encountered before. As humans, we are compete…
▽ More
Due to the emergence of AI systems that interact with the physical environment, there is an increased interest in incorporating physical reasoning capabilities into those AI systems. But is it enough to only have physical reasoning capabilities to operate in a real physical environment? In the real world, we constantly face novel situations we have not encountered before. As humans, we are competent at successfully adapting to those situations. Similarly, an agent needs to have the ability to function under the impact of novelties in order to properly operate in an open-world physical environment. To facilitate the development of such AI systems, we propose a new testbed, NovPhy, that requires an agent to reason about physical scenarios in the presence of novelties and take actions accordingly. The testbed consists of tasks that require agents to detect and adapt to novelties in physical scenarios. To create tasks in the testbed, we develop eight novelties representing a diverse novelty space and apply them to five commonly encountered scenarios in a physical environment. According to our testbed design, we evaluate two capabilities of an agent: the performance on a novelty when it is applied to different physical scenarios and the performance on a physical scenario when different novelties are applied to it. We conduct a thorough evaluation with human players, learning agents, and heuristic agents. Our evaluation shows that humans' performance is far beyond the agents' performance. Some agents, even with good normal task performance, perform significantly worse when there is a novelty, and the agents that can adapt to novelties typically adapt slower than humans. We promote the development of intelligent agents capable of performing at the human level or above when operating in open-world physical environments. Testbed website: https://github.com/phy-q/novphy
△ Less
Submitted 5 August, 2023; v1 submitted 2 March, 2023;
originally announced March 2023.
-
Measuring Board Game Distance
Authors:
Matthew Stephenson,
Dennis J. N. J. Soemers,
Éric Piette,
Cameron Browne
Abstract:
This paper presents a general approach for measuring distances between board games within the Ludii general game system. These distances are calculated using a previously published set of general board game concepts, each of which represents a common game idea or shared property. Our results compare and contrast two different measures of distance, highlighting the subjective nature of such metrics…
▽ More
This paper presents a general approach for measuring distances between board games within the Ludii general game system. These distances are calculated using a previously published set of general board game concepts, each of which represents a common game idea or shared property. Our results compare and contrast two different measures of distance, highlighting the subjective nature of such metrics and discussing the different ways that they can be interpreted.
△ Less
Submitted 10 January, 2023;
originally announced January 2023.
-
The Ludii Game Description Language is Universal
Authors:
Dennis J. N. J. Soemers,
Éric Piette,
Matthew Stephenson,
Cameron Browne
Abstract:
There are several different game description languages (GDLs), each intended to allow wide ranges of arbitrary games (i.e., general games) to be described in a single higher-level language than general-purpose programming languages. Games described in such formats can subsequently be presented as challenges for automated general game playing agents, which are expected to be capable of playing any…
▽ More
There are several different game description languages (GDLs), each intended to allow wide ranges of arbitrary games (i.e., general games) to be described in a single higher-level language than general-purpose programming languages. Games described in such formats can subsequently be presented as challenges for automated general game playing agents, which are expected to be capable of playing any arbitrary game described in such a language without prior knowledge about the games to be played. The language used by the Ludii general game system was previously shown to be capable of representing equivalent games for any arbitrary, finite, deterministic, fully observable extensive-form game. In this paper, we prove its universality by extending this to include finite non-deterministic and imperfect-information games.
△ Less
Submitted 12 June, 2024; v1 submitted 1 May, 2022;
originally announced May 2022.
-
Spatial State-Action Features for General Games
Authors:
Dennis J. N. J. Soemers,
Éric Piette,
Matthew Stephenson,
Cameron Browne
Abstract:
In many board games and other abstract games, patterns have been used as features that can guide automated game-playing agents. Such patterns or features often represent particular configurations of pieces, empty positions, etc., which may be relevant for a game's strategies. Their use has been particularly prevalent in the game of Go, but also many other games used as benchmarks for AI research.…
▽ More
In many board games and other abstract games, patterns have been used as features that can guide automated game-playing agents. Such patterns or features often represent particular configurations of pieces, empty positions, etc., which may be relevant for a game's strategies. Their use has been particularly prevalent in the game of Go, but also many other games used as benchmarks for AI research. In this paper, we formulate a design and efficient implementation of spatial state-action features for general games. These are patterns that can be trained to incentivise or disincentivise actions based on whether or not they match variables of the state in a local area around action variables. We provide extensive details on several design and implementation choices, with a primary focus on achieving a high degree of generality to support a wide variety of different games using different board geometries or other graphs. Secondly, we propose an efficient approach for evaluating active features for any given set of features. In this approach, we take inspiration from heuristics used in problems such as SAT to optimise the order in which parts of patterns are matched and prune unnecessary evaluations. This approach is defined for a highly general and abstract description of the problem -- phrased as optimising the order in which propositions of formulas in disjunctive normal form are evaluated -- and may therefore also be of interest to other types of problems than board games. An empirical evaluation on 33 distinct games in the Ludii general game system demonstrates the efficiency of this approach in comparison to a naive baseline, as well as a baseline based on prefix trees, and demonstrates that the additional efficiency significantly improves the playing strength of agents using the features to guide search.
△ Less
Submitted 4 May, 2023; v1 submitted 17 January, 2022;
originally announced January 2022.
-
Asymptotic Behavior of Differential Powers
Authors:
Jennifer Kenkel,
Lillian McPherson,
Janet Page,
Daniel Smolkin,
Monroe Stephenson,
Fuxiang Yang
Abstract:
In this paper, we study the differential power operation on ideals. We begin with a focus on monomial ideals in characteristic 0 and find a class of ideals whose differential powers are eventually principal. We also study the containment problem between ordinary and differential powers of ideals, in analogy to earlier work comparing ordinary and symbolic powers of ideals. We further define a possi…
▽ More
In this paper, we study the differential power operation on ideals. We begin with a focus on monomial ideals in characteristic 0 and find a class of ideals whose differential powers are eventually principal. We also study the containment problem between ordinary and differential powers of ideals, in analogy to earlier work comparing ordinary and symbolic powers of ideals. We further define a possible closure operation on ideals, called the differential closure, in analogy with integral closure and tight closure. We show that this closure operation agrees with taking the radical of an ideal if and only if the ambient ring is a simple $D$-module.
△ Less
Submitted 30 November, 2021;
originally announced November 2021.
-
General Board Geometry
Authors:
Cameron Browne,
Éric Piette,
Matthew Stephenson,
Dennis J. N. J. Soemers
Abstract:
Game boards are described in the Ludii general game system by their underlying graphs, based on tiling, shape and graph operators, with the automatic detection of important properties such as topological relationships between graph elements, directions and radial step sequences. This approach allows most conceivable game boards to be described simply and succinctly.
Game boards are described in the Ludii general game system by their underlying graphs, based on tiling, shape and graph operators, with the automatic detection of important properties such as topological relationships between graph elements, directions and radial step sequences. This approach allows most conceivable game boards to be described simply and succinctly.
△ Less
Submitted 22 November, 2021;
originally announced November 2021.
-
Optimised Playout Implementations for the Ludii General Game System
Authors:
Dennis J. N. J. Soemers,
Éric Piette,
Matthew Stephenson,
Cameron Browne
Abstract:
This paper describes three different optimised implementations of playouts, as commonly used by game-playing algorithms such as Monte-Carlo Tree Search. Each of the optimised implementations is applicable only to specific sets of games, based on their rules. The Ludii general game system can automatically infer, based on a game's description in its general game description language, whether any op…
▽ More
This paper describes three different optimised implementations of playouts, as commonly used by game-playing algorithms such as Monte-Carlo Tree Search. Each of the optimised implementations is applicable only to specific sets of games, based on their rules. The Ludii general game system can automatically infer, based on a game's description in its general game description language, whether any optimised implementations are applicable. An empirical evaluation demonstrates major speedups over a standard implementation, with a median result of running playouts 5.08 times as fast, over 145 different games in Ludii for which one of the optimised implementations is applicable.
△ Less
Submitted 4 November, 2021;
originally announced November 2021.
-
Automatic Generation of Board Game Manuals
Authors:
Matthew Stephenson,
Eric Piette,
Dennis J. N. J. Soemers,
Cameron Browne
Abstract:
In this paper we present a process for automatically generating manuals for board games within the Ludii general game system. This process requires many different sub-tasks to be addressed, such as English translation of Ludii game descriptions, move visualisation, highlighting winning moves, strategy explanation, among others. These aspects are then combined to create a full manual for any given…
▽ More
In this paper we present a process for automatically generating manuals for board games within the Ludii general game system. This process requires many different sub-tasks to be addressed, such as English translation of Ludii game descriptions, move visualisation, highlighting winning moves, strategy explanation, among others. These aspects are then combined to create a full manual for any given game. This manual is intended to provide a more intuitive explanation of a game's rules and mechanics, particularly for players who are less familiar with the Ludii game description language and grammar.
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
General Board Game Concepts
Authors:
Éric Piette,
Matthew Stephenson,
Dennis J. N. J. Soemers,
Cameron Browne
Abstract:
Many games often share common ideas or aspects between them, such as their rules, controls, or playing area. However, in the context of General Game Playing (GGP) for board games, this area remains under-explored. We propose to formalise the notion of "game concept", inspired by terms generally used by game players and designers. Through the Ludii General Game System, we describe concepts for seve…
▽ More
Many games often share common ideas or aspects between them, such as their rules, controls, or playing area. However, in the context of General Game Playing (GGP) for board games, this area remains under-explored. We propose to formalise the notion of "game concept", inspired by terms generally used by game players and designers. Through the Ludii General Game System, we describe concepts for several levels of abstraction, such as the game itself, the moves played, or the states reached. This new GGP feature associated with the ludeme representation of games opens many new lines of research. The creation of a hyper-agent selector, the transfer of AI learning between games, or explaining AI techniques using game terms, can all be facilitated by the use of game concepts. Other applications which can benefit from game concepts are also discussed, such as the generation of plausible reconstructed rules for incomplete ancient games, or the implementation of a board game recommender system.
△ Less
Submitted 2 July, 2021;
originally announced July 2021.
-
The Difficulty of Novelty Detection in Open-World Physical Domains: An Application to Angry Birds
Authors:
Vimukthini Pinto,
Cheng Xue,
Chathura Nagoda Gamage,
Matthew Stephenson,
Jochen Renz
Abstract:
Detecting and responding to novel situations in open-world environments is a key capability of human cognition and is a persistent problem for AI systems. In an open-world, novelties can appear in many different forms and may be easy or hard to detect. Therefore, to accurately evaluate the novelty detection capability of AI systems, it is necessary to investigate how difficult it may be to detect…
▽ More
Detecting and responding to novel situations in open-world environments is a key capability of human cognition and is a persistent problem for AI systems. In an open-world, novelties can appear in many different forms and may be easy or hard to detect. Therefore, to accurately evaluate the novelty detection capability of AI systems, it is necessary to investigate how difficult it may be to detect different types of novelty. In this paper, we propose a qualitative physics-based method to quantify the difficulty of novelty detection focusing on open-world physical domains. We apply our method in the popular physics simulation game Angry Birds, and conduct a user study across different novelties to validate our method. Results indicate that our calculated detection difficulties are in line with those of human users.
△ Less
Submitted 25 June, 2023; v1 submitted 16 June, 2021;
originally announced June 2021.
-
Deceptive Level Generation for Angry Birds
Authors:
Chathura Gamage,
Matthew Stephenson,
Vimukthini Pinto,
Jochen Renz
Abstract:
The Angry Birds AI competition has been held over many years to encourage the development of AI agents that can play Angry Birds game levels better than human players. Many different agents with various approaches have been employed over the competition's lifetime to solve this task. Even though the performance of these agents has increased significantly over the past few years, they still show ma…
▽ More
The Angry Birds AI competition has been held over many years to encourage the development of AI agents that can play Angry Birds game levels better than human players. Many different agents with various approaches have been employed over the competition's lifetime to solve this task. Even though the performance of these agents has increased significantly over the past few years, they still show major drawbacks in playing deceptive levels. This is because most of the current agents try to identify the best next shot rather than planning an effective sequence of shots. In order to encourage advancements in such agents, we present an automated methodology to generate deceptive game levels for Angry Birds. Even though there are many existing content generators for Angry Birds, they do not focus on generating deceptive levels. In this paper, we propose a procedure to generate deceptive levels for six deception categories that can fool the state-of-the-art Angry Birds playing AI agents. Our results show that generated deceptive levels exhibit similar characteristics of human-created deceptive levels. Additionally, we define metrics to measure the stability, solvability, and degree of deception of the generated levels.
△ Less
Submitted 3 June, 2021;
originally announced June 2021.
-
General Game Heuristic Prediction Based on Ludeme Descriptions
Authors:
Matthew Stephenson,
Dennis J. N. J. Soemers,
Eric Piette,
Cameron Browne
Abstract:
This paper investigates the performance of different general-game-playing heuristics for games in the Ludii general game system. Based on these results, we train several regression learning models to predict the performance of these heuristics based on each game's description file. We also provide a condensed analysis of the games available in Ludii, and the different ludemes that define them.
This paper investigates the performance of different general-game-playing heuristics for games in the Ludii general game system. Based on these results, we train several regression learning models to predict the performance of these heuristics based on each game's description file. We also provide a condensed analysis of the games available in Ludii, and the different ludemes that define them.
△ Less
Submitted 5 July, 2021; v1 submitted 26 May, 2021;
originally announced May 2021.
-
Transfer of Fully Convolutional Policy-Value Networks Between Games and Game Variants
Authors:
Dennis J. N. J. Soemers,
Vegard Mella,
Eric Piette,
Matthew Stephenson,
Cameron Browne,
Olivier Teytaud
Abstract:
In this paper, we use fully convolutional architectures in AlphaZero-like self-play training setups to facilitate transfer between variants of board games as well as distinct games. We explore how to transfer trained parameters of these architectures based on shared semantics of channels in the state and action representations of the Ludii general game system. We use Ludii's large library of games…
▽ More
In this paper, we use fully convolutional architectures in AlphaZero-like self-play training setups to facilitate transfer between variants of board games as well as distinct games. We explore how to transfer trained parameters of these architectures based on shared semantics of channels in the state and action representations of the Ludii general game system. We use Ludii's large library of games and game variants for extensive transfer learning evaluations, in zero-shot transfer experiments as well as experiments with additional fine-tuning time.
△ Less
Submitted 24 February, 2021;
originally announced February 2021.
-
Productivity of cellular-Lindelof spaces
Authors:
Alan Dow,
Robert M. Stephenson
Abstract:
The main purpose of this note is to prove that the product of a cellular Lindelof space with a space of countable spread need not be cellular-Lindelof.
The main purpose of this note is to prove that the product of a cellular Lindelof space with a space of countable spread need not be cellular-Lindelof.
△ Less
Submitted 4 December, 2020;
originally announced December 2020.
-
AZP: Automatic Specialization for Zero Values in Gaming Applications
Authors:
Mark W. Stephenson,
Ram Rangan
Abstract:
Recent research has shown that dynamic zeros in shader programs of gaming applications can be effectively leveraged with a profile-guided, code-versioning transform. This transform duplicates code, specializes one path assuming certain key program operands, called versioning variables, are zero, and leaves the other path unspecialized. Dynamically, depending on the versioning variable's value, eit…
▽ More
Recent research has shown that dynamic zeros in shader programs of gaming applications can be effectively leveraged with a profile-guided, code-versioning transform. This transform duplicates code, specializes one path assuming certain key program operands, called versioning variables, are zero, and leaves the other path unspecialized. Dynamically, depending on the versioning variable's value, either the specialized fast path or the default slow path will execute. Prior work applied this transform manually and showed promising gains on gaming applications. In this paper, we present AZP, an automatic compiler approach to perform the above code-versioning transform. Our framework automatically determines which versioning variables or combinations of them are profitable, and determines the code region to duplicate and specialize (called the versioning scope). AZP takes operand zero value probabilities as input and it then uses classical techniques such as constant folding and dead-code elimination to determine the most profitable versioning variables and their versioning scopes. This information is then used to affect the final transform in a straightforward manner. We demonstrate that AZP is able to achieve an average speedup of 16.4% for targeted shader programs, amounting to an average frame-rate speedup of 3.5% across a collection of modern gaming applications on an NVIDIA GeForce RTX 2080 GPU GPU.
△ Less
Submitted 20 November, 2020;
originally announced November 2020.
-
Training CNN Classifiers for Semantic Segmentation using Partially Annotated Images: with Application on Human Thigh and Calf MRI
Authors:
Chun Kit Wong,
Stephanie Marchesseau,
Maria Kalimeri,
Tiang Siew Yap,
Serena S. H. Teo,
Lingaraj Krishna,
Alfredo Franco-Obregón,
Stacey K. H. Tay,
Chin Meng Khoo,
Philip T. H. Lee,
Melvin K. S. Leow,
John J. Totman,
Mary C. Stephenson
Abstract:
Objective: Medical image datasets with pixel-level labels tend to have a limited number of organ or tissue label classes annotated, even when the images have wide anatomical coverage. With supervised learning, multiple classifiers are usually needed given these partially annotated datasets. In this work, we propose a set of strategies to train one single classifier in segmenting all label classes…
▽ More
Objective: Medical image datasets with pixel-level labels tend to have a limited number of organ or tissue label classes annotated, even when the images have wide anatomical coverage. With supervised learning, multiple classifiers are usually needed given these partially annotated datasets. In this work, we propose a set of strategies to train one single classifier in segmenting all label classes that are heterogeneously annotated across multiple datasets without moving into semi-supervised learning. Methods: Masks were first created from each label image through a process we termed presence masking. Three presence masking modes were evaluated, differing mainly in weightage assigned to the annotated and unannotated classes. These masks were then applied to the loss function during training to remove the influence of unannotated classes. Results: Evaluation against publicly available CT datasets shows that presence masking is a viable method for training class-generic classifiers. Our class-generic classifier can perform as well as multiple class-specific classifiers combined, while the training duration is similar to that required for one class-specific classifier. Furthermore, the class-generic classifier can outperform the class-specific classifiers when trained on smaller datasets. Finally, consistent results are observed from evaluations against human thigh and calf MRI datasets collected in-house. Conclusion: The evaluation outcomes show that presence masking is capable of significantly improving both training and inference efficiency across imaging modalities and anatomical regions. Improved performance may even be observed on small datasets. Significance: Presence masking strategies can reduce the computational resources and costs involved in manual medical image annotations. All codes are publicly available at https://github.com/wong-ck/DeepSegment.
△ Less
Submitted 16 August, 2020;
originally announced August 2020.
-
Manipulating the Distributions of Experience used for Self-Play Learning in Expert Iteration
Authors:
Dennis J. N. J. Soemers,
Éric Piette,
Matthew Stephenson,
Cameron Browne
Abstract:
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play. ExIt involves training a policy to mimic the search behaviour of a tree search algorithm - such as Monte-Carlo tree search - and using the trained policy to guide it. The policy and the tree search can then iteratively improve each other, through experience gathered in self-play between instances o…
▽ More
Expert Iteration (ExIt) is an effective framework for learning game-playing policies from self-play. ExIt involves training a policy to mimic the search behaviour of a tree search algorithm - such as Monte-Carlo tree search - and using the trained policy to guide it. The policy and the tree search can then iteratively improve each other, through experience gathered in self-play between instances of the guided tree search algorithm. This paper outlines three different approaches for manipulating the distribution of data collected from self-play, and the procedure that samples batches for learning updates from the collected data. Firstly, samples in batches are weighted based on the durations of the episodes in which they were originally experienced. Secondly, Prioritized Experience Replay is applied within the ExIt framework, to prioritise sampling experience from which we expect to obtain valuable training signals. Thirdly, a trained exploratory policy is used to diversify the trajectories experienced in self-play. This paper summarises the effects of these manipulations on training performance evaluated in fourteen different board games. We find major improvements in early training performance in some games, and minor improvements averaged over fourteen games.
△ Less
Submitted 30 May, 2020;
originally announced June 2020.
-
Estimating Silent Data Corruption Rates Using a Two-Level Model
Authors:
Siva Kumar Sastry Hari,
Paolo Rech,
Timothy Tsai,
Mark Stephenson,
Arslan Zulfiqar,
Michael Sullivan,
Philip Shirvani,
Paul Racunas,
Joel Emer,
Stephen W. Keckler
Abstract:
High-performance and safety-critical system architects must accurately evaluate the application-level silent data corruption (SDC) rates of processors to soft errors. Such an evaluation requires error propagation all the way from particle strikes on low-level state up to the program output. Existing approaches that rely on low-level simulations with fault injection cannot evaluate full application…
▽ More
High-performance and safety-critical system architects must accurately evaluate the application-level silent data corruption (SDC) rates of processors to soft errors. Such an evaluation requires error propagation all the way from particle strikes on low-level state up to the program output. Existing approaches that rely on low-level simulations with fault injection cannot evaluate full applications because of their slow speeds, while application-level accelerated fault testing in accelerated particle beams is often impractical. We present a new two-level methodology for application resilience evaluation that overcomes these challenges. The proposed approach decomposes application failure rate estimation into (1) identifying how particle strikes in low-level unprotected state manifest at the architecture-level, and (2) measuring how such architecture-level manifestations propagate to the program output. We demonstrate the effectiveness of this approach on GPU architectures. We also show that using just one of the two steps can overestimate SDC rates and produce different trends---the composition of the two is needed for accurate reliability modeling.
△ Less
Submitted 27 April, 2020;
originally announced May 2020.
-
Superstition in the Network: Deep Reinforcement Learning Plays Deceptive Games
Authors:
Philip Bontrager,
Ahmed Khalifa,
Damien Anderson,
Matthew Stephenson,
Christoph Salge,
Julian Togelius
Abstract:
Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm on four deceptive games, which are specially designed to provide challenges to game-playing agents. These games are implemented in the General Video Game AI fr…
▽ More
Deep reinforcement learning has learned to play many games well, but failed on others. To better characterize the modes and reasons of failure of deep reinforcement learners, we test the widely used Asynchronous Actor-Critic (A2C) algorithm on four deceptive games, which are specially designed to provide challenges to game-playing agents. These games are implemented in the General Video Game AI framework, which allows us to compare the behavior of reinforcement learning-based agents with planning agents based on tree search. We find that several of these games reliably deceive deep reinforcement learners, and that the resulting behavior highlights the shortcomings of the learning algorithm. The particular ways in which agents fail differ from how planning-based agents fail, further illuminating the character of these algorithms. We propose an initial typology of deceptions which could help us better understand pitfalls and failure modes of (deep) reinforcement learning.
△ Less
Submitted 12 August, 2019;
originally announced August 2019.
-
Ludii as a Competition Platform
Authors:
Matthew Stephenson,
Éric Piette,
Dennis J. N. J. Soemers,
Cameron Browne
Abstract:
Ludii is a general game system being developed as part of the ERC-funded Digital Ludeme Project (DLP). While its primary aim is to model, play, and analyse the full range of traditional strategy games, Ludii also has the potential to support a wide range of AI research topics and competitions. This paper describes some of the future competitions and challenges that we intend to run using the Ludii…
▽ More
Ludii is a general game system being developed as part of the ERC-funded Digital Ludeme Project (DLP). While its primary aim is to model, play, and analyse the full range of traditional strategy games, Ludii also has the potential to support a wide range of AI research topics and competitions. This paper describes some of the future competitions and challenges that we intend to run using the Ludii system, highlighting some of its most important aspects that can potentially lead to many algorithm improvements and new avenues of research. We compare and contrast our proposed competition motivations, goals and frameworks against those of existing general game playing competitions, addressing the strengths and weaknesses of each platform.
△ Less
Submitted 29 June, 2019;
originally announced July 2019.
-
Ludii and XCSP: Playing and Solving Logic Puzzles
Authors:
Cédric Piette,
Éric Piette,
Matthew Stephenson,
Dennis J. N. J. Soemers,
Cameron Browne
Abstract:
Many of the famous single-player games, commonly called puzzles, can be shown to be NP-Complete. Indeed, this class of complexity contains hundreds of puzzles, since people particularly appreciate completing an intractable puzzle, such as Sudoku, but also enjoy the ability to check their solution easily once it's done. For this reason, using constraint programming is naturally suited to solve them…
▽ More
Many of the famous single-player games, commonly called puzzles, can be shown to be NP-Complete. Indeed, this class of complexity contains hundreds of puzzles, since people particularly appreciate completing an intractable puzzle, such as Sudoku, but also enjoy the ability to check their solution easily once it's done. For this reason, using constraint programming is naturally suited to solve them. In this paper, we focus on logic puzzles described in the Ludii general game system and we propose using the XCSP formalism in order to solve them with any CSP solver.
△ Less
Submitted 29 June, 2019;
originally announced July 2019.
-
An Empirical Evaluation of Two General Game Systems: Ludii and RBG
Authors:
Éric Piette,
Matthew Stephenson,
Dennis J. N. J. Soemers,
Cameron Browne
Abstract:
Although General Game Playing (GGP) systems can facilitate useful research in Artificial Intelligence (AI) for game-playing, they are often computationally inefficient and somewhat specialised to a specific class of games. However, since the start of this year, two General Game Systems have emerged that provide efficient alternatives to the academic state of the art -- the Game Description Languag…
▽ More
Although General Game Playing (GGP) systems can facilitate useful research in Artificial Intelligence (AI) for game-playing, they are often computationally inefficient and somewhat specialised to a specific class of games. However, since the start of this year, two General Game Systems have emerged that provide efficient alternatives to the academic state of the art -- the Game Description Language (GDL). In order of publication, these are the Regular Boardgames language (RBG), and the Ludii system. This paper offers an experimental evaluation of Ludii. Here, we focus mainly on a comparison between the two new systems in terms of two key properties for any GGP system: simplicity/clarity (e.g. human-readability), and efficiency.
△ Less
Submitted 29 June, 2019;
originally announced July 2019.
-
An Overview of the Ludii General Game System
Authors:
Matthew Stephenson,
Éric Piette,
Dennis J. N. J. Soemers,
Cameron Browne
Abstract:
The Digital Ludeme Project (DLP) aims to reconstruct and analyse over 1000 traditional strategy games using modern techniques. One of the key aspects of this project is the development of Ludii, a general game system that will be able to model and play the complete range of games required by this project. Such an undertaking will create a wide range of possibilities for new AI challenges. In this…
▽ More
The Digital Ludeme Project (DLP) aims to reconstruct and analyse over 1000 traditional strategy games using modern techniques. One of the key aspects of this project is the development of Ludii, a general game system that will be able to model and play the complete range of games required by this project. Such an undertaking will create a wide range of possibilities for new AI challenges. In this paper we describe many of the features of Ludii that can be used. This includes designing and modifying games using the Ludii game description language, creating agents capable of playing these games, and several advantages the system has over prior general game software.
△ Less
Submitted 29 June, 2019;
originally announced July 2019.
-
"Did You Hear That?" Learning to Play Video Games from Audio Cues
Authors:
Raluca D. Gaina,
Matthew Stephenson
Abstract:
Game-playing AI research has focused for a long time on learning to play video games from visual input or symbolic information. However, humans benefit from a wider array of sensors which we utilise in order to navigate the world around us. In particular, sounds and music are key to how many of us perceive the world and influence the decisions we make. In this paper, we present initial experiments…
▽ More
Game-playing AI research has focused for a long time on learning to play video games from visual input or symbolic information. However, humans benefit from a wider array of sensors which we utilise in order to navigate the world around us. In particular, sounds and music are key to how many of us perceive the world and influence the decisions we make. In this paper, we present initial experiments on game-playing agents learning to play video games solely from audio cues. We expand the Video Game Description Language to allow for audio specification, and the General Video Game AI framework to provide new audio games and an API for learning agents to make use of audio observations. We analyse the games and the audio game design process, include initial results with simple Q~Learning agents, and encourage further research in this area.
△ Less
Submitted 11 June, 2019; v1 submitted 10 June, 2019;
originally announced June 2019.
-
Foundations of Digital Archæoludology
Authors:
Cameron Browne,
Dennis J. N. J. Soemers,
Éric Piette,
Matthew Stephenson,
Michael Conrad,
Walter Crist,
Thierry Depaulis,
Eddie Duggan,
Fred Horn,
Steven Kelk,
Simon M. Lucas,
João Pedro Neto,
David Parlett,
Abdallah Saffidine,
Ulrich Schädler,
Jorge Nuno Silva,
Alex de Voogt,
Mark H. M. Winands
Abstract:
Digital Archaeoludology (DAL) is a new field of study involving the analysis and reconstruction of ancient games from incomplete descriptions and archaeological evidence using modern computational techniques. The aim is to provide digital tools and methods to help game historians and other researchers better understand traditional games, their development throughout recorded human history, and the…
▽ More
Digital Archaeoludology (DAL) is a new field of study involving the analysis and reconstruction of ancient games from incomplete descriptions and archaeological evidence using modern computational techniques. The aim is to provide digital tools and methods to help game historians and other researchers better understand traditional games, their development throughout recorded human history, and their relationship to the development of human culture and mathematical knowledge. This work is being explored in the ERC-funded Digital Ludeme Project.
The aim of this inaugural international research meeting on DAL is to gather together leading experts in relevant disciplines - computer science, artificial intelligence, machine learning, computational phylogenetics, mathematics, history, archaeology, anthropology, etc. - to discuss the key themes and establish the foundations for this new field of research, so that it may continue beyond the lifetime of its initiating project.
△ Less
Submitted 31 May, 2019;
originally announced May 2019.
-
Using Restart Heuristics to Improve Agent Performance in Angry Birds
Authors:
Tommy Liu,
Jochen Renz,
Peng Zhang,
Matthew Stephenson
Abstract:
Over the past few years the Angry Birds AI competition has been held in an attempt to develop intelligent agents that can successfully and efficiently solve levels for the video game Angry Birds. Many different agents and strategies have been developed to solve the complex and challenging physical reasoning problems associated with such a game. However none of these agents attempt one of the key s…
▽ More
Over the past few years the Angry Birds AI competition has been held in an attempt to develop intelligent agents that can successfully and efficiently solve levels for the video game Angry Birds. Many different agents and strategies have been developed to solve the complex and challenging physical reasoning problems associated with such a game. However none of these agents attempt one of the key strategies which humans employ to solve Angry Birds levels, which is restarting levels. Restarting is important in Angry Birds because sometimes the level is no longer solvable or some given shot made has little to no benefit towards the ultimate goal of the game. This paper proposes a framework and experimental evaluation for when to restart levels in Angry Birds. We demonstrate that restarting is a viable strategy to improve agent performance in many cases.
△ Less
Submitted 30 May, 2019;
originally announced May 2019.
-
Learning Policies from Self-Play with Policy Gradients and MCTS Value Estimates
Authors:
Dennis J. N. J. Soemers,
Éric Piette,
Matthew Stephenson,
Cameron Browne
Abstract:
In recent years, state-of-the-art game-playing agents often involve policies that are trained in self-playing processes where Monte Carlo tree search (MCTS) algorithms and trained policies iteratively improve each other. The strongest results have been obtained when policies are trained to mimic the search behaviour of MCTS by minimising a cross-entropy loss. Because MCTS, by design, includes an e…
▽ More
In recent years, state-of-the-art game-playing agents often involve policies that are trained in self-playing processes where Monte Carlo tree search (MCTS) algorithms and trained policies iteratively improve each other. The strongest results have been obtained when policies are trained to mimic the search behaviour of MCTS by minimising a cross-entropy loss. Because MCTS, by design, includes an element of exploration, policies trained in this manner are also likely to exhibit a similar extent of exploration. In this paper, we are interested in learning policies for a project with future goals including the extraction of interpretable strategies, rather than state-of-the-art game-playing performance. For these goals, we argue that such an extent of exploration is undesirable, and we propose a novel objective function for training policies that are not exploratory. We derive a policy gradient expression for maximising this objective function, which can be estimated using MCTS value estimates, rather than MCTS visit counts. We empirically evaluate various properties of resulting policies, in a variety of board games.
△ Less
Submitted 14 May, 2019;
originally announced May 2019.
-
Ludii -- The Ludemic General Game System
Authors:
Éric Piette,
Dennis J. N. J. Soemers,
Matthew Stephenson,
Chiara F. Sironi,
Mark H. M. Winands,
Cameron Browne
Abstract:
While current General Game Playing (GGP) systems facilitate useful research in Artificial Intelligence (AI) for game-playing, they are often somewhat specialised and computationally inefficient. In this paper, we describe the "ludemic" general game system Ludii, which has the potential to provide an efficient tool for AI researchers as well as game designers, historians, educators and practitioner…
▽ More
While current General Game Playing (GGP) systems facilitate useful research in Artificial Intelligence (AI) for game-playing, they are often somewhat specialised and computationally inefficient. In this paper, we describe the "ludemic" general game system Ludii, which has the potential to provide an efficient tool for AI researchers as well as game designers, historians, educators and practitioners in related fields. Ludii defines games as structures of ludemes -- high-level, easily understandable game concepts -- which allows for concise and human-understandable game descriptions. We formally describe Ludii and outline its main benefits: generality, extensibility, understandability and efficiency. Experimentally, Ludii outperforms one of the most efficient Game Description Language (GDL) reasoners, based on a propositional network, in all games available in the Tiltyard GGP repository. Moreover, Ludii is also competitive in terms of performance with the more recently proposed Regular Boardgames (RBG) system, and has various advantages in qualitative aspects such as generality.
△ Less
Submitted 21 February, 2020; v1 submitted 13 May, 2019;
originally announced May 2019.
-
Agent-Based Adaptive Level Generation for Dynamic Difficulty Adjustment in Angry Birds
Authors:
Matthew Stephenson,
Jochen Renz
Abstract:
This paper presents an adaptive level generation algorithm for the physics-based puzzle game Angry Birds. The proposed algorithm is based on a pre-existing level generator for this game, but where the difficulty of the generated levels can be adjusted based on the player's performance. This allows for the creation of personalised levels tailored specifically to the player's own abilities. The effe…
▽ More
This paper presents an adaptive level generation algorithm for the physics-based puzzle game Angry Birds. The proposed algorithm is based on a pre-existing level generator for this game, but where the difficulty of the generated levels can be adjusted based on the player's performance. This allows for the creation of personalised levels tailored specifically to the player's own abilities. The effectiveness of our proposed method is evaluated using several agents with differing strategies and AI techniques. By using these agents as models / representations of real human player's characteristics, we can optimise level properties efficiently over a large number of generations. As a secondary investigation, we also demonstrate that by combining the performance of several agents together it is possible to generate levels that are especially challenging for certain players but not others.
△ Less
Submitted 7 February, 2019;
originally announced February 2019.
-
The Computational Complexity of Angry Birds
Authors:
Matthew Stephenson,
Jochen Renz,
Xiaoyu Ge
Abstract:
The physics-based simulation game Angry Birds has been heavily researched by the AI community over the past five years, and has been the subject of a popular AI competition that is currently held annually as part of a leading AI conference. Develo** intelligent agents that can play this game effectively has been an incredibly complex and challenging problem for traditional AI techniques to solve…
▽ More
The physics-based simulation game Angry Birds has been heavily researched by the AI community over the past five years, and has been the subject of a popular AI competition that is currently held annually as part of a leading AI conference. Develo** intelligent agents that can play this game effectively has been an incredibly complex and challenging problem for traditional AI techniques to solve, even though the game is simple enough that any human player could learn and master it within a short time. In this paper we analyse how hard the problem really is, presenting several proofs for the computational complexity of Angry Birds. By using a combination of several gadgets within this game's environment, we are able to demonstrate that the decision problem of solving general levels for different versions of Angry Birds is either NP-hard, PSPACE-hard, PSPACE-complete or EXPTIME-hard. Proof of NP-hardness is by reduction from 3-SAT, whilst proof of PSPACE-hardness is by reduction from True Quantified Boolean Formula (TQBF). Proof of EXPTIME-hardness is by reduction from G2, a known EXPTIME-complete problem similar to that used for many previous games such as Chess, Go and Checkers. To the best of our knowledge, this is the first time that a single-player game has been proven EXPTIME-hard. This is achieved by using stochastic game engine dynamics to effectively model the real world, or in our case the physics simulator, as the opponent against which we are playing. These proofs can also be extended to other physics-based games with similar mechanics.
△ Less
Submitted 15 January, 2020; v1 submitted 19 December, 2018;
originally announced December 2018.
-
A Continuous Information Gain Measure to Find the Most Discriminatory Problems for AI Benchmarking
Authors:
Matthew Stephenson,
Damien Anderson,
Ahmed Khalifa,
John Levine,
Jochen Renz,
Julian Togelius,
Christoph Salge
Abstract:
This paper introduces an information-theoretic method for selecting a subset of problems which gives the most information about a group of problem-solving algorithms. This method was tested on the games in the General Video Game AI (GVGAI) framework, allowing us to identify a smaller set of games that still gives a large amount of information about the abilities of different game-playing agents. T…
▽ More
This paper introduces an information-theoretic method for selecting a subset of problems which gives the most information about a group of problem-solving algorithms. This method was tested on the games in the General Video Game AI (GVGAI) framework, allowing us to identify a smaller set of games that still gives a large amount of information about the abilities of different game-playing agents. This approach can be used to make agent testing more efficient. We can achieve almost as good discriminatory accuracy when testing on only a handful of games as when testing on more than a hundred games, something which is often computationally infeasible. Furthermore, this method can be extended to study the dimensions of the effective variance in game design between these games, allowing us to identify which games differentiate between agents in the most complementary ways.
△ Less
Submitted 18 May, 2020; v1 submitted 8 September, 2018;
originally announced September 2018.
-
The 2017 AIBIRDS Competition
Authors:
Matthew Stephenson,
Jochen Renz,
Xiaoyu Ge,
Peng Zhang
Abstract:
This paper presents an overview of the sixth AIBIRDS competition, held at the 26th International Joint Conference on Artificial Intelligence. This competition tasked participants with develo** an intelligent agent which can play the physics-based puzzle game Angry Birds. This game uses a sophisticated physics engine that requires agents to reason and predict the outcome of actions with only limi…
▽ More
This paper presents an overview of the sixth AIBIRDS competition, held at the 26th International Joint Conference on Artificial Intelligence. This competition tasked participants with develo** an intelligent agent which can play the physics-based puzzle game Angry Birds. This game uses a sophisticated physics engine that requires agents to reason and predict the outcome of actions with only limited environmental information. Agents entered into this competition were required to solve a wide assortment of previously unseen levels within a set time limit. The physical reasoning and planning required to solve these levels are very similar to those of many real-world problems. This year's competition featured some of the best agents developed so far and even included several new AI techniques such as deep reinforcement learning. Within this paper we describe the framework, rules, submitted agents and results for this competition. We also provide some background information on related work and other video game AI competitions, as well as discussing some potential ideas for future AIBIRDS competitions and agent improvements.
△ Less
Submitted 14 March, 2018;
originally announced March 2018.
-
Deceptive Games
Authors:
Damien Anderson,
Matthew Stephenson,
Julian Togelius,
Christian Salge,
John Levine,
Jochen Renz
Abstract:
Deceptive games are games where the reward structure or other aspects of the game are designed to lead the agent away from a globally optimal policy. While many games are already deceptive to some extent, we designed a series of games in the Video Game Description Language (VGDL) implementing specific types of deception, classified by the cognitive biases they exploit. VGDL games can be run in the…
▽ More
Deceptive games are games where the reward structure or other aspects of the game are designed to lead the agent away from a globally optimal policy. While many games are already deceptive to some extent, we designed a series of games in the Video Game Description Language (VGDL) implementing specific types of deception, classified by the cognitive biases they exploit. VGDL games can be run in the General Video Game Artificial Intelligence (GVGAI) Framework, making it possible to test a variety of existing AI agents that have been submitted to the GVGAI Competition on these deceptive games. Our results show that all tested agents are vulnerable to several kinds of deception, but that different agents have different weaknesses. This suggests that we can use deception to understand the capabilities of a game-playing algorithm, and game-playing algorithms to characterize the deception displayed by a game.
△ Less
Submitted 4 February, 2018; v1 submitted 31 January, 2018;
originally announced February 2018.
-
Scaling and Eigenmode Tests of the Improved Fat Clover Action
Authors:
Mark Stephenson,
Carleton DeTar,
Thomas DeGrand,
Anna Hasenfratz
Abstract:
We test a recently proposed improved lattice-fermion action, the fat link clover action, examining indicators of pathological small-quark-mass lattice artifacts ("exceptional configurations") on quenched lattices of spacing 0.12 fm and studying scaling properties of the light hadron spectrum for lattice spacing a=0.09 and 0.16 fm. We show that the action apparently has fewer problems with pathol…
▽ More
We test a recently proposed improved lattice-fermion action, the fat link clover action, examining indicators of pathological small-quark-mass lattice artifacts ("exceptional configurations") on quenched lattices of spacing 0.12 fm and studying scaling properties of the light hadron spectrum for lattice spacing a=0.09 and 0.16 fm. We show that the action apparently has fewer problems with pathological lattice artifacts than the conventional nonperturbatively improved clover action and its spectrum scales just as well.
△ Less
Submitted 10 April, 2000; v1 submitted 12 October, 1999;
originally announced October 1999.