Showing 1–39 of 39 results for author: Ontanon, S

Searching in archive cs.
  1. arXiv:2403.05530

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2310.04418

    cs.LG

    Functional Interpolation for Relative Positions Improves Long Context Transformers

    Authors: Shanda Li, Chong You, Guru Guruganesh, Joshua Ainslie, Santiago Ontanon, Manzil Zaheer, Sumit Sanghai, Yiming Yang, Sanjiv Kumar, Srinadh Bhojanapalli

    Abstract: Preventing the performance decay of Transformers on inputs longer than those used for training has been an important challenge in extending the context length of these models. Though the Transformer architecture has fundamentally no limits on the input sequence lengths it can process, the choice of position encoding used during training can limit the performance of these models on longer inputs. W…

    Submitted 2 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 26 pages; ICLR 2024 camera ready version

  3. arXiv:2310.00036

    cs.LG

    Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform

    Authors: Shengyi Huang, Jiayi Weng, Rujikorn Charakorn, Min Lin, Zhongwen Xu, Santiago Ontañón

    Abstract: Distributed Deep Reinforcement Learning (DRL) aims to leverage more computational resources to train autonomous agents with less training time. Despite recent progress in the field, reproducibility issues have not been sufficiently explored. This paper first shows that the typical actor-learner framework can have reproducibility issues even if hyperparameters are controlled. We then introduce Clea…

    Submitted 29 September, 2023; originally announced October 2023.

  4. arXiv:2308.14903

    cs.CL

    MEMORY-VQ: Compression for Tractable Internet-Scale Memory

    Authors: Yury Zemlyanskiy, Michiel de Jong, Luke Vilnis, Santiago Ontañón, William W. Cohen, Sumit Sanghai, Joshua Ainslie

    Abstract: Retrieval augmentation is a powerful but expensive method to make language models more knowledgeable about the world. Memory-based methods like LUMEN pre-compute token representations for retrieved passages to drastically speed up inference. However, memory also leads to much greater storage requirements from storing pre-computed representations. We propose MEMORY-VQ, a new method to reduce stor…

    Submitted 28 August, 2023; originally announced August 2023.

  5. arXiv:2305.11129

    cs.CL

    mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences

    Authors: David Uthus, Santiago Ontañón, Joshua Ainslie, Mandy Guo

    Abstract: We present our work on developing a multilingual, efficient text-to-text transformer that is suitable for handling long inputs. This model, called mLongT5, builds upon the architecture of LongT5, while leveraging the multilingual datasets used for pretraining mT5 and the pretraining tasks of UL2. We evaluate this model on a variety of multilingual summarization and question-answering tasks, and th…

    Submitted 26 October, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  6. arXiv:2305.06218

    cs.CL cs.AI cs.IR

    Multi-Task End-to-End Training Improves Conversational Recommendation

    Authors: Naveen Ram, Dima Kuzmin, Ellie Ka In Chio, Moustafa Farid Alzantot, Santiago Ontanon, Ambarish Jash, Judith Yue Li

    Abstract: In this paper, we analyze the performance of a multitask end-to-end transformer model on the task of conversational recommendations, which aim to provide recommendations based on a user's explicit preferences expressed in dialogue. While previous works in this area adopt complex multi-component approaches where the dialogue management and entity recommendation tasks are handled by separate compone…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 10 pages, 4 tables, 1 figure

  7. arXiv:2303.09752

    cs.CL cs.LG

    CoLT5: Faster Long-Range Transformers with Conditional Computation

    Authors: Joshua Ainslie, Tao Lei, Michiel de Jong, Santiago Ontañón, Siddhartha Brahma, Yury Zemlyanskiy, David Uthus, Mandy Guo, James Lee-Thorp, Yi Tay, Yun-Hsuan Sung, Sumit Sanghai

    Abstract: Many natural language processing tasks benefit from long inputs, but processing long documents with Transformers is expensive -- not only due to quadratic attention complexity but also from applying feedforward and projection layers to every token. However, not all tokens are equally important, especially for longer documents. We propose CoLT5, a long-input Transformer model that builds on this in…

    Submitted 23 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted at EMNLP 2023

  8. Improving Fairness in Adaptive Social Exergames via Shapley Bandits

    Authors: Robert C. Gray, Jennifer Villareale, Thomas B. Fox, Diane H. Dallal, Santiago Ontañón, Danielle Arigo, Shahin Jabbari, Jichen Zhu

    Abstract: Algorithmic fairness is an essential requirement as AI becomes integrated in society. In the case of social applications where AI distributes resources, algorithms often must make decisions that will benefit a subset of users, sometimes repeatedly or exclusively, while attempting to maximize specific outcomes. How should we design such systems to serve users more fairly? This paper explores this q…

    Submitted 21 February, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

  9. arXiv:2205.09123

    cs.LG

    A2C is a special case of PPO

    Authors: Shengyi Huang, Anssi Kanervisto, Antonin Raffin, Weixun Wang, Santiago Ontañón, Rousslan Fernand Julien Dossa

    Abstract: Advantage Actor-critic (A2C) and Proximal Policy Optimization (PPO) are popular deep reinforcement learning algorithms used for game AI in recent years. A common understanding is that A2C and PPO are separate algorithms because PPO's clipped objective appears significantly different than A2C's objective. In this paper, however, we show A2C is a special case of PPO. We present theoretical justifica…

    Submitted 18 May, 2022; originally announced May 2022.

  10. arXiv:2203.15099

    cs.AI

    LogicInference: A New Dataset for Teaching Logical Inference to seq2seq Models

    Authors: Santiago Ontanon, Joshua Ainslie, Vaclav Cvicek, Zachary Fisher

    Abstract: Machine learning models such as Transformers or LSTMs struggle with tasks that are compositional in nature such as those involving reasoning/inference. Although many datasets exist to evaluate compositional generalization, when it comes to evaluating inference abilities, options are more limited. This paper presents LogicInference, a new dataset to evaluate the ability of models to perform logical…

    Submitted 11 April, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted at ICLR 2022 OSC workshop (v3 contains updated results after fixing a problem in dataset generation)

  11. arXiv:2112.07916

    cs.CL

    LongT5: Efficient Text-To-Text Transformer for Long Sequences

    Authors: Mandy Guo, Joshua Ainslie, David Uthus, Santiago Ontanon, Jianmo Ni, Yun-Hsuan Sung, Yinfei Yang

    Abstract: Recent work has shown that either (1) increasing the input length or (2) increasing model size can improve the performance of Transformer-based neural models. In this paper, we present a new model, called LongT5, with which we explore the effects of scaling both the input length and model size at the same time. Specifically, we integrated attention ideas from long-input transformers (ETC), and ado…

    Submitted 3 May, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted in NAACL 2022

  12. arXiv:2111.06774

    cs.CV cs.HC cs.LG

    Identifying On-road Scenarios Predictive of ADHD using Driving Simulator Time Series Data

    Authors: David Grethlein, Aleksanteri Sladek, Santiago Ontañón

    Abstract: In this paper we introduce a novel algorithm called Iterative Section Reduction (ISR) to automatically identify sub-intervals of spatiotemporal time series that are predictive of a target classification task. Specifically, using data collected from a driving simulator study, we identify which spatial regions (dubbed "sections") along the simulated routes tend to manifest driving behaviors that are…

    Submitted 12 November, 2021; originally announced November 2021.

  13. arXiv:2110.04169

    cs.LG cs.CL

    Iterative Decoding for Compositional Generalization in Transformers

    Authors: Luana Ruiz, Joshua Ainslie, Santiago Ontañón

    Abstract: Deep learning models generalize well to in-distribution data but struggle to generalize compositionally, i.e., to combine a set of learned primitives to solve more complex tasks. In sequence-to-sequence (seq2seq) learning, transformers are often unable to predict correct outputs for longer examples than those seen at training. This paper introduces iterative decoding, an alternative to seq2seq tha…

    Submitted 9 December, 2021; v1 submitted 8 October, 2021; originally announced October 2021.

  14. arXiv:2108.04378

    cs.AI cs.CL

    Making Transformers Solve Compositional Tasks

    Authors: Santiago Ontañón, Joshua Ainslie, Vaclav Cvicek, Zachary Fisher

    Abstract: Several studies have reported the inability of Transformer models to generalize compositionally, a key type of generalization in many NLP tasks such as semantic parsing. In this paper we explore the design space of Transformer models showing that the inductive biases given to the model by several design decisions significantly impact compositional generalization. Through this exploration, we ident…

    Submitted 3 March, 2022; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: Source code: https://github.com/google-research/google-research/tree/master/compositional_transformers

    Journal ref: ACL 2022

  15. arXiv:2106.10434

    cs.LG cs.CL

    Improving Compositional Generalization in Classification Tasks via Structure Annotations

    Authors: Juyong Kim, Pradeep Ravikumar, Joshua Ainslie, Santiago Ontañón

    Abstract: Compositional generalization is the ability to generalize systematically to a new data distribution by combining known components. Although humans seem to have a great ability to generalize compositionally, state-of-the-art neural models struggle to do so. In this work, we study compositional generalization in classification tasks and present two main contributions. First, we study ways to convert…

    Submitted 19 June, 2021; originally announced June 2021.

    Comments: Accepted as a short paper at ACL 2021

  16. arXiv:2105.13807

    cs.LG

    Gym-$μ$RTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning

    Authors: Shengyi Huang, Santiago Ontañón, Chris Bamford, Lukasz Grela

    Abstract: In recent years, researchers have achieved great success in applying Deep Reinforcement Learning (DRL) algorithms to Real-time Strategy (RTS) games, creating strong autonomous agents that could defeat professional players in StarCraft II. However, existing approaches to tackle full games have high computational costs, usually requiring the use of thousands of GPUs and CPUs for weeks. This paper ha…

    Submitted 28 July, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: Accepted to IEEE Conference of Games (COG) 2021. See the blog post at https://wandb.ai/vwxyzjn/gym-microrts-paper/reports/Gym-RTS-Toward-Affordable-Deep-Reinforcement-Learning-Research-in-Real-Time-Strategy-Games--Vmlldzo2MDIzMTg and the source code at https://github.com/vwxyzjn/gym-microrts-paper

  17. arXiv:2105.09930

    cs.SD cs.CL cs.LG eess.AS

    Mondegreen: A Post-Processing Solution to Speech Recognition Error Correction for Voice Search Queries

    Authors: Sukhdeep S. Sodhi, Ellie Ka-In Chio, Ambarish Jash, Santiago Ontañón, Ajit Apte, Ankit Kumar, Ayooluwakunmi Jeje, Dima Kuzmin, Harry Fung, Heng-Tze Cheng, Jon Effrat, Tarush Bali, Nitin Jindal, Pei Cao, Sarvjeet Singh, Senqiang Zhou, Tameen Khan, Amol Wankhede, Moustafa Alzantot, Allen Wu, Tushar Chandra

    Abstract: As more and more online search queries come from voice, automatic speech recognition becomes a key component to deliver relevant search results. Errors introduced by automatic speech recognition (ASR) lead to irrelevant search results returned to the user, thus causing user dissatisfaction. In this paper, we introduce an approach, Mondegreen, to correct voice queries in text space without dependin…

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: Accepted in KDD 2021

  18. arXiv:2105.03824

    cs.CL cs.LG

    FNet: Mixing Tokens with Fourier Transforms

    Authors: James Lee-Thorp, Joshua Ainslie, Ilya Eckstein, Santiago Ontanon

    Abstract: We show that Transformer encoder architectures can be sped up, with limited accuracy costs, by replacing the self-attention sublayers with simple linear transformations that "mix" input tokens. These linear mixers, along with standard nonlinearities in feed-forward layers, prove competent at modeling semantic relationships in several text classification tasks. Most surprisingly, we find that repla…

    Submitted 26 May, 2022; v1 submitted 8 May, 2021; originally announced May 2021.

    Comments: To appear at NAACL 2022

  19. arXiv:2103.01771

    cs.HC

    The Personalization Paradox: the Conflict between Accurate User Models and Personalized Adaptive Systems

    Authors: Santiago Ontañón, Jichen Zhu

    Abstract: Personalized adaptation technology has been adopted in a wide range of digital applications such as health, training and education, e-commerce and entertainment. Personalization systems typically build a user model, aiming to characterize the user at hand, and then use this model to personalize the interaction. Personalization and user modeling, however, are often intrinsically at odds with each o…

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2101.10020

  20. arXiv:2102.07548

    cs.AI cs.HC

    Player-Centered AI for Automatic Game Personalization: Open Problems

    Authors: Jichen Zhu, Santiago Ontañón

    Abstract: Computer games represent an ideal research domain for the next generation of personalized digital applications. This paper presents a player-centered framework of AI for game personalization, complementary to the commonly used system-centered approaches. Built on the Structure of Actions theory, the paper maps out the current landscape of game personalization research and identifies eight open pro…

    Submitted 15 February, 2021; originally announced February 2021.

  21. arXiv:2102.05264

    cs.AI cs.LG

    Player Modeling via Multi-Armed Bandits

    Authors: Robert C. Gray, Jichen Zhu, Dannielle Arigo, Evan Forman, Santiago Ontañón

    Abstract: This paper focuses on building personalized player models solely from player behavior in the context of adaptive games. We present two main contributions: The first is a novel approach to player modeling based on multi-armed bandits (MABs). This approach addresses, at the same time and in a principled way, both the problem of collecting data to model the characteristics of interest for the current…

    Submitted 10 February, 2021; originally announced February 2021.

    Journal ref: In Proceedings of the International Conference on the Foundations of Digital Games (FDG 2020)

  22. arXiv:2102.05263

    cs.LG cs.AI stat.ML

    Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits

    Authors: Robert C. Gray, Jichen Zhu, Santiago Ontañón

    Abstract: This paper explores multi-armed bandit (MAB) strategies in very short horizon scenarios, i.e., when the bandit strategy is only allowed very few interactions with the environment. This is an understudied setting in the MAB literature with many applications in the context of games, such as player modeling. Specifically, we pursue three different ideas. First, we explore the use of regression oracle…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 8 pages

    Journal ref: In proceedings of the 2020 IEEE Conference on Games (CoG) (pp. 312-319)

  23. arXiv:2101.10020

    cs.HC cs.AI

    Personalization Paradox in Behavior Change Apps: Lessons from a Social Comparison-Based Personalized App for Physical Activity

    Authors: Jichen Zhu, Diane H. Dallal, Robert C. Gray, Jennifer Villareale, Santiago Ontañón, Evan M. Forman, Danielle Arigo

    Abstract: Social comparison-based features are widely used in social computing apps. However, most existing apps are not grounded in social comparison theories and do not consider individual differences in social comparison preferences and reactions. This paper is among the first to automatically personalize social comparison targets. In the context of an m-health app for physical activity, we use artificia…

    Submitted 11 February, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

  24. arXiv:2010.03956

    cs.LG stat.ML

    Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games

    Authors: Shengyi Huang, Santiago Ontañón

    Abstract: Training agents using Reinforcement Learning in games with sparse rewards is a challenging problem, since large amounts of exploration are required to retrieve even the first reward. To tackle this problem, a common approach is to use reward shaping to help exploration. However, an important drawback of reward shaping is that agents sometimes learn to optimize the shaped reward instead of the true…

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: Preprint

  25. arXiv:2007.14062

    cs.LG cs.CL stat.ML

    Big Bird: Transformers for Longer Sequences

    Authors: Manzil Zaheer, Guru Guruganesh, Avinava Dubey, Joshua Ainslie, Chris Alberti, Santiago Ontanon, Philip Pham, Anirudh Ravula, Qifan Wang, Li Yang, Amr Ahmed

    Abstract: Transformers-based models, such as BERT, have been one of the most successful deep learning models for NLP. Unfortunately, one of their core limitations is the quadratic dependency (mainly in terms of memory) on the sequence length due to their full attention mechanism. To remedy this, we propose BigBird, a sparse attention mechanism that reduces this quadratic dependency to linear. We show that…

    Submitted 8 January, 2021; v1 submitted 28 July, 2020; originally announced July 2020.

    Journal ref: Neural Information Processing Systems (NeurIPS) 2020

  26. arXiv:2006.14171

    cs.LG cs.AI stat.ML

    A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

    Authors: Shengyi Huang, Santiago Ontañón

    Abstract: In recent years, Deep Reinforcement Learning (DRL) algorithms have achieved state-of-the-art performance in many challenging strategy games. Because these games have complicated rules, an action sampled from the full discrete action distribution predicted by the learned policy is likely to be invalid according to the game rules (e.g., walking into a wall). The usual approach to deal with this prob…

    Submitted 30 May, 2022; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: Accepted into the proceedings of International FLAIRS Conference Proceedings, Vol. 35 (2022)

    Journal ref: FLAIRS. Vol. 35 (2022)

  27. arXiv:2005.04789

    cs.HC cs.CY

    Understanding Learners' Problem-Solving Strategies in Concurrent and Parallel Programming: A Game-Based Approach

    Authors: Jichen Zhu, Katelyn Alderfer, Brian Smith, Bruce Char, Santiago Ontañón

    Abstract: Concurrent and parallel programming (CPP) is an increasingly important subject in Computer Science Education. However, the conceptual shift from sequential programming is notoriously difficult to make. Currently, relatively little research exists on how people learn CPP core concepts. This paper presents our results of using Parallel, an educational game about CPP, focusing on the learners' self-e…

    Submitted 10 May, 2020; originally announced May 2020.

    Comments: Submitted to CHI Play '20

  28. arXiv:2004.08483

    cs.LG stat.ML

    ETC: Encoding Long and Structured Inputs in Transformers

    Authors: Joshua Ainslie, Santiago Ontanon, Chris Alberti, Vaclav Cvicek, Zachary Fisher, Philip Pham, Anirudh Ravula, Sumit Sanghai, Qifan Wang, Li Yang

    Abstract: Transformer models have advanced the state of the art in many Natural Language Processing (NLP) tasks. In this paper, we present a new Transformer architecture, Extended Transformer Construction (ETC), that addresses two key challenges of standard Transformer architectures, namely scaling input length and encoding structured inputs. To scale attention to longer inputs, we introduce a novel global-…

    Submitted 27 October, 2020; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: Accepted at EMNLP 2020

  29. arXiv:2002.07420

    cs.AI cs.LG

    An Overview of Distance and Similarity Functions for Structured Data

    Authors: Santiago Ontañón

    Abstract: The notions of distance and similarity play a key role in many machine learning approaches, and artificial intelligence (AI) in general, since they can serve as an organizing principle by which individuals classify objects, form concepts and make generalizations. While distance functions for propositional representations have been thoroughly studied, work on distance functions for structured repre…

    Submitted 18 February, 2020; originally announced February 2020.

  30. arXiv:1910.12134

    cs.LG cs.AI stat.ML

    Comparing Observation and Action Representations for Deep Reinforcement Learning in $μ$RTS

    Authors: Shengyi Huang, Santiago Ontañón

    Abstract: This paper presents a preliminary study comparing different observation and action space representations for Deep Reinforcement Learning (DRL) in the context of Real-time Strategy (RTS) games. Specifically, we compare two representations: (1) a global representation where the observation represents the whole game state, and the RL agent needs to choose which unit to issue actions to, and which act…

    Submitted 22 April, 2020; v1 submitted 26 October, 2019; originally announced October 2019.

    Comments: Presented in the AIIDE 2019 Workshop on Artificial Intelligence for Strategy Games

  31. arXiv:1908.05632

    cs.AI

    Tracing Player Knowledge in a Parallel Programming Educational Game

    Authors: Pavan Kantharaju, Katelyn Alderfer, Jichen Zhu, Bruce Char, Brian Smith, Santiago Ontañón

    Abstract: This paper focuses on "tracing player knowledge" in educational games. Specifically, given a set of concepts or skills required to master a game, the goal is to estimate the likelihood with which the current player has mastery of each of those concepts or skills. The main contribution of the paper is an approach that integrates machine learning and domain knowledge rules to find when the player ap…

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: 7 pages, 2 figures, published at AIIDE 2018 conference

  32. arXiv:1907.02349

    cs.HC cs.AI

    Experience Management in Multi-player Games

    Authors: Jichen Zhu, Santiago Ontañón

    Abstract: Experience Management studies AI systems that automatically adapt interactive experiences such as games to tailor to specific players and to fulfill design goals. Although it has been explored for several decades, existing work in experience management has mostly focused on single-player experiences. This paper is a first attempt at identifying the main challenges to expand EM to multi-player/mult…

    Submitted 4 July, 2019; originally announced July 2019.

  33. arXiv:1710.04805

    cs.AI

    Combinatorial Multi-armed Bandits for Real-Time Strategy Games

    Authors: Santiago Ontañón

    Abstract: Games with large branching factors pose a significant challenge for game tree search algorithms. In this paper, we address this problem with a sampling strategy for Monte Carlo Tree Search (MCTS) algorithms called naïve sampling, based on a variant of the Multi-armed Bandit problem called Combinatorial Multi-armed Bandits (CMAB). We analyze the theoretical properties of several variant…

    Submitted 13 October, 2017; originally announced October 2017.

    Journal ref: (2017) Journal of Artificial Intelligence Research (JAIR). Volume 58, pp 665-702

  34. arXiv:1606.07487

    cs.HC cs.AI cs.LG

    The VGLC: The Video Game Level Corpus

    Authors: Adam James Summerville, Sam Snodgrass, Michael Mateas, Santiago Ontañón

    Abstract: Levels are a key component of many different video games, and a large body of work has been produced on how to procedurally generate game levels. Recently, Machine Learning techniques have been applied to video game level generation with the aim of automatically generating levels that have the properties of the training corpus. Towards that end we have made available a corpus of video game…

    Submitted 3 July, 2016; v1 submitted 23 June, 2016; originally announced June 2016.

    Comments: To appear in proceedings of the 7th Workshop on Procedural Content Generation

  35. arXiv:1605.05305

    cs.AI

    Combat Models for RTS Games

    Authors: Alberto Uriarte, Santiago Ontañón

    Abstract: Game tree search algorithms, such as Monte Carlo Tree Search (MCTS), require access to a forward model (or "simulator") of the game at hand. However, in some games such a forward model is not readily available. This paper presents three forward models for two-player attrition games, which we call "combat models", and shows how they can be used to simulate combat in RTS games. We also show how these c…

    Submitted 17 May, 2016; originally announced May 2016.

  36. arXiv:1604.06954

    cs.AI

    RHOG: A Refinement-Operator Library for Directed Labeled Graphs

    Authors: Santiago Ontañón

    Abstract: This document provides the foundations behind the functionality provided by the $ρ$G library (https://github.com/santiontanon/RHOG), focusing on the basic operations the library provides: subsumption, refinement of directed labeled graphs, and distance/similarity assessment between directed labeled graphs. $ρ$G development was initially supported by the National Science Foundation, by the EAGER gr…

    Submitted 18 April, 2020; v1 submitted 23 April, 2016; originally announced April 2016.

    Comments: Report of the theory behind the RHOG library developed under NSF EAGER grant IIS-1551338

  37. A semantic network-based evolutionary algorithm for computational creativity

    Authors: Atilim Gunes Baydin, Ramon Lopez de Mantaras, Santiago Ontanon

    Abstract: We introduce a novel evolutionary algorithm (EA) with a semantic network-based representation. For enabling this, we establish new formulations of EA variation operators, crossover and mutation, that we adapt to work on semantic networks. The algorithm employs commonsense reasoning to ensure all operations preserve the meaningfulness of the networks, using ConceptNet and WordNet knowledge bases. T…

    Submitted 14 July, 2014; v1 submitted 30 April, 2014; originally announced April 2014.

    Comments: 20 pages, 14 figures, revision after reviews, changed title

    MSC Class: 92D15; 91E40; 68T20; 68T30

    ACM Class: I.2.4; I.2.6; G.1.6; J.4; J.3

    Journal ref: Evolutionary Intelligence, 8(1):3-21 (2015)

  38. arXiv:1208.1940

    cs.AI cs.GT

    Experiments with Game Tree Search in Real-Time Strategy Games

    Authors: Santiago Ontanon

    Abstract: Game tree search algorithms such as minimax have been used with enormous success in turn-based adversarial games such as Chess or Checkers. However, such algorithms cannot be directly applied to real-time strategy (RTS) games for a number of reasons. For example, minimax assumes turn-taking game mechanics, which are not present in RTS games. In this paper we present RTMM, a real-time variant of the st…

    Submitted 9 August, 2012; originally announced August 2012.

    Comments: 7 pages, 1 figure

  39. arXiv:1204.2335

    cs.NE nlin.AO

    Automated Generation of Cross-Domain Analogies via Evolutionary Computation

    Authors: Atilim Gunes Baydin, Ramon Lopez de Mantaras, Santiago Ontanon

    Abstract: Analogy plays an important role in creativity, and is extensively used in science as well as art. In this paper we introduce a technique for the automated generation of cross-domain analogies based on a novel evolutionary algorithm (EA). Unlike existing work in computational analogy-making restricted to creating analogies between two given cases, our approach, for a given case, is capable of creat…

    Submitted 11 April, 2012; originally announced April 2012.

    Comments: Conference submission, International Conference on Computational Creativity 2012 (8 pages, 6 figures)

    MSC Class: 92D15; 91E40; 68T20; 68T30

    ACM Class: I.2.4; I.2.6; G.1.6; J.4; J.3

    Journal ref: In Proceedings of the Third International Conference on Computational Creativity, Dublin, Ireland, May 30-June 1, 2012. Dublin: University College Dublin, 2012, pp. 25-32