Showing 51–60 of 60 results for author: Bowling, M

  1. arXiv:1211.0587  [pdf, other]

    cs.IT cs.LG stat.ML

    Partition Tree Weighting

    Authors: Joel Veness, Martha White, Michael Bowling, András György

    Abstract: This paper introduces the Partition Tree Weighting technique, an efficient meta-algorithm for piecewise stationary sources. The technique works by performing Bayesian model averaging over a large class of possible partitions of the data into locally stationary segments. It uses a prior, closely related to the Context Tree Weighting technique of Willems, that is well suited to data compression appl…

    Submitted 21 November, 2012; v1 submitted 2 November, 2012; originally announced November 2012.

  2. The Arcade Learning Environment: An Evaluation Platform for General Agents

    Authors: Marc G. Bellemare, Yavar Naddaf, Joel Veness, Michael Bowling

    Abstract: In this article we introduce the Arcade Learning Environment (ALE): both a challenge problem and a platform and methodology for evaluating the development of general, domain-independent AI technology. ALE provides an interface to hundreds of Atari 2600 game environments, each one different, interesting, and designed to be a challenge for human players. ALE presents significant research challenges…

    Submitted 21 June, 2013; v1 submitted 19 July, 2012; originally announced July 2012.

    Journal ref: Journal of Artificial Intelligence Research 47, pages 253-279

  3. arXiv:1207.1411  [pdf]

    cs.GT cs.AI

    Bayes' Bluff: Opponent Modelling in Poker

    Authors: Finnegan Southey, Michael P. Bowling, Bryce Larson, Carmelo Piccione, Neil Burch, Darse Billings, Chris Rayner

    Abstract: Poker is a challenging problem for artificial intelligence, with non-deterministic dynamics, partial observability, and the added difficulty of unknown adversaries. Modelling all of the uncertainties in this domain is not an easy task. In this paper we present a Bayesian probabilistic model for a broad class of poker games, separating the uncertainty in the game dynamics from the uncertainty of th…

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-550-558

  4. arXiv:1206.3318  [pdf, other]

    cs.AI

    On Local Regret

    Authors: Michael Bowling, Martin Zinkevich

    Abstract: Online learning aims to perform nearly as well as the best hypothesis in hindsight. For some hypothesis classes, though, even finding the best hypothesis offline is challenging. In such offline cases, local search techniques are often employed and only local optimality guaranteed. For online decision-making with such hypothesis classes, we introduce local regret, a generalization of regret that ai…

    Submitted 14 June, 2012; originally announced June 2012.

    Comments: This is the longer version of the same-titled paper appearing in the Proceedings of the Twenty-Ninth International Conference on Machine Learning (ICML), 2012

    Report number: TR12-04

  5. arXiv:1206.3285  [pdf]

    cs.AI cs.LG eess.SY

    Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

    Authors: Richard S. Sutton, Csaba Szepesvari, Alborz Geramifard, Michael P. Bowling

    Abstract: We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available after each interaction with the world. This paper develops an explicitly model-based approach extending the Dyna architecture to linear function approximation. Dyna-style planning proceeds by generating imaginary experience fr…

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-528-536

  6. arXiv:1205.0622  [pdf, other]

    cs.GT cs.AI

    No-Regret Learning in Extensive-Form Games with Imperfect Recall

    Authors: Marc Lanctot, Richard Gibson, Neil Burch, Martin Zinkevich, Michael Bowling

    Abstract: Counterfactual Regret Minimization (CFR) is an efficient no-regret learning algorithm for decision problems modeled as extensive games. CFR's regret bounds depend on the requirement of perfect recall: players always remember information that was revealed to them and the order in which it was revealed. In games without perfect recall, however, CFR's guarantees do not apply. In this paper, we presen…

    Submitted 3 May, 2012; originally announced May 2012.

    Comments: 21 pages, 4 figures, expanded version of article to appear in Proceedings of the Twenty-Ninth International Conference on Machine Learning

  7. arXiv:1205.0288  [pdf, other]

    cs.LG stat.ML

    A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning

    Authors: Arash Afkanpour, András György, Csaba Szepesvári, Michael Bowling

    Abstract: We consider the problem of simultaneously learning to linearly combine a very large number of kernels and learn a good predictor based on the learnt kernel. When the number of kernels $d$ to be combined is very large, multiple kernel learning methods whose computational cost scales linearly in $d$ are intractable. We propose a randomized version of the mirror descent algorithm to overcome this iss…

    Submitted 7 January, 2013; v1 submitted 1 May, 2012; originally announced May 2012.

  8. arXiv:1112.4607  [pdf, ps, other]

    cs.LG stat.ML

    Alignment Based Kernel Learning with a Continuous Set of Base Kernels

    Authors: Arash Afkanpour, Csaba Szepesvari, Michael Bowling

    Abstract: The success of kernel-based learning methods depends on the choice of kernel. Recently, kernel learning methods have been proposed that use data to select the most appropriate kernel, usually by combining a set of base kernels. We introduce a new algorithm for kernel learning that combines a {\em continuous set of base kernels}, without the common step of discretizing the space of base kernels. We…

    Submitted 20 December, 2011; originally announced December 2011.

  9. arXiv:1111.3182  [pdf, ps, other]

    cs.IT

    Context Tree Switching

    Authors: Joel Veness, Kee Siong Ng, Marcus Hutter, Michael Bowling

    Abstract: This paper describes the Context Tree Switching technique, a modification of Context Tree Weighting for the prediction of binary, stationary, n-Markov sources. By modifying Context Tree Weighting's recursive weighting scheme, it is possible to mix over a strictly larger class of models without increasing the asymptotic time or space complexity of the original algorithm. We prove that this generali…

    Submitted 14 November, 2011; originally announced November 2011.

    Comments: Technical Report

  10. arXiv:1107.0033  [pdf, ps]

    cs.MA cs.GT

    Existence of Multiagent Equilibria with Limited Agents

    Authors: M. Bowling, M. Veloso

    Abstract: Multiagent learning is a necessary yet challenging problem as multiagent systems become more prevalent and environments become more dynamic. Much of the groundbreaking work in this area draws on notable results from game theory, in particular, the concept of Nash equilibria. Learners that directly learn an equilibrium obviously rely on their existence. Learners that instead seek to play optimal…

    Submitted 30 June, 2011; originally announced July 2011.

    Journal ref: Journal of Artificial Intelligence Research, Volume 22, pages 353-384, 2004