Skip to main content

Showing 1–7 of 7 results for author: Devlin, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2301.10677  [pdf, other

    cs.AI cs.LG stat.ML

    Imitating Human Behaviour with Diffusion Models

    Authors: Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

    Abstract: Diffusion models have emerged as powerful generative models in the text-to-image domain. This paper studies their application as observation-to-action models for imitating human behaviour in sequential environments. Human behaviour is stochastic and multimodal, with structured correlations between action dimensions. Meanwhile, standard modelling choices in behaviour cloning are limited in their ex… ▽ More

    Submitted 3 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Published in ICLR 2023

    Journal ref: ICLR 2023

  2. arXiv:2101.11071  [pdf, other

    cs.LG cs.AI stat.ML

    The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors

    Authors: William H. Guss, Mario Ynocente Castro, Sam Devlin, Brandon Houghton, Noboru Sean Kuno, Crissman Loomis, Stephanie Milani, Sharada Mohanty, Keisuke Nakata, Ruslan Salakhutdinov, John Schulman, Shinya Shiroshita, Nicholay Topin, Avinash Ummadisingu, Oriol Vinyals

    Abstract: Although deep reinforcement learning has led to breakthroughs in many difficult domains, these successes have required an ever-increasing number of samples, affording only a shrinking segment of the AI community access to their development. Resolution of these limitations requires new, sample-efficient methods. To facilitate research in this direction, we propose this second iteration of the MineR… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 37 pages, initial submission, accepted at NeurIPS. arXiv admin note: substantial text overlap with arXiv:1904.10079

  3. arXiv:2007.02912  [pdf, other

    cs.LG stat.ML

    Meta-Learning Divergences of Variational Inference

    Authors: Ruqi Zhang, Yingzhen Li, Christopher De Sa, Sam Devlin, Cheng Zhang

    Abstract: Variational inference (VI) plays an essential role in approximate Bayesian inference due to its computational efficiency and broad applicability. Crucial to the performance of VI is the selection of the associated divergence measure, as VI approximates the intractable distribution by minimizing this divergence. In this paper we propose a meta-learning algorithm to learn the divergence metric suite… ▽ More

    Submitted 22 June, 2021; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: Published at AISTATS 2021

  4. arXiv:2006.14188  [pdf, other

    stat.AP

    Identifying group contributions in NBA lineups with spectral analysis

    Authors: Stephen Devlin, David Uminsky

    Abstract: We address the question of how to quantify the contributions of groups of players to team success. Our approach is based on spectral analysis, a technique from algebraic signal processing, which has several appealing features. First, our analysis decomposes the team success signal into components that are naturally understood as the contributions of player groups of a given size: individuals, pair… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: To appear in Journal of Sports Analytics

  5. arXiv:2006.08718  [pdf, other

    cs.LG cs.RO stat.ML

    Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control

    Authors: Rika Antonova, Maksim Maydanskiy, Danica Kragic, Sam Devlin, Katja Hofmann

    Abstract: We address the problem of learning reusable state representations from streaming high-dimensional observations. This is important for areas like Reinforcement Learning (RL), which yields non-stationary data distributions during training. We make two key contributions. First, we propose an evaluation suite that measures alignment between latent and true low-dimensional states. We benchmark several… ▽ More

    Submitted 6 October, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Added Section 4: "Imposing AML Relations During Transfer"; expanded description of experiments in Section 5: "Evaluating AML and Latent Space Transfer"

  6. arXiv:1910.12911  [pdf, other

    cs.LG cs.AI stat.ML

    Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

    Authors: Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin, Katja Hofmann

    Abstract: The ability for policies to generalize to new environments is key to the broad application of RL agents. A promising approach to prevent an agent's policy from overfitting to a limited set of training environments is to apply regularization techniques originally developed for supervised learning. However, there are stark differences between supervised learning and RL. We discuss those differences… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: Published at Neurips 2019

  7. arXiv:1905.07631  [pdf, other

    stat.ML cs.LG stat.ME

    Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees

    Authors: Summer Devlin, Chandan Singh, W. James Murdoch, Bin Yu

    Abstract: Tree ensembles, such as random forests and AdaBoost, are ubiquitous machine learning models known for achieving strong predictive performance across a wide variety of domains. However, this strong performance comes at the cost of interpretability (i.e. users are unable to understand the relationships a trained random forest has learned and why it is making its predictions). In particular, it is ch… ▽ More

    Submitted 18 May, 2019; originally announced May 2019.

    Comments: Under review