
Showing 1–19 of 19 results for author: Garnelo, M

  1. arXiv:2305.10203  [pdf, other]

    cs.LG cs.NE

    Exploring the Space of Key-Value-Query Models with Intention

    Authors: Marta Garnelo, Wojciech Marian Czarnecki

    Abstract: Attention-based models have been a key element of many recent breakthroughs in deep learning. Two key components of Attention are the structure of its input (which consists of keys, values and queries) and the computations by which these three are combined. In this paper we explore the space of models that share said input structure but are not restricted to the computations of Attention. We refer…

    Submitted 17 May, 2023; originally announced May 2023.

  2. arXiv:2110.04041  [pdf, other]

    cs.AI

    Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity

    Authors: Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi

    Abstract: Strategic diversity is often essential in games: in multi-player games, for example, evaluating a player against a diverse set of strategies will yield a more accurate estimate of its performance. Furthermore, in games with non-transitivities diversity allows a player to cover several winning strategies. However, despite the significance of strategic diversity, training agents that exhibit diverse…

    Submitted 8 October, 2021; originally announced October 2021.

  3. arXiv:2106.14108  [pdf, other]

    cs.CE eess.IV

    Inferring a Continuous Distribution of Atom Coordinates from Cryo-EM Images using VAEs

    Authors: Dan Rosenbaum, Marta Garnelo, Michal Zielinski, Charlie Beattie, Ellen Clancy, Andrea Huber, Pushmeet Kohli, Andrew W. Senior, John Jumper, Carl Doersch, S. M. Ali Eslami, Olaf Ronneberger, Jonas Adler

    Abstract: Cryo-electron microscopy (cryo-EM) has revolutionized experimental protein structure determination. Despite advances in high resolution reconstruction, a majority of cryo-EM experiments provide either a single state of the studied macromolecule, or a relatively small number of its conformations. This reduces the effectiveness of the technique for proteins with flexible regions, which are known to…

    Submitted 26 June, 2021; originally announced June 2021.

  4. arXiv:2106.04219  [pdf, other]

    cs.LG cs.AI cs.MA

    Time-series Imputation of Temporally-occluded Multiagent Trajectories

    Authors: Shayegan Omidshafiei, Daniel Hennes, Marta Garnelo, Eugene Tarassov, Zhe Wang, Romuald Elie, Jerome T. Connor, Paul Muller, Ian Graham, William Spearman, Karl Tuyls

    Abstract: In multiagent environments, several decision-making individuals interact while adhering to the dynamics constraints imposed by the environment. These interactions, combined with the potential stochasticity of the agents' decision-making processes, make such systems complex and interesting to study from a dynamical perspective. Significant research has been conducted on learning models for forward-…

    Submitted 8 June, 2021; originally announced June 2021.

  5. arXiv:2011.09192  [pdf, other]

    cs.AI cs.GT cs.MA

    Game Plan: What AI can do for Football, and What Football can do for AI

    Authors: Karl Tuyls, Shayegan Omidshafiei, Paul Muller, Zhe Wang, Jerome Connor, Daniel Hennes, Ian Graham, William Spearman, Tim Waskett, Dafydd Steele, Pauline Luc, Adria Recasens, Alexandre Galashov, Gregory Thornton, Romuald Elie, Pablo Sprechmann, Pol Moreno, Kris Cao, Marta Garnelo, Praneet Dutta, Michal Valko, Nicolas Heess, Alex Bridgland, Julien Perolat, Bart De Vylder, et al. (11 additional authors not shown)

    Abstract: The rapid progress in artificial intelligence (AI) and machine learning has opened unprecedented analytics possibilities in various team and individual sports, including baseball, basketball, and tennis. More recently, AI techniques have been applied to football, due to a huge increase in data collection by professional teams, increased computational power, and advances in machine learning, with t…

    Submitted 18 November, 2020; originally announced November 2020.

  6. arXiv:2007.08973  [pdf, other]

    cs.CV cs.AI cs.LG

    AlignNet: Unsupervised Entity Alignment

    Authors: Antonia Creswell, Kyriacos Nikiforou, Oriol Vinyals, Andre Saraiva, Rishabh Kabra, Loic Matthey, Chris Burgess, Malcolm Reynolds, Richard Tanburn, Marta Garnelo, Murray Shanahan

    Abstract: Recently developed deep learning models are able to learn to segment scenes into component objects without supervision. This opens many new and exciting avenues of research, allowing agents to take objects (or entities) as inputs, rather than pixels. Unfortunately, while these models provide excellent segmentation of a single frame, they do not keep track of how objects segmented at one time-step…

    Submitted 21 July, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

  7. arXiv:2002.05820  [pdf, other]

    stat.ML cs.GT cs.LG

    A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I Learned to Stop Worrying about Mixed-Nash and Love Neural Nets

    Authors: Gauthier Gidel, David Balduzzi, Wojciech Marian Czarnecki, Marta Garnelo, Yoram Bachrach

    Abstract: Adversarial training, a special case of multi-objective optimization, is an increasingly prevalent machine learning technique: some of its most notable applications include GAN-based generative modeling and self-play techniques in reinforcement learning which have been applied to complex games such as Go or Poker. In practice, a single pair of networks is typically trained in order to find…

    Submitted 15 March, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: Appears in: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS 2021). 19 pages

  8. arXiv:1907.05181  [pdf, other]

    cs.MA cs.LG

    Learning Truthful, Efficient, and Welfare Maximizing Auction Rules

    Authors: Andrea Tacchetti, DJ Strouse, Marta Garnelo, Thore Graepel, Yoram Bachrach

    Abstract: From social networks to supply chains, more and more aspects of how humans, firms and organizations interact are mediated by artificial learning agents. As the influence of machine learning systems grows, it is paramount that we study how to imbue our modern institutions with our own values and principles. Here we consider the problem of allocating goods to buyers who have preferences over them in…

    Submitted 1 November, 2022; v1 submitted 11 July, 2019; originally announced July 2019.

  9. arXiv:1905.10307  [pdf, other]

    cs.LG stat.ML

    An Explicitly Relational Neural Network Architecture

    Authors: Murray Shanahan, Kyriacos Nikiforou, Antonia Creswell, Christos Kaplanis, David Barrett, Marta Garnelo

    Abstract: With a view to bridging the gap between deep learning and symbolic AI, we present a novel end-to-end neural network architecture that learns to form propositional representations with an explicitly relational structure from raw pixel data. In order to evaluate and analyse the architecture, we introduce a family of simple visual relational reasoning tasks of varying complexity. We show that the pro…

    Submitted 23 June, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: In Proceedings ICML 2020

  10. arXiv:1903.11907  [pdf, other]

    stat.ML cs.LG

    Meta-Learning surrogate models for sequential decision making

    Authors: Alexandre Galashov, Jonathan Schwarz, Hyunjik Kim, Marta Garnelo, David Saxton, Pushmeet Kohli, S. M. Ali Eslami, Yee Whye Teh

    Abstract: We introduce a unified probabilistic framework for solving sequential decision making problems ranging from Bayesian optimisation to contextual bandits and reinforcement learning. This is accomplished by a probabilistic model-based approach that explains observed data while capturing predictive uncertainty during the decision making process. Crucially, this probabilistic model is chosen to be a Me…

    Submitted 12 June, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

  11. arXiv:1902.02527  [pdf, other]

    cs.LG stat.ML

    Adaptive Posterior Learning: few-shot learning with a surprise-based memory module

    Authors: Tiago Ramalho, Marta Garnelo

    Abstract: The ability to generalize quickly from few observations is crucial for intelligent systems. In this paper we introduce APL, an algorithm that approximates probability distributions by remembering the most surprising observations it has encountered. These past observations are recalled from an external memory module and processed by a decoder network that can combine information from different memo…

    Submitted 7 February, 2019; originally announced February 2019.

    Comments: ICLR 2019

  12. arXiv:1901.08106  [pdf, other]

    cs.LG cs.GT cs.MA stat.ML

    Open-ended Learning in Symmetric Zero-sum Games

    Authors: David Balduzzi, Marta Garnelo, Yoram Bachrach, Wojciech M. Czarnecki, Julien Perolat, Max Jaderberg, Thore Graepel

    Abstract: Zero-sum games such as chess and poker are, abstractly, functions that evaluate pairs of agents, for example labeling them 'winner' and 'loser'. If the game is approximately transitive, then self-play generates sequences of agents of increasing strength. However, nontransitive games, such as rock-paper-scissors, can exhibit strategic cycles, and there is no longer a clear objective -- we want agen…

    Submitted 13 May, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: ICML 2019, final version

  13. arXiv:1901.05761  [pdf, other]

    cs.LG stat.ML

    Attentive Neural Processes

    Authors: Hyunjik Kim, Andriy Mnih, Jonathan Schwarz, Marta Garnelo, Ali Eslami, Dan Rosenbaum, Oriol Vinyals, Yee Whye Teh

    Abstract: Neural Processes (NPs) (Garnelo et al., 2018a;b) approach regression by learning to map a context set of observed input-output pairs to a distribution over regression functions. Each function models the distribution of the output given an input, conditioned on the context. NPs have the benefit of fitting observed data efficiently with linear complexity in the number of context input-output pairs, an…

    Submitted 9 July, 2019; v1 submitted 17 January, 2019; originally announced January 2019.

  14. arXiv:1812.02795  [pdf, other]

    cs.LG stat.ML

    Verification of deep probabilistic models

    Authors: Krishnamurthy Dvijotham, Marta Garnelo, Alhussein Fawzi, Pushmeet Kohli

    Abstract: Probabilistic models are a critical part of the modern deep learning toolbox - ranging from generative models (VAEs, GANs), sequence to sequence models used in machine translation and speech processing to models over functional spaces (conditional neural processes, neural processes). Given the size and complexity of these models, safely deploying them in applications requires the development of to…

    Submitted 6 December, 2018; originally announced December 2018.

    Comments: Accepted to NeurIPS 2018 Workshop on Security in Machine Learning

  15. arXiv:1807.02033  [pdf, other]

    cs.CV cs.LG stat.ML

    Consistent Generative Query Networks

    Authors: Ananya Kumar, S. M. Ali Eslami, Danilo J. Rezende, Marta Garnelo, Fabio Viola, Edward Lockhart, Murray Shanahan

    Abstract: Stochastic video prediction models take in a sequence of image frames, and generate a sequence of consecutive future image frames. These models typically generate future frames in an autoregressive fashion, which is slow and requires the input and output frames to be consecutive. We introduce a model that overcomes these drawbacks by generating a latent representation from an arbitrary set of fram…

    Submitted 21 April, 2019; v1 submitted 5 July, 2018; originally announced July 2018.

  16. arXiv:1807.01622  [pdf, other]

    cs.LG stat.ML

    Neural Processes

    Authors: Marta Garnelo, Jonathan Schwarz, Dan Rosenbaum, Fabio Viola, Danilo J. Rezende, S. M. Ali Eslami, Yee Whye Teh

    Abstract: A neural network (NN) is a parameterised function that can be tuned via gradient descent to approximate a labelled collection of data with high precision. A Gaussian process (GP), on the other hand, is a probabilistic model that defines a distribution over possible functions, and is updated in light of data via the rules of probabilistic inference. GPs are probabilistic, data-efficient and flexibl…

    Submitted 4 July, 2018; originally announced July 2018.

  17. arXiv:1807.01613  [pdf, other]

    cs.LG stat.ML

    Conditional Neural Processes

    Authors: Marta Garnelo, Dan Rosenbaum, Chris J. Maddison, Tiago Ramalho, David Saxton, Murray Shanahan, Yee Whye Teh, Danilo J. Rezende, S. M. Ali Eslami

    Abstract: Deep neural networks excel at function approximation, yet they are typically trained from scratch for each new function. On the other hand, Bayesian methods, such as Gaussian Processes (GPs), exploit prior knowledge to quickly infer the shape of a new function at test time. Yet GPs are computationally expensive, and it can be hard to design appropriate priors. In this paper we propose a family of…

    Submitted 4 July, 2018; originally announced July 2018.

  18. arXiv:1611.02648  [pdf, other]

    cs.LG cs.NE stat.ML

    Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders

    Authors: Nat Dilokthanakul, Pedro A. M. Mediano, Marta Garnelo, Matthew C. H. Lee, Hugh Salimbeni, Kai Arulkumaran, Murray Shanahan

    Abstract: We study a variant of the variational autoencoder model (VAE) with a Gaussian mixture as a prior distribution, with the goal of performing unsupervised clustering through deep generative models. We observe that the known problem of over-regularisation that has been shown to arise in regular VAEs also manifests itself in our model and leads to cluster degeneracy. We show that a heuristic called min…

    Submitted 13 January, 2017; v1 submitted 8 November, 2016; originally announced November 2016.

    Comments: 12 pages, 6 figures, Under review as a conference paper at ICLR 2017

  19. arXiv:1609.05518  [pdf, other]

    cs.AI cs.LG

    Towards Deep Symbolic Reinforcement Learning

    Authors: Marta Garnelo, Kai Arulkumaran, Murray Shanahan

    Abstract: Deep reinforcement learning (DRL) brings the power of deep neural networks to bear on the generic task of trial-and-error learning, and its effectiveness has been convincingly demonstrated on tasks such as Atari video games and the game of Go. However, contemporary DRL systems inherit a number of shortcomings from the current generation of deep learning techniques. For example, they require very l…

    Submitted 1 October, 2016; v1 submitted 18 September, 2016; originally announced September 2016.