Skip to main content

Showing 1–19 of 19 results for author: Papamakarios, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2305.13233  [pdf, other

    physics.comp-ph cond-mat.stat-mech cs.LG stat.ML

    Estimating Gibbs free energies via isobaric-isothermal flows

    Authors: Peter Wirnsberger, Borja Ibarz, George Papamakarios

    Abstract: We present a machine-learning model based on normalizing flows that is trained to sample from the isobaric-isothermal ensemble. In our approach, we approximate the joint distribution of a fully-flexible triclinic simulation box and particle coordinates to achieve a desired internal pressure. This novel extension of flow-based sampling to the isobaric-isothermal ensemble yields direct estimates of… ▽ More

    Submitted 6 September, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 19 pages, 7 figures

  3. arXiv:2302.04798  [pdf, other

    cs.LG cs.AI stat.ML

    Equivariant MuZero

    Authors: Andreea Deac, Théophane Weber, George Papamakarios

    Abstract: Deep reinforcement learning repeatedly succeeds in closed, well-defined domains such as games (Chess, Go, StarCraft). The next frontier is real-world scenarios, where setups are numerous and varied. For this, agents need to learn the underlying rules governing the environment, so as to robustly generalise to conditions that differ from those they were trained on. Model-based reinforcement learning… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 figures

  4. arXiv:2209.14249  [pdf, other

    cs.LG stat.ML

    Compositional Score Modeling for Simulation-based Inference

    Authors: Tomas Geffner, George Papamakarios, Andriy Mnih

    Abstract: Neural Posterior Estimation methods for simulation-based inference can be ill-suited for dealing with posterior distributions obtained by conditioning on multiple observations, as they tend to require a large number of simulator calls to learn accurate approximations. In contrast, Neural Likelihood Estimation methods can handle multiple observations at inference time after learning from individual… ▽ More

    Submitted 9 July, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

  5. arXiv:2209.11142  [pdf, other

    cs.LG cs.AI stat.ML

    A Generalist Neural Algorithmic Learner

    Authors: Borja Ibarz, Vitaly Kurin, George Papamakarios, Kyriacos Nikiforou, Mehdi Bennani, Róbert Csordás, Andrew Dudzik, Matko Bošnjak, Alex Vitvitskyi, Yulia Rubanova, Andreea Deac, Beatrice Bevilacqua, Yaroslav Ganin, Charles Blundell, Petar Veličković

    Abstract: The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While recent years have seen a surge in methodological improvements in this area, they mostly focused on building specialist models. Specialist models are capable of learning to neurally execute either only one algorithm or a collection of algorithms… ▽ More

    Submitted 3 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: To appear at LoG 2022 (Spotlight talk). 23 pages, 11 figures

  6. arXiv:2006.04710  [pdf, other

    stat.ML cs.LG

    The Lipschitz Constant of Self-Attention

    Authors: Hyunjik Kim, George Papamakarios, Andriy Mnih

    Abstract: Lipschitz constants of neural networks have been explored in various contexts in deep learning, such as provable adversarial robustness, estimating Wasserstein distance, stabilising training of GANs, and formulating invertible neural networks. Such works have focused on bounding the Lipschitz constant of fully connected or convolutional networks, composed of linear maps and pointwise non-lineariti… ▽ More

    Submitted 9 June, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

  7. arXiv:2002.03712  [pdf, other

    stat.ML cs.LG

    On Contrastive Learning for Likelihood-free Inference

    Authors: Conor Durkan, Iain Murray, George Papamakarios

    Abstract: Likelihood-free methods perform parameter inference in stochastic simulator models where evaluating the likelihood is intractable but sampling synthetic data is possible. One class of methods for this likelihood-free problem uses a classifier to distinguish between pairs of parameter-observation samples generated using the simulator and pairs sampled from some reference distribution, which implici… ▽ More

    Submitted 18 December, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: Appeared at ICML 2020

  8. arXiv:2002.02836  [pdf, other

    cs.LG cs.AI stat.ML

    Causally Correct Partial Models for Reinforcement Learning

    Authors: Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing

    Abstract: In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the observations are high-dimensional (e.g. images). For this reason, previous works have considered partial models, which model only part of the observation. In this pa… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

  9. arXiv:2002.02428  [pdf, other

    stat.ML cs.LG

    Normalizing Flows on Tori and Spheres

    Authors: Danilo Jimenez Rezende, George Papamakarios, Sébastien Racanière, Michael S. Albergo, Gurtej Kanwar, Phiala E. Shanahan, Kyle Cranmer

    Abstract: Normalizing flows are a powerful tool for building expressive distributions in high dimensions. So far, most of the literature has concentrated on learning flows on Euclidean spaces. Some problems however, such as those involving angles, are defined on spaces with more complex geometries, such as tori or spheres. In this paper, we propose and compare expressive and numerically stable flows on such… ▽ More

    Submitted 1 July, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Accepted to the International Conference on Machine Learning (ICML) 2020

  10. arXiv:1912.02762  [pdf, other

    stat.ML cs.LG

    Normalizing Flows for Probabilistic Modeling and Inference

    Authors: George Papamakarios, Eric Nalisnick, Danilo Jimenez Rezende, Shakir Mohamed, Balaji Lakshminarayanan

    Abstract: Normalizing flows provide a general mechanism for defining expressive probability distributions, only requiring the specification of a (usually simple) base distribution and a series of bijective transformations. There has been much recent work on normalizing flows, ranging from improving their expressive power to expanding their application. We believe the field has now matured and is in need of… ▽ More

    Submitted 8 April, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: Review article, 64 pages, 9 figures. Published in the Journal of Machine Learning Research (see https://jmlr.org/papers/v22/19-1028.html)

    Journal ref: Journal of Machine Learning Research, 22(57):1-64, 2021

  11. arXiv:1910.13233  [pdf, other

    stat.ML cs.LG

    Neural Density Estimation and Likelihood-free Inference

    Authors: George Papamakarios

    Abstract: I consider two problems in machine learning and statistics: the problem of estimating the joint probability density of a collection of random variables, known as density estimation, and the problem of inferring model parameters when their likelihood is intractable, known as likelihood-free inference. The contribution of the thesis is a set of new methods for addressing these problems that are base… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: PhD thesis submitted to the University of Edinburgh in April 2019. Includes in full the following articles: arXiv:1605.06376, arXiv:1705.07057, arXiv:1805.07226

  12. arXiv:1906.04032  [pdf, other

    stat.ML cs.LG

    Neural Spline Flows

    Authors: Conor Durkan, Artur Bekasov, Iain Murray, George Papamakarios

    Abstract: A normalizing flow models a complex probability density as an invertible transformation of a simple base density. Flows based on either coupling or autoregressive transforms both offer exact density evaluation and sampling, but rely on the parameterization of an easily invertible elementwise transformation, whose choice determines the flexibility of these models. Building upon recent work, we prop… ▽ More

    Submitted 2 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Published at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  13. arXiv:1906.02145  [pdf, other

    stat.ML cs.LG

    Cubic-Spline Flows

    Authors: Conor Durkan, Artur Bekasov, Iain Murray, George Papamakarios

    Abstract: A normalizing flow models a complex probability density as an invertible transformation of a simple density. The invertibility means that we can evaluate densities and generate samples from a flow. In practice, autoregressive flow-based models are slow to invert, making either density estimation or sample generation slow. Flows based on coupling transforms are fast for both tasks, but have previou… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: Appeared at the 1st Workshop on Invertible Neural Networks and Normalizing Flows at ICML 2019

  14. arXiv:1811.08723  [pdf, other

    stat.ML cs.LG

    Sequential Neural Methods for Likelihood-free Inference

    Authors: Conor Durkan, George Papamakarios, Iain Murray

    Abstract: Likelihood-free inference refers to inference when a likelihood function cannot be explicitly evaluated, which is often the case for models based on simulators. Most of the literature is based on sample-based `Approximate Bayesian Computation' methods, but recent work suggests that approaches based on deep neural conditional density estimators can obtain state-of-the-art results with fewer simulat… ▽ More

    Submitted 21 November, 2018; originally announced November 2018.

  15. arXiv:1806.03107  [pdf, other

    cs.LG stat.ML

    Temporal Difference Variational Auto-Encoder

    Authors: Karol Gregor, George Papamakarios, Frederic Besse, Lars Buesing, Theophane Weber

    Abstract: To act and plan in complex environments, we posit that agents should have a mental simulator of the world with three characteristics: (a) it should build an abstract state representing the condition of the world; (b) it should form a belief which represents uncertainty on the world; (c) it should go beyond simple step-by-step simulation, and exhibit temporal abstraction. Motivated by the absence o… ▽ More

    Submitted 2 January, 2019; v1 submitted 8 June, 2018; originally announced June 2018.

  16. arXiv:1805.07226  [pdf, other

    stat.ML cs.LG

    Sequential Neural Likelihood: Fast Likelihood-free Inference with Autoregressive Flows

    Authors: George Papamakarios, David C. Sterratt, Iain Murray

    Abstract: We present Sequential Neural Likelihood (SNL), a new method for Bayesian inference in simulator models, where the likelihood is intractable but simulating data from the model is possible. SNL trains an autoregressive flow on simulated data in order to learn a model of the likelihood in the region of high posterior density. A sequential training procedure guides simulations and reduces simulation c… ▽ More

    Submitted 21 January, 2019; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: Accepted for publication at AISTATS 2019

  17. arXiv:1705.07057  [pdf, other

    stat.ML cs.LG

    Masked Autoregressive Flow for Density Estimation

    Authors: George Papamakarios, Theo Pavlakou, Iain Murray

    Abstract: Autoregressive models are among the best performing neural density estimators. We describe an approach for increasing the flexibility of an autoregressive model, based on modelling the random numbers that the model uses internally when generating data. By constructing a stack of autoregressive models, each modelling the random numbers of the next model in the stack, we obtain a type of normalizing… ▽ More

    Submitted 14 June, 2018; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: section 4.3 is corrected since the previous version

  18. arXiv:1605.06376  [pdf, other

    stat.ML cs.LG stat.CO

    Fast $ε$-free Inference of Simulation Models with Bayesian Conditional Density Estimation

    Authors: George Papamakarios, Iain Murray

    Abstract: Many statistical models can be simulated forwards but have intractable likelihoods. Approximate Bayesian Computation (ABC) methods are used to infer properties of these models from data. Traditionally these methods approximate the posterior over parameters by conditioning on data being inside an $ε$-ball around the observed data, which is only correct in the limit $ε\!\rightarrow\!0$. Monte Carlo… ▽ More

    Submitted 2 April, 2018; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: Appeared at NIPS 2016. Fixed typo in Eq (37)

  19. arXiv:1510.02437  [pdf, other

    stat.ML cs.LG

    Distilling Model Knowledge

    Authors: George Papamakarios

    Abstract: Top-performing machine learning systems, such as deep neural networks, large ensembles and complex probabilistic graphical models, can be expensive to store, slow to evaluate and hard to integrate into larger systems. Ideally, we would like to replace such cumbersome models with simpler models that perform equally well. In this thesis, we study knowledge distillation, the idea of extracting the… ▽ More

    Submitted 8 October, 2015; originally announced October 2015.