Showing 1–31 of 31 results for author: Gutmann, M U

  1. arXiv:2403.03069  [pdf, other]

    cs.LG stat.ML

    Improving Variational Autoencoder Estimation from Incomplete Data with Mixture Variational Families

    Authors: Vaidotas Simkus, Michael U. Gutmann

    Abstract: We consider the task of estimating variational autoencoders (VAEs) when the training data is incomplete. We show that missing data increases the complexity of the model's posterior distribution over the latent variables compared to the fully-observed case. The increased complexity may adversely affect the fit of the model due to a mismatch between the variational and model posterior distributions.…

    Submitted 27 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Published in Transactions on Machine Learning Research (TMLR), 2024

    MSC Class: 62D10 ACM Class: I.2.6; G.3

  2. arXiv:2308.09078  [pdf, other]

    cs.LG stat.ML

    Conditional Sampling of Variational Autoencoders via Iterated Approximate Ancestral Sampling

    Authors: Vaidotas Simkus, Michael U. Gutmann

    Abstract: Conditional sampling of variational autoencoders (VAEs) is needed in various applications, such as missing data imputation, but is computationally intractable. A principled choice for asymptotically exact conditional sampling is Metropolis-within-Gibbs (MWG). However, we observe that the tendency of VAEs to learn a structured latent space, a commonly desired property, can cause the MWG sampler to…

    Submitted 8 November, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR), 2023

    MSC Class: 62D10 ACM Class: G.3

  3. arXiv:2305.07721  [pdf, other]

    cs.LG stat.ME

    Designing Optimal Behavioral Experiments Using Machine Learning

    Authors: Simon Valentin, Steven Kleinegesse, Neil R. Bramley, Peggy Seriès, Michael U. Gutmann, Christopher G. Lucas

    Abstract: Computational models are powerful tools for understanding human cognition and behavior. They let us express our theories clearly and precisely, and offer predictions that can be subtle and often counter-intuitive. However, this same richness and ability to surprise means our scientific intuitions and traditional tools are ill-suited to designing experiments to test and compare these models. To avo…

    Submitted 26 November, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted in eLife

  4. arXiv:2305.00869  [pdf, other]

    stat.ML cs.LG

    Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression

    Authors: Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

    Abstract: Functions of the ratio of the densities $p/q$ are widely used in machine learning to quantify the discrepancy between the two distributions $p$ and $q$. For high-dimensional distributions, binary classification-based density ratio estimators have shown great promise. However, when densities are well separated, estimating the density ratio with a binary classifier is challenging. In this work, we s…

    Submitted 1 May, 2023; originally announced May 2023.

    Journal ref: TMLR 2023

  5. arXiv:2208.02704  [pdf, other]

    cs.LG stat.ML

    Bayesian Optimization with Informative Covariance

    Authors: Afonso Eduardo, Michael U. Gutmann

    Abstract: Bayesian optimization is a methodology for global optimization of unknown and expensive objectives. It combines a surrogate Bayesian regression model with an acquisition function to decide where to evaluate the objective. Typical regression models are given by Gaussian processes with stationary covariance functions. However, these functions are unable to express prior input-dependent information,…

    Submitted 1 April, 2023; v1 submitted 4 August, 2022; originally announced August 2022.

    Journal ref: Transactions on Machine Learning Research (TMLR), 2023. URL: https://openreview.net/forum?id=JwgVBv18RG

  6. arXiv:2206.13446  [pdf, other]

    cs.LG stat.ML

    Pen and Paper Exercises in Machine Learning

    Authors: Michael U. Gutmann

    Abstract: This is a collection of (mostly) pen-and-paper exercises in machine learning. The exercises are on the following topics: linear algebra, optimisation, directed graphical models, undirected graphical models, expressive power of graphical models, factor graphs and message passing, inference for hidden Markov models, model-based learning (including ICA and unnormalised models), sampling and Monte-Car…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: The associated github page is https://github.com/michaelgutmann/ml-pen-and-paper-exercises

  7. arXiv:2204.13999  [pdf, other]

    cs.LG stat.ML

    Statistical applications of contrastive learning

    Authors: Michael U. Gutmann, Steven Kleinegesse, Benjamin Rhodes

    Abstract: The likelihood function plays a crucial role in statistical inference and experimental design. However, it is computationally intractable for several important classes of statistical models, including energy-based models and simulator-based models. Contrastive learning is an intuitive and computationally feasible alternative to likelihood-based learning. We here first provide an introduction to co…

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted to Behaviormetrika

  8. arXiv:2111.13180  [pdf, other]

    cs.LG stat.ME stat.ML

    Variational Gibbs Inference for Statistical Model Estimation from Incomplete Data

    Authors: Vaidotas Simkus, Benjamin Rhodes, Michael U. Gutmann

    Abstract: Statistical models are central to machine learning with broad applicability across a range of downstream tasks. The models are controlled by free parameters that are typically estimated from data by maximum-likelihood estimation or approximations thereof. However, when faced with real-world data sets many of the models run into a critical issue: they are formulated in terms of fully-observed data,…

    Submitted 15 August, 2023; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: Published at Journal of Machine Learning Research (JMLR)

    MSC Class: 62D10 ACM Class: I.2.6; G.3

    Journal ref: Journal of Machine Learning Research, 24(196), 1-72, 2023

  9. arXiv:2111.02329  [pdf, other]

    stat.ML cs.AI cs.LG stat.CO

    Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

    Authors: Desi R. Ivanova, Adam Foster, Steven Kleinegesse, Michael U. Gutmann, Tom Rainforth

    Abstract: We introduce implicit Deep Adaptive Design (iDAD), a new method for performing adaptive experiments in real-time with implicit models. iDAD amortizes the cost of Bayesian optimal experimental design (BOED) by learning a design policy network upfront, which can then be deployed quickly at the time of the experiment. The iDAD network can be trained on any model which simulates differentiable samples…

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: 33 pages, 8 figures. Published as a conference paper at NeurIPS 2021

  10. arXiv:2110.15632  [pdf, other]

    cs.LG

    Bayesian Optimal Experimental Design for Simulator Models of Cognition

    Authors: Simon Valentin, Steven Kleinegesse, Neil R. Bramley, Michael U. Gutmann, Christopher G. Lucas

    Abstract: Bayesian optimal experimental design (BOED) is a methodology to identify experiments that are expected to yield informative data. Recent work in cognitive science considered BOED for computational models of human behavior with tractable and known likelihood functions. However, tractability often comes at the cost of realism; simulator models that can capture the richness of human behavior are ofte…

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: Accepted as a poster at the NeurIPS 2021 Workshop "AI for Science"

  11. arXiv:2105.04379  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds

    Authors: Steven Kleinegesse, Michael U. Gutmann

    Abstract: We introduce a framework for Bayesian experimental design (BED) with implicit models, where the data-generating distribution is intractable but sampling from it is still possible. In order to find optimal experimental designs for such models, our approach maximises mutual information lower bounds that are parametrised by neural networks. By training a neural network on sampled data, we simultaneou…

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: Under review

    MSC Class: 62K05

  12. arXiv:2006.12204  [pdf, other]

    stat.ML cs.LG

    Telescoping Density-Ratio Estimation

    Authors: Benjamin Rhodes, Kai Xu, Michael U. Gutmann

    Abstract: Density-ratio estimation via classification is a cornerstone of unsupervised learning. It has provided the foundation for state-of-the-art methods in representation learning and generative modelling, with the number of use-cases continuing to proliferate. However, it suffers from a critical limitation: it fails to accurately estimate ratios p/q for which the two densities differ significantly. Emp…

    Submitted 24 November, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

  13. arXiv:2003.09379  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Sequential Bayesian Experimental Design for Implicit Models via Mutual Information

    Authors: Steven Kleinegesse, Christopher Drovandi, Michael U. Gutmann

    Abstract: Bayesian experimental design (BED) is a framework that uses statistical models and decision making under uncertainty to optimise the cost and performance of a scientific experiment. Sequential BED, as opposed to static BED, considers the scenario where we can sequentially update our beliefs about the model parameters through data gathered in the experiment. A class of models of particular interest…

    Submitted 20 March, 2020; originally announced March 2020.

    MSC Class: 62K05; 62L05

  14. arXiv:2002.08129  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Bayesian Experimental Design for Implicit Models by Mutual Information Neural Estimation

    Authors: Steven Kleinegesse, Michael U. Gutmann

    Abstract: Implicit stochastic models, where the data-generation distribution is intractable but sampling is possible, are ubiquitous in the natural sciences. The models typically have free parameters that need to be inferred from data collected in scientific experiments. A fundamental question is how to design the experiments so that the collected data are most useful. The field of Bayesian experimental des…

    Submitted 14 August, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted at the thirty-seventh International Conference on Machine Learning (ICML) 2020. Camera-ready version

    MSC Class: 62K05 (Primary) ACM Class: G.3

  15. arXiv:1907.01505  [pdf, other]

    stat.CO stat.AP stat.ME

    Adaptive Approximate Bayesian Computation Tolerance Selection

    Authors: Umberto Simola, Jessica Cisewski-Kehe, Michael U. Gutmann, Jukka Corander

    Abstract: Approximate Bayesian Computation (ABC) methods are increasingly used for inference in situations in which the likelihood function is either computationally costly or intractable to evaluate. Extensions of the basic ABC rejection algorithm have improved the computational efficiency of the procedure and broadened its applicability. The ABC-Population Monte Carlo (ABC-PMC) approach of Beaumont et al.…

    Submitted 30 April, 2020; v1 submitted 21 June, 2019; originally announced July 2019.

    Comments: 26 pages, 8 figures

  16. arXiv:1904.02431  [pdf, other]

    cs.RO

    To Stir or Not to Stir: Online Estimation of Liquid Properties for Pouring Actions

    Authors: Tatiana Lopez-Guevara, Rita Pucci, Nicholas Taylor, Michael U. Gutmann, Subramanian Ramamoorthy, Kartic Subr

    Abstract: Our brains are able to exploit coarse physical models of fluids to solve everyday manipulation tasks. There has been considerable interest in developing such a capability in robots so that they can autonomously manipulate fluids adapting to different conditions. In this paper, we investigate the problem of adaptation to liquids with different characteristics. We develop a simple calibration task (…

    Submitted 4 April, 2019; originally announced April 2019.

    Comments: Presented at the Modeling the Physical World: Perception, Learning, and Control Workshop (NeurIPS) 2018

  17. arXiv:1904.00670  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Robust Optimisation Monte Carlo

    Authors: Borislav Ikonomov, Michael U. Gutmann

    Abstract: This paper is on Bayesian inference for parametric statistical models that are defined by a stochastic simulator which specifies how data is generated. Exact sampling is then possible but evaluating the likelihood function is typically prohibitively expensive. Approximate Bayesian Computation (ABC) is a framework to perform approximate inference in such situations. While basic ABC algorithms are w…

    Submitted 28 February, 2020; v1 submitted 1 April, 2019; originally announced April 2019.

    Comments: 8 pages + 6 page appendix; v2: made clarifications, added a second possible algorithm implementation and its results; v3: small clarifications, to be published in AISTATS 2020

  18. arXiv:1902.10704  [pdf, other]

    stat.CO cs.LG stat.ME

    Adaptive Gaussian Copula ABC

    Authors: Yanzhi Chen, Michael U. Gutmann

    Abstract: Approximate Bayesian computation (ABC) is a set of techniques for Bayesian inference when the likelihood is intractable but sampling from the model is possible. This work presents a simple yet effective ABC algorithm based on the combination of two classical ABC approaches --- regression ABC and sequential ABC. The key idea is that rather than learning the posterior directly, we first target anoth…

    Submitted 27 February, 2019; originally announced February 2019.

    Comments: 8 pages, 5 figures, accepted to AISTATS 2019

  19. arXiv:1810.09899  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Dynamic Likelihood-free Inference via Ratio Estimation (DIRE)

    Authors: Traiko Dinev, Michael U. Gutmann

    Abstract: Parametric statistical models that are implicitly defined in terms of a stochastic data generating process are used in a wide range of scientific disciplines because they enable accurate modeling. However, learning the parameters from observed data is generally very difficult because their likelihood function is typically intractable. Likelihood-free Bayesian inference methods have been proposed w…

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: For a demo, see https://traiko.com/pages/research/lfire/

  20. arXiv:1806.03664  [pdf, other]

    stat.ML cs.LG

    Conditional Noise-Contrastive Estimation of Unnormalised Models

    Authors: Ciwan Ceylan, Michael U. Gutmann

    Abstract: Many parametric statistical models are not properly normalised and only specified up to an intractable partition function, which renders parameter estimation difficult. Examples of unnormalised models are Gibbs distributions, Markov random fields, and neural network models in unsupervised deep learning. In previous work, the estimation principle called noise-contrastive estimation (NCE) was introd…

    Submitted 10 June, 2018; originally announced June 2018.

    Comments: Accepted to ICML 2018

  21. arXiv:1806.00101  [pdf, other]

    stat.ML cs.LG

    Generative Ratio Matching Networks

    Authors: Akash Srivastava, Kai Xu, Michael U. Gutmann, Charles Sutton

    Abstract: Deep generative models can learn to generate realistic-looking images, but many of the most effective methods are adversarial and involve a saddlepoint optimization, which requires a careful balancing of training between a generator network and a critic network. Maximum mean discrepancy networks (MMD-nets) avoid this issue by using kernel as a fixed adversary, but unfortunately, they have not on t…

    Submitted 14 February, 2020; v1 submitted 31 May, 2018; originally announced June 2018.

    Comments: ICLR 2020; Code: https://github.com/GRAM-nets

  22. Efficient Bayesian Inference of Atomistic Structure in Complex Functional Materials

    Authors: Milica Todorović, Michael U. Gutmann, Jukka Corander, Patrick Rinke

    Abstract: Tailoring the functional properties of advanced organic/inorganic heterogeneous devices to their intended technological applications requires knowledge and control of the microscopic structure inside the device. Atomistic quantum mechanical simulation methods deliver accurate energies and properties for individual configurations, however, finding the most favourable configurations remains computat…

    Submitted 12 March, 2019; v1 submitted 30 August, 2017; originally announced August 2017.

    Comments: 8 pages, 5 figures. Peer-reviewed version

  23. arXiv:1708.00707  [pdf, other]

    stat.ML cs.MS stat.CO

    ELFI: Engine for Likelihood-Free Inference

    Authors: Jarno Lintusaari, Henri Vuollekoski, Antti Kangasrääsiö, Kusti Skytén, Marko Järvenpää, Pekka Marttinen, Michael U. Gutmann, Aki Vehtari, Jukka Corander, Samuel Kaski

    Abstract: Engine for Likelihood-Free Inference (ELFI) is a Python software library for performing likelihood-free inference (LFI). ELFI provides a convenient syntax for arranging components in LFI, such as priors, simulators, summaries or distances, to a network called ELFI graph. The components can be implemented in a wide variety of languages. The stand-alone ELFI graph can be used with any of the availab…

    Submitted 5 July, 2018; v1 submitted 2 August, 2017; originally announced August 2017.

    Journal ref: Journal of Machine Learning Research, 19(16):1-7, 2018. http://jmlr.org/papers/v19/17-374.html

  24. arXiv:1705.07761  [pdf, other]

    stat.ML

    VEEGAN: Reducing Mode Collapse in GANs using Implicit Variational Learning

    Authors: Akash Srivastava, Lazar Valkov, Chris Russell, Michael U. Gutmann, Charles Sutton

    Abstract: Deep generative models provide powerful tools for distributions over complicated manifolds, such as those of natural images. But many of these methods, including generative adversarial networks (GANs), can be difficult to train, in part because they are prone to mode collapse, which means that they characterize only a few modes of the true distribution. To address this, we introduce VEEGAN, which…

    Submitted 6 November, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: Published as a conference paper at NIPS, 2017

  25. arXiv:1704.00520  [pdf, other]

    stat.ML stat.CO stat.ME

    Efficient acquisition rules for model-based approximate Bayesian computation

    Authors: Marko Järvenpää, Michael U. Gutmann, Arijus Pleska, Aki Vehtari, Pekka Marttinen

    Abstract: Approximate Bayesian computation (ABC) is a method for Bayesian inference when the likelihood is unavailable but simulating from the model is possible. However, many ABC algorithms require a large number of simulations, which can be costly. To reduce the computational cost, Bayesian optimisation (BO) and surrogate models such as Gaussian processes have been proposed. Bayesian optimisation enables…

    Submitted 8 August, 2018; v1 submitted 3 April, 2017; originally announced April 2017.

    Comments: 30 pages, 10 figures

  26. arXiv:1611.10242  [pdf, other]

    stat.ML stat.CO stat.ME

    Likelihood-free inference by ratio estimation

    Authors: Owen Thomas, Ritabrata Dutta, Jukka Corander, Samuel Kaski, Michael U. Gutmann

    Abstract: We consider the problem of parametric statistical inference when likelihood computations are prohibitively expensive but sampling from the model is possible. Several so-called likelihood-free methods have been developed to perform inference in the absence of a likelihood function. The popular synthetic likelihood approach infers the parameters by modelling summary statistics of the data by a Gauss…

    Submitted 11 September, 2020; v1 submitted 30 November, 2016; originally announced November 2016.

    Comments: Accepted to Bayesian Analysis (2020)

  27. arXiv:1506.05666  [pdf, other]

    stat.ML

    Simultaneous Estimation of Non-Gaussian Components and their Correlation Structure

    Authors: Hiroaki Sasaki, Michael U. Gutmann, Hayaru Shouno, Aapo Hyvärinen

    Abstract: The statistical dependencies which independent component analysis (ICA) cannot remove often provide rich information beyond the linear independent components. It would thus be very useful to estimate the dependency structure from data. While such models have been proposed, they usually concentrated on higher-order correlations such as energy (square) correlations. Yet, linear correlations are a mo…

    Submitted 27 July, 2017; v1 submitted 18 June, 2015; originally announced June 2015.

  28. arXiv:1502.05503  [pdf, ps, other]

    stat.CO stat.ME stat.ML

    Classification and Bayesian Optimization for Likelihood-Free Inference

    Authors: Michael U. Gutmann, Jukka Corander, Ritabrata Dutta, Samuel Kaski

    Abstract: Some statistical models are specified via a data generating process for which the likelihood function cannot be computed in closed form. Standard likelihood-based inference is then not feasible but the model parameters can be inferred by finding the values which yield simulated data that resemble the observed data. This approach faces at least two major difficulties: The first difficulty is the ch…

    Submitted 19 February, 2015; originally announced February 2015.

  29. arXiv:1501.03291  [pdf, other]

    stat.ML stat.CO stat.ME

    Bayesian Optimization for Likelihood-Free Inference of Simulator-Based Statistical Models

    Authors: Michael U. Gutmann, Jukka Corander

    Abstract: Our paper deals with inferring simulator-based statistical models given some observed data. A simulator-based model is a parametrized mechanism which specifies how data are generated. It is thus also referred to as generative model. We assume that only a finite number of parameters are of interest and allow the generative process to be very general; it may be a noisy nonlinear dynamical system wit…

    Submitted 31 December, 2015; v1 submitted 14 January, 2015; originally announced January 2015.

    Comments: In press with the Journal of Machine Learning Research (JMLR). Accepted August 17, 2015

  30. arXiv:1407.4981  [pdf, other]

    stat.CO stat.ME stat.ML

    Likelihood-free inference via classification

    Authors: Michael U. Gutmann, Ritabrata Dutta, Samuel Kaski, Jukka Corander

    Abstract: Increasingly complex generative models are being used across disciplines as they allow for realistic characterization of data, but a common difficulty with them is the prohibitively large computational cost to evaluate the likelihood function and thus to perform likelihood-based statistical inference. A likelihood-free inference framework has emerged where the parameters are identified by finding…

    Submitted 3 March, 2017; v1 submitted 18 July, 2014; originally announced July 2014.

    Comments: Accepted for publication in Statistics and Computing (Feb 13, 2017)

  31. arXiv:1304.6803  [pdf, ps, other]

    stat.ML

    Direct Learning of Sparse Changes in Markov Networks by Density Ratio Estimation

    Authors: Song Liu, John A. Quinn, Michael U. Gutmann, Taiji Suzuki, Masashi Sugiyama

    Abstract: We propose a new method for detecting changes in Markov network structure between two sets of samples. Instead of naively fitting two Markov network models separately to the two data sets and figuring out their difference, we \emph{directly} learn the network structure change by estimating the ratio of Markov network models. This density-ratio formulation naturally allows us to introduce sparsity…

    Submitted 1 January, 2014; v1 submitted 25 April, 2013; originally announced April 2013.