Skip to main content

Showing 1–20 of 20 results for author: Frellsen, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.13151  [pdf, other

    stat.ML cs.LG stat.CO

    von Mises Quasi-Processes for Bayesian Circular Regression

    Authors: Yarden Cohen, Alexandre Khae Wu Navarro, Jes Frellsen, Richard E. Turner, Raziel Riemer, Ari Pakman

    Abstract: The need for regression models to predict circular values arises in many scientific fields. In this work we explore a family of expressive and interpretable distributions over circle-valued random functions related to Gaussian processes targeting two Euclidean dimensions conditioned on the unit circle. The resulting probability model has connections with continuous spin models in statistical physi… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Contribution to the Structured Probabilistic Inference & Generative Modeling workshop of ICML 2024

  2. arXiv:2404.17452  [pdf, other

    cs.LG stat.ML

    A Continuous Relaxation for Discrete Bayesian Optimization

    Authors: Richard Michael, Simon Bartels, Miguel González-Duque, Yevgen Zainchkovskyy, Jes Frellsen, Søren Hauberg, Wouter Boomsma

    Abstract: To optimize efficiently over discrete data and with only few available target observations is a challenge in Bayesian optimization. We propose a continuous relaxation of the objective function and show that inference and optimization can be computationally tractable. We consider in particular the optimization domain where very few observations and strict budgets exist; motivated by optimizing prot… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  3. arXiv:2310.06643  [pdf, other

    cs.LG stat.ML

    Implicit Variational Inference for High-Dimensional Posteriors

    Authors: Anshuk Uppal, Kristoffer Stensbo-Smidt, Wouter Boomsma, Jes Frellsen

    Abstract: In variational inference, the benefits of Bayesian models rely on accurately capturing the true posterior distribution. We propose using neural samplers that specify implicit distributions, which are well-suited for approximating complex multimodal and correlated posteriors in high-dimensional spaces. Our approach introduces novel bounds for approximate inference using implicit distributions by lo… ▽ More

    Submitted 9 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 10 pages and appendix, 9 figures, 7 tables

  4. arXiv:2212.03131  [pdf, other

    cs.LG cs.AI stat.ME

    Explainability as statistical inference

    Authors: Hugo Henri Joseph Senetaire, Damien Garreau, Jes Frellsen, Pierre-Alexandre Mattei

    Abstract: A wide variety of model explanation approaches have been proposed in recent years, all guided by very different rationales and heuristics. In this paper, we take a new route and cast interpretability as a statistical inference problem. We propose a general deep probabilistic model designed to produce interpretable predictions. The model parameters can be learned via maximum likelihood, and the met… ▽ More

    Submitted 29 December, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 10 pages, 22 figures, published at ICLR 2023

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:30584-30612, 2023

  5. arXiv:2203.01097  [pdf, other

    stat.ML cs.LG

    Model-agnostic out-of-distribution detection using combined statistical tests

    Authors: Federico Bergamin, Pierre-Alexandre Mattei, Jakob D. Havtorn, Hugo Senetaire, Hugo Schmutz, Lars Maaløe, Søren Hauberg, Jes Frellsen

    Abstract: We present simple methods for out-of-distribution detection using a trained generative model. These techniques, based on classical statistical tests, are model-agnostic in the sense that they can be applied to any differentiable generative model. The idea is to combine a classical parametric test (Rao's score test) with the recently introduced typicality test. These two test statistics are both th… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: Accepted at the 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022

  6. arXiv:2202.12707  [pdf, other

    eess.AS cs.AI cs.LG cs.SD stat.ML

    Benchmarking Generative Latent Variable Models for Speech

    Authors: Jakob D. Havtorn, Lasse Borgholt, Søren Hauberg, Jes Frellsen, Lars Maaløe

    Abstract: Stochastic latent variable models (LVMs) achieve state-of-the-art performance on natural image generation but are still inferior to deterministic models on speech. In this paper, we develop a speech benchmark of popular temporal LVMs and compare them against state-of-the-art deterministic models. We report the likelihood, which is a much used metric in the image domain, but rarely, or incomparably… ▽ More

    Submitted 5 April, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted at the 2022 ICLR workshop on Deep Generative Models for Highly Structured Data (https://deep-gen-struct.github.io)

  7. arXiv:2201.10989  [pdf, other

    stat.ML cs.AI cs.LG stat.CO stat.ME

    Uphill Roads to Variational Tightness: Monotonicity and Monte Carlo Objectives

    Authors: Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: We revisit the theory of importance weighted variational inference (IWVI), a promising strategy for learning latent variable models. IWVI uses new variational bounds, known as Monte Carlo objectives (MCOs), obtained by replacing intractable integrals by Monte Carlo estimates -- usually simply obtained via importance sampling. Burda, Grosse and Salakhutdinov (2016) showed that increasing the number… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    MSC Class: 62-08

  8. arXiv:2111.00929  [pdf, other

    cs.LG stat.ML

    Bounds all around: training energy-based models with bidirectional bounds

    Authors: Cong Geng, Jia Wang, Zhiyong Gao, Jes Frellsen, Søren Hauberg

    Abstract: Energy-based models (EBMs) provide an elegant framework for density estimation, but they are notoriously difficult to train. Recent work has established links to generative adversarial networks, where the EBM is trained through a minimax game with a variational value function. We propose a bidirectional bound on the EBM log-likelihood, such that we maximize a lower bound and minimize an upper boun… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: This paper has been accepted by NeurIPS 2021

  9. arXiv:2110.03051  [pdf, other

    cs.LG cs.AI stat.ML

    Prior and Posterior Networks: A Survey on Evidential Deep Learning Methods For Uncertainty Estimation

    Authors: Dennis Ulmer, Christian Hardmeier, Jes Frellsen

    Abstract: Popular approaches for quantifying predictive uncertainty in deep neural networks often involve distributions over weights or multiple models, for instance via Markov Chain sampling, ensembling, or Monte Carlo dropout. These techniques usually incur overhead by having to train multiple model instances or do not produce very diverse predictions. This comprehensive and extensive survey aims to famil… ▽ More

    Submitted 7 March, 2023; v1 submitted 6 October, 2021; originally announced October 2021.

  10. arXiv:2107.10587  [pdf, other

    stat.CO

    Kernel-Matrix Determinant Estimates from stopped Cholesky Decomposition

    Authors: Simon Bartels, Wouter Boomsma, Jes Frellsen, Damien Garreau

    Abstract: Algorithms involving Gaussian processes or determinantal point processes typically require computing the determinant of a kernel matrix. Frequently, the latter is computed from the Cholesky decomposition, an algorithm of cubic complexity in the size of the matrix. We show that, under mild assumptions, it is possible to estimate the determinant from only a sub-matrix, with probabilistic guarantee o… ▽ More

    Submitted 22 July, 2021; originally announced July 2021.

  11. arXiv:2102.08248  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Hierarchical VAEs Know What They Don't Know

    Authors: Jakob D. Havtorn, Jes Frellsen, Søren Hauberg, Lars Maaløe

    Abstract: Deep generative models have been demonstrated as state-of-the-art density estimators. Yet, recent work has found that they often assign a higher likelihood to data from outside the training distribution. This seemingly paradoxical behavior has caused concerns over the quality of the attained density estimates. In the context of hierarchical variational autoencoders, we provide evidence to explain… ▽ More

    Submitted 18 January, 2022; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: Appeared in Proceedings of the 38th International Conference on Machine Learning (ICML 2021). 18 pages, source code available at https://github.com/JakobHavtorn/hvae-oodd, https://github.com/vlievin/biva-pytorch and https://github.com/larsmaaloee/BIVA

  12. arXiv:2102.06522  [pdf, other

    stat.ML cs.LG stat.ME

    Sequential Neural Posterior and Likelihood Approximation

    Authors: Samuel Wiqvist, Jes Frellsen, Umberto Picchini

    Abstract: We introduce the sequential neural posterior and likelihood approximation (SNPLA) algorithm. SNPLA is a normalizing flows-based algorithm for inference in implicit models, and therefore is a simulation-based inference method that only requires simulations from a generative model. SNPLA avoids Markov chain Monte Carlo sampling and correction-steps of the parameter proposal function that are introdu… ▽ More

    Submitted 5 June, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: 28 pages, 8 tables, 14 figures. The supplementary material is attached to the main paper

  13. arXiv:2006.12871  [pdf, other

    stat.ML cs.LG stat.ME

    not-MIWAE: Deep Generative Modelling with Missing not at Random Data

    Authors: Niels Bruun Ipsen, Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: When a missing process depends on the missing values themselves, it needs to be explicitly modelled and taken into account while doing likelihood-based inference. We present an approach for building and fitting deep latent variable models (DLVMs) in cases where the missing process is dependent on the missing data. Specifically, a deep neural network enables us to flexibly model the conditional dis… ▽ More

    Submitted 18 March, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Camera-ready version for ICLR 2021

  14. arXiv:1902.03642  [pdf, other

    cs.LG stat.ML

    (q,p)-Wasserstein GANs: Comparing Ground Metrics for Wasserstein GANs

    Authors: Anton Mallasto, Jes Frellsen, Wouter Boomsma, Aasa Feragen

    Abstract: Generative Adversial Networks (GANs) have made a major impact in computer vision and machine learning as generative models. Wasserstein GANs (WGANs) brought Optimal Transport (OT) theory into GANs, by minimizing the $1$-Wasserstein distance between model and data distributions as their objective function. Since then, WGANs have gained considerable interest due to their stability and theoretical fr… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

  15. arXiv:1901.10230  [pdf, other

    stat.ML cs.LG stat.CO

    Partially Exchangeable Networks and Architectures for Learning Summary Statistics in Approximate Bayesian Computation

    Authors: Samuel Wiqvist, Pierre-Alexandre Mattei, Umberto Picchini, Jes Frellsen

    Abstract: We present a novel family of deep neural architectures, named partially exchangeable networks (PENs) that leverage probabilistic symmetries. By design, PENs are invariant to block-switch transformations, which characterize the partial exchangeability properties of conditionally Markovian processes. Moreover, we show that any block-switch invariant function has a PEN-like representation. The DeepSe… ▽ More

    Submitted 17 May, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Forthcoming on the Proceedings of ICML 2019. New comparisons with several different networks. We now use the Wasserstein distance to produce comparisons. Code available on GitHub. 16 pages, 5 figures, 21 tables

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97:6798--6807, 2019

  16. arXiv:1812.02633  [pdf, other

    stat.ML cs.LG stat.ME

    MIWAE: Deep Generative Modelling and Imputation of Incomplete Data

    Authors: Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: We consider the problem of handling missing data with deep latent variable models (DLVMs). First, we present a simple technique to train DLVMs when the training set contains missing-at-random data. Our approach, called MIWAE, is based on the importance-weighted autoencoder (IWAE), and maximises a potentially tight lower bound of the log-likelihood of the observed data. Compared to the original IWA… ▽ More

    Submitted 4 February, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: A short version of this paper was presented at the 3rd NeurIPS workshop on Bayesian Deep Learning

  17. arXiv:1802.04826  [pdf, other

    stat.ML cs.LG stat.ME

    Leveraging the Exact Likelihood of Deep Latent Variable Models

    Authors: Pierre-Alexandre Mattei, Jes Frellsen

    Abstract: Deep latent variable models (DLVMs) combine the approximation abilities of deep neural networks and the statistical foundations of generative models. Variational methods are commonly used for inference; however, the exact likelihood of these models has been largely overlooked. The purpose of this work is to study the general properties of this quantity and to show how they can be leveraged in prac… ▽ More

    Submitted 28 June, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    MSC Class: 62H25

  18. arXiv:1707.05147  [pdf, other

    stat.ML cs.LG

    Comparative Study of Inference Methods for Bayesian Nonnegative Matrix Factorisation

    Authors: Thomas Brouwer, Jes Frellsen, Pietro Lió

    Abstract: In this paper, we study the trade-offs of different inference approaches for Bayesian matrix factorisation methods, which are commonly used for predicting missing values, and for finding patterns in the data. In particular, we consider Bayesian nonnegative variants of matrix factorisation and tri-factorisation, and compare non-probabilistic inference, Gibbs sampling, variational Bayesian inference… ▽ More

    Submitted 13 July, 2017; originally announced July 2017.

    Comments: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2017). The final publication will be available at link.springer.com. arXiv admin note: text overlap with arXiv:1610.08127

  19. arXiv:1610.08127  [pdf, other

    cs.LG cs.AI math.NA stat.ML

    Fast Bayesian Non-Negative Matrix Factorisation and Tri-Factorisation

    Authors: Thomas Brouwer, Jes Frellsen, Pietro Lio'

    Abstract: We present a fast variational Bayesian algorithm for performing non-negative matrix factorisation and tri-factorisation. We show that our approach achieves faster convergence per iteration and timestep (wall-clock) than Gibbs sampling and non-probabilistic approaches, and do not require additional samples to estimate the posterior. We show that in particular for matrix tri-factorisation convergenc… ▽ More

    Submitted 25 October, 2016; originally announced October 2016.

    Comments: NIPS 2016 Workshop on Advances in Approximate Bayesian Inference

  20. arXiv:1602.05003  [pdf, other

    stat.ML

    The Multivariate Generalised von Mises distribution: Inference and applications

    Authors: Alexandre K. W. Navarro, Jes Frellsen, Richard E. Turner

    Abstract: Circular variables arise in a multitude of data-modelling contexts ranging from robotics to the social sciences, but they have been largely overlooked by the machine learning community. This paper partially redresses this imbalance by extending some standard probabilistic modelling tools to the circular domain. First we introduce a new multivariate distribution over circular variables, called the… ▽ More

    Submitted 8 August, 2017; v1 submitted 16 February, 2016; originally announced February 2016.

    Comments: 16 pages, 8 figures. Final version available at AAAI Press website: https://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/15020. This version includes supplementary material submitted to, but not published, in the AAAI proceedings

    ACM Class: G.3; I.2