Skip to main content

Showing 1–49 of 49 results for author: Bürkner, P

.
  1. arXiv:2407.04967  [pdf, other

    stat.CO

    posteriordb: Testing, Benchmarking and Develo** Bayesian Inference Algorithms

    Authors: Måns Magnusson, Jakob Torgander, Paul-Christian Bürkner, Lu Zhang, Bob Carpenter, Aki Vehtari

    Abstract: The generality and robustness of inference algorithms is critical to the success of widely used probabilistic programming languages such as Stan, PyMC, Pyro, and Turing.jl. When designing a new general-purpose inference algorithm, whether it involves Monte Carlo sampling or variational approximation, the fundamental problem arises in evaluating its accuracy and efficiency across a range of represe… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  2. arXiv:2406.03154  [pdf, other

    cs.LG cs.AI

    Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation

    Authors: Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Recent advances in probabilistic deep learning enable efficient amortized Bayesian inference in settings where the likelihood function is only implicitly defined by a simulation program (simulation-based inference; SBI). But how faithful is such inference if the simulation represents reality somewhat inaccurately, that is, if the true system behavior at test time deviates from the one seen during… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: Extended version of the conference paper https://doi.org/10.1007/978-3-031-54605-1_35. arXiv admin note: text overlap with arXiv:2112.08866

  3. arXiv:2404.14124  [pdf, other

    stat.ME

    Gaussian distributional structural equation models: A framework for modeling latent heteroscedasticity

    Authors: Luna Fazio, Paul-Christian Bürkner

    Abstract: Accounting for the complexity of psychological theories requires methods that can predict not only changes in the means of latent variables -- such as personality factors, creativity, or intelligence -- but also changes in their variances. Structural equation modeling (SEM) is the framework of choice for analyzing complex relationships among latent variables, but current methods do not allow model… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 30 pages, 13 figures

  4. arXiv:2404.04074  [pdf, other

    stat.ME

    DGP-LVM: Derivative Gaussian process latent variable model

    Authors: Soham Mukherjee, Manfred Claassen, Paul-Christian Bürkner

    Abstract: We develop a framework for derivative Gaussian process latent variable models (DGP-LVM) that can handle multi-dimensional output data using modified derivative covariance functions. The modifications account for complexities in the underlying data generating process such as scaled derivatives, varying information across multiple output dimensions as well as interactions between outputs. Further, o… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 22 pages, 13 figures

  5. arXiv:2403.11141  [pdf, other

    cs.GR

    The Simplex Projection: Lossless Visualization of 4D Compositional Data on a 2D Canvas

    Authors: Marvin Schmitt, Yuga Hikida, Stefan T Radev, Filip Sadlo, Paul-Christian Bürkner

    Abstract: The simplex projection expands the capabilities of simplex plots (also known as ternary plots) to achieve a lossless visualization of 4D compositional data on a 2D canvas. Previously, this was only possible for 3D compositional data. We demonstrate how our approach can be applied to individual data points, point clouds, and continuous probability density functions on simplices. While we showcase o… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  6. arXiv:2403.08591  [pdf, other

    cs.CV

    ActionDiffusion: An Action-aware Diffusion Model for Procedure Planning in Instructional Videos

    Authors: Lei Shi, Paul Bürkner, Andreas Bulling

    Abstract: We present ActionDiffusion -- a novel diffusion model for procedure planning in instructional videos that is the first to take temporal inter-dependencies between actions into account in a diffusion model for procedure planning. This approach is in stark contrast to existing methods that fail to exploit the rich information content available in the particular order in which actions are performed.… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Submitted to IROS 2024

  7. arXiv:2401.10180  [pdf, other

    stat.ME

    Generalized Decomposition Priors on R2

    Authors: Javier Enrique Aguilar, Paul-Christian Bürkner

    Abstract: The adoption of continuous shrinkage priors in high-dimensional linear models has gained momentum, driven by their theoretical and practical advantages. One of these shrinkage priors is the R2D2 prior, which comes with intuitive hyperparameters and well understood theoretical properties. The core idea is to specify a prior on the percentage of explained variance $R^2$ and to conduct a Dirichlet de… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 31 pages, 12 figures

  8. arXiv:2312.05440  [pdf, other

    cs.LG cs.AI stat.ML

    Consistency Models for Scalable and Fast Simulation-Based Inference

    Authors: Marvin Schmitt, Valentin Pratz, Ullrich Köthe, Paul-Christian Bürkner, Stefan T Radev

    Abstract: Simulation-based inference (SBI) is constantly in search of more expressive algorithms for accurately inferring the parameters of complex models from noisy data. We present consistency models for neural posterior estimation (CMPE), a new free-form conditional sampler for scalable, fast, and amortized SBI with generative neural networks. CMPE combines the advantages of normalizing flows and flow ma… ▽ More

    Submitted 27 February, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  9. arXiv:2312.05153  [pdf, other

    stat.ML cs.LG

    Uncertainty Quantification and Propagation in Surrogate-based Bayesian Inference

    Authors: Philipp Reiser, Javier Enrique Aguilar, Anneli Guthke, Paul-Christian Bürkner

    Abstract: Surrogate models are statistical or conceptual approximations for more complex simulation models. In this context, it is crucial to propagate the uncertainty induced by limited simulation budget and surrogate approximation error to predictions, inference, and subsequent decision-relevant quantities. However, quantifying and then propagating the uncertainty of surrogates is usually limited to speci… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  10. arXiv:2311.10671  [pdf, other

    cs.LG cs.AI

    Fuse It or Lose It: Deep Fusion for Multimodal Simulation-Based Inference

    Authors: Marvin Schmitt, Stefan T. Radev, Paul-Christian Bürkner

    Abstract: We present multimodal neural posterior estimation (MultiNPE), a method to integrate heterogeneous data from different sources in simulation-based inference with neural networks. Inspired by advances in deep fusion learning, it empowers researchers to analyze data from different domains and infer the parameters of complex mathematical models with increased accuracy. We formulate multimodal fusion a… ▽ More

    Submitted 26 February, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  11. arXiv:2311.09081  [pdf, other

    stat.ME

    Posterior accuracy and calibration under misspecification in Bayesian generalized linear models

    Authors: Maximilian Scholz, Paul-Christian Bürkner

    Abstract: Generalized linear models (GLMs) are popular for data-analysis in almost all quantitative sciences, but the choice of likelihood family and link function is often difficult. This motivates the search for likelihoods and links that minimize the impact of potential misspecification. We perform a large-scale simulation study on double-bounded and lower-bounded response data where we systematically va… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2210.06927

  12. arXiv:2310.11122  [pdf, other

    stat.ML cs.LG stat.ME

    Sensitivity-Aware Amortized Bayesian Inference

    Authors: Lasse Elsemüller, Hans Olischläger, Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Sensitivity analyses reveal the influence of various modeling choices on the outcomes of statistical analyses. While theoretically appealing, they are overwhelmingly inefficient for complex Bayesian models. In this work, we propose sensitivity-aware amortized Bayesian inference (SA-ABI), a multifaceted approach to efficiently integrate sensitivity analyses into simulation-based inference with neur… ▽ More

    Submitted 8 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  13. arXiv:2310.04395  [pdf, other

    cs.LG cs.AI

    Leveraging Self-Consistency for Data-Efficient Amortized Bayesian Inference

    Authors: Marvin Schmitt, Desi R. Ivanova, Daniel Habermann, Ullrich Köthe, Paul-Christian Bürkner, Stefan T. Radev

    Abstract: We propose a method to improve the efficiency and accuracy of amortized Bayesian inference by leveraging universal symmetries in the joint probabilistic model of parameters and data. In a nutshell, we invert Bayes' theorem and estimate the marginal likelihood based on approximate representations of the joint model. Upon perfect approximation, the marginal likelihood is constant across all paramete… ▽ More

    Submitted 26 February, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: previously published as an extended abstract at NeurIPS UniReps 2023

  14. arXiv:2308.12194  [pdf, other

    cs.HC

    Inferring Human Intentions from Predicted Action Probabilities

    Authors: Lei Shi, Paul-Christian Bürkner, Andreas Bulling

    Abstract: Predicting the next action that a human is most likely to perform is key to human-AI collaboration and has consequently attracted increasing research interests in recent years. An important factor for next action prediction are human intentions: If the AI agent knows the intention it can predict future actions and plan collaboration more effectively. Existing Bayesian methods for this task struggl… ▽ More

    Submitted 25 March, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted by Workshop on Theory of Mind in Human-AI Interaction at CHI 2024

  15. arXiv:2308.11672  [pdf, other

    stat.ME stat.ML

    Simulation-Based Prior Knowledge Elicitation for Parametric Bayesian Models

    Authors: Florence Bockting, Stefan T. Radev, Paul-Christian Bürkner

    Abstract: A central characteristic of Bayesian statistics is the ability to consistently incorporate prior knowledge into various modeling processes. In this paper, we focus on translating domain expert knowledge into corresponding prior distributions over model parameters, a process known as prior elicitation. Expert knowledge can manifest itself in diverse formats, including information about raw data, su… ▽ More

    Submitted 15 April, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

  16. arXiv:2306.16015  [pdf, other

    cs.LG cs.AI stat.ML

    BayesFlow: Amortized Bayesian Workflows With Neural Networks

    Authors: Stefan T Radev, Marvin Schmitt, Lukas Schumacher, Lasse Elsemüller, Valentin Pratz, Yannik Schälte, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: Modern Bayesian inference involves a mixture of computational techniques for estimating, validating, and drawing conclusions from probabilistic models as part of principled workflows for data analysis. Typical problems in Bayesian workflows are the approximation of intractable posterior distributions for diverse model types and the comparison of competing models of the same process in terms of the… ▽ More

    Submitted 10 July, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

  17. arXiv:2302.09125  [pdf, other

    cs.LG stat.ML

    JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models

    Authors: Stefan T. Radev, Marvin Schmitt, Valentin Pratz, Umberto Picchini, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: This work proposes ``jointly amortized neural approximation'' (JANA) of intractable likelihood functions and posterior densities arising in Bayesian surrogate modeling and simulation-based inference. We train three complementary networks in an end-to-end fashion: 1) a summary network to compress individual data points, sets, or time series into informative embedding vectors; 2) a posterior network… ▽ More

    Submitted 20 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  18. arXiv:2301.11873  [pdf, other

    stat.ML cs.LG stat.ME

    A Deep Learning Method for Comparing Bayesian Hierarchical Models

    Authors: Lasse Elsemüller, Martin Schnuerch, Paul-Christian Bürkner, Stefan T. Radev

    Abstract: Bayesian model comparison (BMC) offers a principled approach for assessing the relative merits of competing computational models and propagating uncertainty into model selection decisions. However, BMC is often intractable for the popular class of hierarchical models due to their high-dimensional nested parameter structure. To address this intractability, we propose a deep learning method for perf… ▽ More

    Submitted 23 November, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  19. arXiv:2211.13165  [pdf, other

    stat.ME stat.ML

    Neural Superstatistics for Bayesian Estimation of Dynamic Cognitive Models

    Authors: Lukas Schumacher, Paul-Christian Bürkner, Andreas Voss, Ullrich Köthe, Stefan T. Radev

    Abstract: Mathematical models of cognition are often memoryless and ignore potential fluctuations of their parameters. However, human cognition is inherently dynamic. Thus, we propose to augment mechanistic cognitive models with a temporal dimension and estimate the resulting dynamics from a superstatistics perspective. Such a model entails a hierarchy between a low-level observation model and a high-level… ▽ More

    Submitted 20 September, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  20. arXiv:2211.12556  [pdf, other

    stat.ME

    Optimal design of the Wilcoxon-Mann-Whitney-test

    Authors: Paul-Christian Bürkner, Philipp Doebler, Heinz Holling

    Abstract: In scientific research, many hypotheses relate to the comparison of two independent groups. Usually, it is of interest to use a design (i.e., the allocation of sample sizes $m$ and $n$ for fixed $N = m + n$) that maximizes the power of the applied statistical test. It is known that the two-sample t-tests for homogeneous and heterogeneous variances may lose substantial power when variances are uneq… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  21. arXiv:2211.12538  [pdf, other

    stat.ME

    Testing for Publication Bias in Diagnostic Meta-Analysis: A Simulation Study

    Authors: Paul-Christian Bürkner, Philipp Doebler

    Abstract: The present study investigates the performance of several statistical tests to detect publication bias in diagnostic meta-analysis by means of simulation. While bivariate models should be used to pool data from primary studies in diagnostic meta-analysis, univariate measures of diagnostic accuracy are preferable for the purpose of detecting publication bias. In contrast to earlier research, which… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2002.04775 by other authors

  22. Simulation-Based Calibration Checking for Bayesian Computation: The Choice of Test Quantities Shapes Sensitivity

    Authors: Martin Modrák, Angie H. Moon, Shinyoung Kim, Paul Bürkner, Niko Huurre, Kateřina Faltejsková, Andrew Gelman, Aki Vehtari

    Abstract: Simulation-based calibration checking (SBC) is a practical method to validate computationally-derived posterior distributions or their approximations. In this paper, we introduce a new variant of SBC to alleviate several known problems. Our variant allows the user to in principle detect any possible issue with the posterior, while previously reported implementations could never detect large classe… ▽ More

    Submitted 19 October, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: 50 pages, 11 figures, upcoming in Bayesian Analysis

  23. arXiv:2210.10487  [pdf, other

    cs.LG stat.ML

    Estimating the Contamination Factor's Distribution in Unsupervised Anomaly Detection

    Authors: Lorenzo Perini, Paul Buerkner, Arto Klami

    Abstract: Anomaly detection methods identify examples that do not follow the expected behaviour, typically in an unsupervised fashion, by assigning real-valued anomaly scores to the examples based on various heuristics. These scores need to be transformed into actual predictions by thresholding, so that the proportion of examples marked as anomalies equals the expected proportion of anomalies, called contam… ▽ More

    Submitted 17 October, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  24. arXiv:2210.07278  [pdf, other

    stat.ML cs.LG

    Meta-Uncertainty in Bayesian Model Comparison

    Authors: Marvin Schmitt, Stefan T. Radev, Paul-Christian Bürkner

    Abstract: Bayesian model comparison (BMC) offers a principled probabilistic approach to study and rank competing models. In standard BMC, we construct a discrete probability distribution over the set of possible models, conditional on the observed data of interest. These posterior model probabilities (PMPs) are measures of uncertainty, but -- when derived from a finite number of observations -- are also unc… ▽ More

    Submitted 21 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: accepted at AISTATS 2023

  25. arXiv:2210.06927  [pdf, other

    stat.ME

    Prediction can be safely used as a proxy for explanation in causally consistent Bayesian generalized linear models

    Authors: Maximilian Scholz, Paul-Christian Bürkner

    Abstract: Bayesian modeling provides a principled approach to quantifying uncertainty in model parameters and model structure and has seen a surge of applications in recent years. Within the context of a Bayesian workflow, we are concerned with model selection for the purpose of finding models that best explain the data, that is, help us understand the underlying data generating process. Since we rarely hav… ▽ More

    Submitted 18 March, 2024; v1 submitted 13 October, 2022; originally announced October 2022.

  26. arXiv:2209.02439  [pdf, other

    stat.ME

    Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy

    Authors: Paul-Christian Bürkner, Maximilian Scholz, Stefan T. Radev

    Abstract: Probabilistic (Bayesian) modeling has experienced a surge of applications in almost all quantitative sciences and industrial areas. This development is driven by a combination of several factors, including better probabilistic estimation algorithms, flexible software, increased computing power, and a growing awareness of the benefits of probabilistic learning. However, a principled Bayesian model… ▽ More

    Submitted 25 September, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

  27. arXiv:2208.07132  [pdf, other

    stat.ME stat.CO

    Intuitive Joint Priors for Bayesian Linear Multilevel Models: The R2D2M2 prior

    Authors: Javier Enrique Aguilar, Paul-Christian Bürkner

    Abstract: The training of high-dimensional regression models on comparably sparse data is an important yet complicated topic, especially when there are many more model parameters than observations in the data. From a Bayesian perspective, inference in such cases can be achieved with the help of shrinkage prior distributions, at least for generalized linear models. However, real-world data usually possess mu… ▽ More

    Submitted 11 June, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: 61 pages, 21 figures, 9 tables

  28. A fully Bayesian sparse polynomial chaos expansion approach with joint priors on the coefficients and global selection of terms

    Authors: Paul-Christian Bürkner, Ilja Kröker, Sergey Oladyshkin, Wolfgang Nowak

    Abstract: Polynomial chaos expansion (PCE) is a versatile tool widely used in uncertainty quantification and machine learning, but its successful application depends strongly on the accuracy and reliability of the resulting PCE-based response surface. High accuracy typically requires high polynomial degrees, demanding many training points especially in high-dimensional problems through the curse of dimensio… ▽ More

    Submitted 13 January, 2023; v1 submitted 12 April, 2022; originally announced April 2022.

  29. arXiv:2112.08866  [pdf, other

    stat.ME cs.LG stat.ML

    Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks

    Authors: Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

    Abstract: Neural density estimators have proven remarkably powerful in performing efficient simulation-based Bayesian inference in various research domains. In particular, the BayesFlow framework uses a two-step approach to enable amortized parameter estimation in settings where the likelihood function is implicitly defined by a simulation program. But how faithful is such inference when simulations are poo… ▽ More

    Submitted 8 November, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

  30. arXiv:2112.01380  [pdf, other

    stat.ME

    Prior knowledge elicitation: The past, present, and future

    Authors: Petrus Mikkola, Osvaldo A. Martin, Suyog Chandramouli, Marcelo Hartmann, Oriol Abril Pla, Owen Thomas, Henri Pesonen, Jukka Corander, Aki Vehtari, Samuel Kaski, Paul-Christian Bürkner, Arto Klami

    Abstract: Specification of the prior distribution for a Bayesian model is a central part of the Bayesian workflow for data analysis, but it is often difficult even for statistical experts. In principle, prior elicitation transforms domain knowledge of various kinds into well-defined prior distributions, and offers a solution to the prior specification problem. In practice, however, we are still fairly far f… ▽ More

    Submitted 9 May, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: 69 pages, 1 figure

  31. arXiv:2109.04702  [pdf, other

    stat.CO

    Latent space projection predictive inference

    Authors: Alejandro Catalina, Paul Bürkner, Aki Vehtari

    Abstract: Given a reference model that includes all the available variables, projection predictive inference replaces its posterior with a constrained projection including only a subset of all variables. We extend projection predictive inference to enable computationally efficient variable and structure selection in models outside the exponential family. By adopting a latent space projection predictive pers… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

  32. Detecting and diagnosing prior and likelihood sensitivity with power-scaling

    Authors: Noa Kallioinen, Topi Paananen, Paul-Christian Bürkner, Aki Vehtari

    Abstract: Determining the sensitivity of the posterior to perturbations of the prior and likelihood is an important part of the Bayesian workflow. We introduce a practical and computationally efficient sensitivity analysis approach using importance sampling to estimate properties of posteriors resulting from power-scaling the prior or likelihood. On this basis, we suggest a diagnostic that can indicate the… ▽ More

    Submitted 26 May, 2023; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: 31 pages, 15 (+5 suppl) figures

  33. Graphical Test for Discrete Uniformity and its Applications in Goodness of Fit Evaluation and Multiple Sample Comparison

    Authors: Teemu Säilynoja, Paul-Christian Bürkner, Aki Vehtari

    Abstract: Assessing goodness of fit to a given distribution plays an important role in computational statistics. The Probability integral transformation (PIT) can be used to convert the question of whether a given sample originates from a reference distribution into a problem of testing for uniformity. We present new simulation and optimization based methods to obtain simultaneous confidence bands for the w… ▽ More

    Submitted 17 November, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

  34. arXiv:2103.08744  [pdf, other

    stat.ME

    Workflow Techniques for the Robust Use of Bayes Factors

    Authors: Daniel J. Schad, Bruno Nicenboim, Paul-Christian Bürkner, Michael Betancourt, Shravan Vasishth

    Abstract: Inferences about hypotheses are ubiquitous in the cognitive sciences. Bayes factors provide one general way to compare different hypotheses by their compatibility with the observed data. Those quantifications can then also be used to choose between hypotheses. While Bayes factors provide an immediate approach to hypothesis testing, they are highly sensitive to details of the data/model assumptions… ▽ More

    Submitted 18 March, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

  35. arXiv:2011.01808  [pdf, other

    stat.ME

    Bayesian Workflow

    Authors: Andrew Gelman, Aki Vehtari, Daniel Simpson, Charles C. Margossian, Bob Carpenter, Yuling Yao, Lauren Kennedy, Jonah Gabry, Paul-Christian Bürkner, Martin Modrák

    Abstract: The Bayesian approach to data analysis provides a powerful way to handle uncertainty in all observations, model parameters, and model structure using probability theory. Probabilistic programming languages make it easier to specify and fit Bayesian models, but this still leaves us with many options regarding constructing, evaluating, and using these models, along with many remaining challenges in… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 77 pages, 35 figures

  36. arXiv:2010.06994  [pdf, other

    stat.ME stat.CO

    Projection Predictive Inference for Generalized Linear and Additive Multilevel Models

    Authors: Alejandro Catalina, Paul-Christian Bürkner, Aki Vehtari

    Abstract: Projection predictive inference is a decision theoretic Bayesian approach that decouples model estimation from decision making. Given a reference model previously built including all variables present in the data, projection predictive inference projects its posterior onto a constrained space of a subset of variables. Variable selection is then performed by sequentially adding relevant variables u… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

  37. arXiv:2005.03899  [pdf, other

    stat.ML cs.LG

    Amortized Bayesian Inference for Models of Cognition

    Authors: Stefan T. Radev, Andreas Voss, Eva Marie Wieschen, Paul-Christian Bürkner

    Abstract: As models of cognition grow in complexity and number of parameters, Bayesian inference with standard methods can become intractable, especially when the data-generating model is of unknown analytic form. Recent advances in simulation-based inference using specialized neural network architectures circumvent many previous problems of approximate Bayesian computation. Moreover, due to the properties… ▽ More

    Submitted 13 July, 2020; v1 submitted 8 May, 2020; originally announced May 2020.

  38. arXiv:2005.02773  [pdf, other

    stat.ME stat.CO stat.ML

    Group Heterogeneity Assessment for Multilevel Models

    Authors: Topi Paananen, Alejandro Catalina, Paul-Christian Bürkner, Aki Vehtari

    Abstract: Many data sets contain an inherent multilevel structure, for example, because of repeated measurements of the same observational units. Taking this structure into account is critical for the accuracy and calibration of any statistical analysis performed on such data. However, the large number of possible model configurations hinders the use of multilevel models in practice. In this work, we propos… ▽ More

    Submitted 6 May, 2020; originally announced May 2020.

  39. arXiv:2004.13118  [pdf, other

    stat.ME stat.CO

    Using reference models in variable selection

    Authors: Federico Pavone, Juho Piironen, Paul-Christian Bürkner, Aki Vehtari

    Abstract: Variable selection, or more generally, model reduction is an important aspect of the statistical workflow aiming to provide insights from data. In this paper, we discuss and demonstrate the benefits of using a reference model in variable selection. A reference model acts as a noise-filter on the target variable by modeling its data generating mechanism. As a result, using the reference model predi… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

  40. arXiv:2004.11408  [pdf, other

    stat.CO stat.ME

    Practical Hilbert space approximate Bayesian Gaussian processes for probabilistic programming

    Authors: Gabriel Riutort-Mayol, Paul-Christian Bürkner, Michael R. Andersen, Arno Solin, Aki Vehtari

    Abstract: Gaussian processes are powerful non-parametric probabilistic models for stochastic functions. However, the direct implementation entails a complexity that is computationally intractable when the number of observations is large, especially when estimated with fully Bayesian methods such as Markov chain Monte Carlo. In this paper, we focus on a low-rank approximate Bayesian Gaussian processes, based… ▽ More

    Submitted 22 March, 2022; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: 27 pages, 18 figures

  41. arXiv:2004.10629  [pdf, other

    stat.ML cs.LG

    Amortized Bayesian model comparison with evidential deep learning

    Authors: Stefan T. Radev, Marco D'Alessandro, Ulf K. Mertens, Andreas Voss, Ullrich Köthe, Paul-Christian Bürkner

    Abstract: Comparing competing mathematical models of complex natural processes is a shared goal among many branches of science. The Bayesian probabilistic framework offers a principled way to perform model comparison and extract useful metrics for guiding decisions. However, many interesting models are intractable with standard Bayesian methods, as they lack a closed-form likelihood function or the likeliho… ▽ More

    Submitted 2 March, 2021; v1 submitted 22 April, 2020; originally announced April 2020.

  42. arXiv:2002.09868  [pdf, other

    stat.ME

    Flexible Prior Elicitation via the Prior Predictive Distribution

    Authors: Marcelo Hartmann, Georgi Agiashvili, Paul Bürkner, Arto Klami

    Abstract: The prior distribution for the unknown model parameters plays a crucial role in the process of statistical inference based on Bayesian methods. However, specifying suitable priors is often difficult even when detailed prior knowledge is available in principle. The challenge is to express quantitative information in the form of a probability distribution. Prior elicitation addresses this question b… ▽ More

    Submitted 16 March, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

    Comments: 24 pages, 3 figures, conference submission

  43. arXiv:1906.08850  [pdf, other

    stat.CO stat.ME stat.ML

    Implicitly Adaptive Importance Sampling

    Authors: Topi Paananen, Juho Piironen, Paul-Christian Bürkner, Aki Vehtari

    Abstract: Adaptive importance sampling is a class of techniques for finding good proposal distributions for importance sampling. Often the proposal distributions are standard probability distributions whose parameters are adapted based on the mismatch between the current proposal and a target distribution. In this work, we present an implicit adaptive importance sampling method that applies to complicated d… ▽ More

    Submitted 6 May, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: Major revision: More comparisons to adaptive importance sampling with parametric distributions

    Journal ref: Stat Comput 31, 16 (2021)

  44. arXiv:1905.09501  [pdf, other

    stat.CO

    Bayesian Item Response Modeling in R with brms and Stan

    Authors: Paul-Christian Bürkner

    Abstract: Item Response Theory (IRT) is widely applied in the human sciences to model persons' responses on a set of items measuring one or more latent constructs. While several R packages have been developed that implement IRT models, they tend to be restricted to respective prespecified classes of models. Further, most implementations are frequentist while the availability of Bayesian methods remains comp… ▽ More

    Submitted 1 February, 2020; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: 54 pages, 16 figures, 3 tables

  45. arXiv:1903.08008  [pdf, other

    stat.CO stat.ME

    Rank-normalization, folding, and localization: An improved $\widehat{R}$ for assessing convergence of MCMC

    Authors: Aki Vehtari, Andrew Gelman, Daniel Simpson, Bob Carpenter, Paul-Christian Bürkner

    Abstract: Markov chain Monte Carlo is a key computational tool in Bayesian statistics, but it can be challenging to monitor the convergence of an iterative stochastic algorithm. In this paper we show that the convergence diagnostic $\widehat{R}$ of Gelman and Rubin (1992) has serious flaws. Traditional $\widehat{R}$ will fail to correctly diagnose convergence failures when the chain has a heavy tail or when… ▽ More

    Submitted 22 June, 2021; v1 submitted 19 March, 2019; originally announced March 2019.

    Comments: Two small fixes. Published in Bayesian analysis https://doi.org/10.1214/20-BA1221

  46. Approximate leave-future-out cross-validation for Bayesian time series models

    Authors: Paul-Christian Bürkner, Jonah Gabry, Aki Vehtari

    Abstract: One of the common goals of time series analysis is to use the observed series to inform predictions for future observations. In the absence of any actual new data to predict, cross-validation can be used to estimate a model's future predictive accuracy, for instance, for the purpose of model comparison or selection. Exact cross-validation for Bayesian models is often computationally expensive, but… ▽ More

    Submitted 8 May, 2020; v1 submitted 17 February, 2019; originally announced February 2019.

    Comments: 28 pages, 15 figures, 2 tables

    Journal ref: Journal of Statistical Computation and Simulation (2020)

  47. Efficient leave-one-out cross-validation for Bayesian non-factorized normal and Student-t models

    Authors: Paul-Christian Bürkner, Jonah Gabry, Aki Vehtari

    Abstract: Cross-validation can be used to measure a model's predictive accuracy for the purpose of model comparison, averaging, or selection. Standard leave-one-out cross-validation (LOO-CV) requires that the observation model can be factorized into simple terms, but a lot of important models in temporal and spatial statistics do not have this property or are inefficient or unstable when forced into a facto… ▽ More

    Submitted 1 October, 2020; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: 18 pages, 3 figures

    Journal ref: Computational Statistics, 2020

  48. arXiv:1803.06517  [pdf, other

    math.ST stat.ME

    Optimal Designs for the Generalized Partial Credit Model

    Authors: Paul-Christian Bürkner, Rainer Schwabe, Heinz Holling

    Abstract: Analyzing ordinal data becomes increasingly important in psychology, especially in the context of item response theory. The generalized partial credit model (GPCM) is probably the most widely used ordinal model and finds application in many large scale educational assessment studies such as PISA. In the present paper, optimal test designs are investigated for estimating persons' abilities with the… ▽ More

    Submitted 19 October, 2018; v1 submitted 17 March, 2018; originally announced March 2018.

  49. arXiv:1705.11123  [pdf, other

    stat.CO

    Advanced Bayesian Multilevel Modeling with the R Package brms

    Authors: Paul-Christian Bürkner

    Abstract: The brms package allows R users to easily specify a wide range of Bayesian single-level and multilevel models, which are fitted with the probabilistic programming language Stan behind the scenes. Several response distributions are supported, of which all parameters (e.g., location, scale, and shape) can be predicted at the same time thus allowing for distributional regression. Non-linear relations… ▽ More

    Submitted 15 October, 2017; v1 submitted 31 May, 2017; originally announced May 2017.