Showing 1–29 of 29 results for author: Rockova, V

  1. arXiv:2404.10436

    cs.LG stat.CO stat.ME

    Tree Bandits for Generative Bayes

    Authors: Sean O'Hagan, Jungeum Kim, Veronika Rockova

    Abstract: In generative models with obscured likelihood, Approximate Bayesian Computation (ABC) is often the tool of last resort for inference. However, ABC demands many prior parameter trials to keep only a small fraction that passes an acceptance test. To accelerate ABC rejection sampling, this paper develops a self-aware framework that learns from past trials and errors. We apply recursive partitioning c…

    Submitted 16 April, 2024; originally announced April 2024.

  2. arXiv:2312.05411

    stat.ME stat.CO stat.ML

    Deep Bayes Factors

    Authors: Jungeum Kim, Veronika Rockova

    Abstract: There is no other model or hypothesis verification tool in Bayesian statistics that is as widely used as the Bayes factor. We focus on generative models that are likelihood-free and, therefore, render the computation of Bayes factors (marginal likelihood ratios) far from obvious. We propose a deep learning estimator of the Bayes factor based on simulated data from two competing models using the like…

    Submitted 12 June, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  3. arXiv:2310.17820

    stat.ME stat.ML

    Sparse Bayesian Multidimensional Item Response Theory

    Authors: Jiguang Li, Robert Gibbons, Veronika Rockova

    Abstract: Multivariate Item Response Theory (MIRT) is widely sought after by applied researchers looking for interpretable (sparse) explanations underlying response patterns in questionnaire data. There is, however, an unmet demand for such sparsity discovery tools in practice. Our paper develops a Bayesian platform for binary and ordinal item MIRT which requires minimal tuning and scales well on large data…

    Submitted 11 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  4. arXiv:2309.02369

    math.ST

    Adaptive Bayesian Predictive Inference in High-dimensional Regression

    Authors: Veronika Rockova

    Abstract: Bayesian predictive inference provides a coherent description of entire predictive uncertainty through predictive distributions. We examine several widely used sparsity priors from the predictive (as opposed to estimation) inference viewpoint. To start, we investigate predictive distributions in the context of a high-dimensional Gaussian observation with a known variance but an unknown sparse mean…

    Submitted 30 May, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

  5. arXiv:2306.00126

    math.ST stat.ML

    On Mixing Rates for Bayesian CART

    Authors: Jungeum Kim, Veronika Rockova

    Abstract: The success of Bayesian inference with MCMC depends critically on Markov chains rapidly reaching the posterior distribution. Despite the plentitude of inferential theory for posteriors in Bayesian non-parametrics, convergence properties of MCMC algorithms that simulate from such ideal inferential targets are not thoroughly understood. This work focuses on the Bayesian CART algorithm which forms a…

    Submitted 31 May, 2023; originally announced June 2023.

  6. arXiv:2208.12113

    stat.ME stat.CO stat.ML

    Adversarial Bayesian Simulation

    Authors: Yuexi Wang, Veronika Ročková

    Abstract: In the absence of explicit or tractable likelihoods, Bayesians often resort to approximate Bayesian computation (ABC) for inference. Our work bridges ABC with deep neural implicit samplers based on generative adversarial networks (GANs) and adversarial variational Bayes. Both ABC and GANs compare aspects of observed and fake data to simulate from posteriors and likelihoods, respectively. We develo…

    Submitted 20 July, 2023; v1 submitted 25 August, 2022; originally announced August 2022.

  7. arXiv:2205.15374

    stat.CO

    Deep Bootstrap for Bayesian Inference

    Authors: Lizhen Nie, Veronika Rockova

    Abstract: For a Bayesian, the task to define the likelihood can be as perplexing as the task to define the prior. We focus on situations when the parameter of interest has been emancipated from the likelihood and is linked to data directly through a loss function. We survey existing work on both Bayesian parametric inference with Gibbs posteriors as well as Bayesian non-parametric inference. We then highlig…

    Submitted 30 May, 2022; originally announced May 2022.

  8. arXiv:2111.11507

    stat.ME stat.ML

    Approximate Bayesian Computation via Classification

    Authors: Yuexi Wang, Tetsuya Kaji, Veronika Ročková

    Abstract: Approximate Bayesian Computation (ABC) enables statistical inference in simulator-based models whose likelihoods are difficult to calculate but easy to simulate from. ABC constructs a kernel-type approximation to the posterior distribution through an accept/reject mechanism which compares summary statistics of real and simulated data. To obviate the need for summary statistics, we directly compare…

    Submitted 30 November, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  9. arXiv:2105.12793

    math.ST

    Ideal Bayesian Spatial Adaptation

    Authors: Veronika Rockova, Judith Rousseau

    Abstract: Many real-life applications involve estimation of curves that exhibit complicated shapes including jumps or varying-frequency oscillations. Practical methods have been devised that can adapt to a locally varying complexity of an unknown function (e.g. variable-knot splines, sparse wavelet reconstructions, kernel methods or trees/forests). However, the overwhelming majority of existing asymptotic m…

    Submitted 26 May, 2021; originally announced May 2021.

  10. arXiv:2103.04177

    math.ST

    Metropolis-Hastings via Classification

    Authors: Tetsuya Kaji, Veronika Rockova

    Abstract: This paper develops a Bayesian computational platform at the interface between posterior sampling and optimization in models whose marginal likelihoods are difficult to evaluate. Inspired by adversarial optimization, namely Generative Adversarial Networks (GAN), we reframe the likelihood function estimation problem as a classification problem. Pitting a Generator, who simulates fake data, against…

    Submitted 29 November, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

  11. arXiv:2011.14279

    stat.ME stat.CO

    Bayesian Bootstrap Spike-and-Slab LASSO

    Authors: Lizhen Nie, Veronika Ročková

    Abstract: The impracticality of posterior sampling has prevented the widespread adoption of spike-and-slab priors in high-dimensional applications. To alleviate the computational burden, optimization strategies have been proposed that quickly find local posterior modes. Trading off uncertainty quantification for computational speed, these strategies have enabled spike-and-slab deployments at scales that wou…

    Submitted 29 March, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

  12. Spike-and-Slab Meets LASSO: A Review of the Spike-and-Slab LASSO

    Authors: Ray Bai, Veronika Rockova, Edward I. George

    Abstract: High-dimensional data sets have become ubiquitous in the past few decades, often with many more covariates than observations. In the frequentist setting, penalized likelihood methods are the most popular approach for variable selection and estimation in high-dimensional data. In the Bayesian framework, spike-and-slab methods are commonly used as probabilistic constructs for high-dimensional modeli…

    Submitted 7 May, 2021; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: 34 pages, 2 tables, 3 figures. Section 3.3 was added to illustrate the method

  13. arXiv:2008.06620

    math.ST

    The art of BART: Minimax optimality over nonhomogeneous smoothness in high dimension

    Authors: Seonghyun Jeong, Veronika Rockova

    Abstract: Many asymptotically minimax procedures for function estimation often rely on somewhat arbitrary and restrictive assumptions such as isotropy or spatial homogeneity. This work enhances the theoretical understanding of Bayesian additive regression trees under substantially relaxed smoothness assumptions. We provide a comprehensive study of asymptotic optimality and posterior contraction of Bayesian…

    Submitted 3 December, 2023; v1 submitted 14 August, 2020; originally announced August 2020.

  14. arXiv:2007.00187

    cs.LG stat.CO stat.ME

    Variable Selection via Thompson Sampling

    Authors: Yi Liu, Veronika Rockova

    Abstract: Thompson sampling is a heuristic algorithm for the multi-armed bandit problem which has a long tradition in machine learning. The algorithm has a Bayesian spirit in the sense that it selects arms based on posterior samples of reward probabilities of each arm. By forging a connection between combinatorial binary bandits and spike-and-slab variable selection, we propose a stochastic optimization app…

    Submitted 11 February, 2021; v1 submitted 30 June, 2020; originally announced July 2020.

  15. arXiv:2002.11815

    math.ST stat.ML

    Uncertainty Quantification for Sparse Deep Learning

    Authors: Yuexi Wang, Veronika Ročková

    Abstract: Deep learning methods continue to have a decided impact on machine learning, both in theory and in practice. Statistical theoretical developments have been mostly concerned with approximability or rates of estimation when recovering infinite dimensional objects (curves or densities). Despite the impressive array of available theoretical results, the literature has been largely silent about uncerta…

    Submitted 12 July, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, PMLR 108:298-308, 2020

  16. arXiv:1910.07635

    math.ST

    Uncertainty Quantification for Bayesian CART

    Authors: Ismael Castillo, Veronika Rockova

    Abstract: This work affords new insights into Bayesian CART in the context of structured wavelet shrinkage. The main thrust is to develop a formal inferential framework for Bayesian tree-based regression. We reframe Bayesian CART as a g-type prior which departs from the typical wavelet product priors by harnessing correlation induced by the tree topology. The practically used Bayesian CART priors are shown…

    Submitted 24 May, 2021; v1 submitted 16 October, 2019; originally announced October 2019.

  17. arXiv:1909.06631

    stat.ME stat.AP stat.CO

    Adaptive Bayesian SLOPE -- High-dimensional Model Selection with Missing Values

    Authors: Wei Jiang, Malgorzata Bogdan, Julie Josse, Blazej Miasojedow, Veronika Rockova, TraumaBase Group

    Abstract: We consider the problem of variable selection in high-dimensional settings with missing observations among the covariates. To address this relatively understudied problem, we propose a new synergistic procedure -- adaptive Bayesian SLOPE -- which effectively combines the SLOPE method (sorted $l_1$ regularization) together with the Spike-and-Slab LASSO method. We position our approach within a Baye…

    Submitted 6 November, 2019; v1 submitted 14 September, 2019; originally announced September 2019.

    Comments: R package https://github.com/wjiang94/ABSLOPE

  18. arXiv:1905.03735

    math.ST

    On Semi-parametric Bernstein-von Mises Theorems for BART

    Authors: Veronika Rockova

    Abstract: Few methods in Bayesian non-parametric statistics/machine learning have received as much attention as Bayesian Additive Regression Trees (BART). While BART is now routinely performed for prediction tasks, its theoretical properties began to be understood only very recently. In this work, we continue the theoretical investigation of BART initiated by Rockova and van der Pas (2017). In particular,…

    Submitted 9 May, 2019; originally announced May 2019.

  19. arXiv:1812.04187

    stat.ME

    Dynamic Sparse Factor Analysis

    Authors: Kenichiro McAlinn, Veronika Rockova, Enakshi Saha

    Abstract: Its conceptual appeal and effectiveness have made latent factor modeling an indispensable tool for multivariate analysis. Despite its popularity across many fields, there are outstanding methodological challenges that have hampered practical deployments. One major challenge is the selection of the number of factors, which is exacerbated for dynamic factor models, where factors can disappear, emerge…

    Submitted 10 December, 2018; originally announced December 2018.

  20. arXiv:1810.00787

    stat.ML cs.LG

    On Theory for BART

    Authors: Veronika Rockova, Enakshi Saha

    Abstract: Ensemble learning is a statistical paradigm built on the premise that many weak learners can perform exceptionally well when deployed collectively. The BART method of Chipman et al. (2010) is a prominent example of Bayesian ensemble learning, where each learner is a tree. Due to its impressive performance, BART has received a lot of attention from practitioners. Despite its wide popularity, howeve…

    Submitted 5 October, 2018; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: 22

  21. arXiv:1807.08336

    math.ST

    The Median Probability Model and Correlated Variables

    Authors: Marilena Barbieri, James O. Berger, Edward I. George, Veronika Rockova

    Abstract: The median probability model (MPM) of Barbieri and Berger (2004) is defined as the model consisting of those variables whose marginal posterior probability of inclusion is at least 0.5. The MPM rule yields the best single model for prediction in orthogonal and nested correlated designs. This result was originally conceived under a specific class of priors, such as the point mass mixtures of non-infor…

    Submitted 17 August, 2018; v1 submitted 22 July, 2018; originally announced July 2018.

  22. Variable Selection with ABC Bayesian Forests

    Authors: Yi Liu, Veronika Ročková, Yuexi Wang

    Abstract: Few problems in statistics are as perplexing as variable selection in the presence of very many redundant covariates. The variable selection problem is most familiar in parametric environments such as the linear model or additive variants thereof. In this work, we abandon the linear model framework, which can be quite detrimental when the covariates impact the outcome in a non-linear way, and turn…

    Submitted 24 February, 2021; v1 submitted 6 June, 2018; originally announced June 2018.

  23. arXiv:1803.09138

    stat.ML cs.LG

    Posterior Concentration for Sparse Deep Learning

    Authors: Nicholas Polson, Veronika Rockova

    Abstract: Spike-and-Slab Deep Learning (SS-DL) is a fully Bayesian alternative to Dropout for improving generalizability of deep ReLU networks. This new type of regularization enables provable recovery of smooth input-output maps with unknown levels of smoothness. Indeed, we show that the posterior distribution concentrates at the near minimax rate for $α$-Hölder smooth maps, performing as well as if we kne…

    Submitted 24 March, 2018; originally announced March 2018.

  24. arXiv:1801.03019

    stat.ME

    Variance prior forms for high-dimensional Bayesian variable selection

    Authors: Gemma E. Moran, Veronika Rockova, Edward I. George

    Abstract: Consider the problem of high dimensional variable selection for the Gaussian linear model when the unknown error variance is also of interest. In this paper, we show that the use of conjugate shrinkage priors for Bayesian variable selection can have detrimental consequences for such variance estimation. Such priors are often motivated by the invariance argument of Jeffreys (1961). Revisiting this…

    Submitted 13 November, 2018; v1 submitted 9 January, 2018; originally announced January 2018.

  25. Simultaneous Variable and Covariance Selection with the Multivariate Spike-and-Slab Lasso

    Authors: Sameer K. Deshpande, Veronika Rockova, Edward I. George

    Abstract: We propose a Bayesian procedure for simultaneous variable and covariance selection using continuous spike-and-slab priors in multivariate linear regression models where q possibly correlated responses are regressed onto p predictors. Rather than relying on a stochastic search through the high-dimensional model space, we develop an ECM algorithm similar to the EMVS procedure of Rockova & George (20…

    Submitted 24 July, 2018; v1 submitted 29 August, 2017; originally announced August 2017.

  26. arXiv:1708.08734

    math.ST

    Posterior Concentration for Bayesian Regression Trees and Forests

    Authors: Veronika Rockova, Stephanie van der Pas

    Abstract: Since their inception in the 1980's, regression trees have been one of the more widely used non-parametric prediction methods. Tree-structured methods yield a histogram reconstruction of the regression surface, where the bins correspond to terminal nodes of recursive partitioning. Trees are powerful, yet susceptible to over-fitting. Strategies against overfitting have traditionally relied on pruni…

    Submitted 13 June, 2019; v1 submitted 29 August, 2017; originally announced August 2017.

  27. arXiv:1708.00085

    stat.ME

    Dynamic Variable Selection with Spike-and-Slab Process Priors

    Authors: Veronika Rockova, Kenichiro McAlinn

    Abstract: We address the problem of dynamic variable selection in time series regression with unknown residual variances, where the set of active predictors is allowed to evolve over time. To capture time-varying variable selection uncertainty, we introduce new dynamic shrinkage priors for the time series of regression coefficients. These priors are characterized by two main ingredients: smooth parameter ev…

    Submitted 21 September, 2019; v1 submitted 31 July, 2017; originally announced August 2017.

  28. arXiv:1708.00078

    math.ST

    Bayesian Dyadic Trees and Histograms for Regression

    Authors: Stephanie van der Pas, Veronika Rockova

    Abstract: Many machine learning tools for regression are based on recursive partitioning of the covariate space into smaller regions, where the regression function can be estimated locally. Among these, regression trees and their ensembles have demonstrated impressive empirical performance. In this work, we shed light on the machinery behind Bayesian variants of these methods. In particular, we study Bayesi…

    Submitted 28 November, 2017; v1 submitted 31 July, 2017; originally announced August 2017.

  29. Mortality Rate Estimation and Standardization for Public Reporting: Medicare's Hospital Compare

    Authors: E. I. George, V. Rockova, P. R. Rosenbaum, V. A. Satopaa, J. H. Silber

    Abstract: Bayesian models are increasingly fit to large administrative data sets and then used to make individualized recommendations. For instance, Medicare's Hospital Compare webpage provides information to patients about specific hospital mortality rates for a heart attack or Acute Myocardial Infarction (AMI). Hospital Compare's current recommendations are based on a random effects logit model with a rando…

    Submitted 31 March, 2018; v1 submitted 3 October, 2015; originally announced October 2015.

    Comments: Main paper: 31 pages, 7 figures, 4 tables Supplemental Material: 4 pages, 2 figures, 1 table

    Journal ref: Journal of the American Statistical Association (2017), 112:519, 933-947