Skip to main content

Showing 1–50 of 63 results for author: Polson, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.17058  [pdf, other

    stat.ME cs.LG

    Bayesian Deep ICE

    Authors: Jyotishka Datta, Nicholas G. Polson

    Abstract: Deep Independent Component Estimation (DICE) has many applications in modern day machine learning as a feature engineering extraction method. We provide a novel latent variable representation of independent component analysis that enables both point estimates via expectation-maximization (EM) and full posterior sampling via Markov Chain Monte Carlo (MCMC) algorithms. Our methodology also applies t… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    MSC Class: 62F15; 62H25; 68T07

  2. arXiv:2402.09583  [pdf, other

    stat.ME stat.CO

    Horseshoe Priors for Sparse Dirichlet-Multinomial Models

    Authors: Yuexi Wang, Nicholas G. Polson

    Abstract: Bayesian inference for Dirichlet-Multinomial (DM) models has a long and important history. The concentration parameter $α$ is pivotal in smoothing category probabilities within the multinomial distribution and is crucial for the inference afterward. Due to the lack of a tractable form of its marginal likelihood, $α$ is often chosen in an ad-hoc manner, or estimated using approximation algorithms.… ▽ More

    Submitted 11 March, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  3. arXiv:2310.06251  [pdf, other

    stat.ML cs.LG

    Deep Learning: A Tutorial

    Authors: Nick Polson, Vadim Sokolov

    Abstract: Our goal is to provide a review of deep learning methods which provide insight into structured high-dimensional data. Rather than using shallow additive architectures common to most statistical models, deep learning uses layers of semi-affine input transformations to provide a predictive rule. Applying these layers of transformations leads to a set of attributes (or, features) to which probabilist… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:1808.08618

  4. arXiv:2306.16096  [pdf, other

    stat.ME

    Generative Causal Inference

    Authors: Maria Nareklishvili, Nicholas Polson, Vadim Sokolov

    Abstract: In this paper we propose the use of the generative AI methods in Econometrics. Generative methods avoid the use of densities as done by MCMC. They directrix simulate large samples of observables and unobservable (parameters, latent variables) and then using high-dimensional deep learner to inform a nonlinear transport map from data to parameter inferences. Our themed apply to a wide verity or econ… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.14972

  5. arXiv:2305.14972  [pdf, other

    stat.CO

    Generative AI for Bayesian Computation

    Authors: Nicholas G. Polson, Vadim Sokolov

    Abstract: Bayesian Generative AI (BayesGen-AI) methods are developed and applied to Bayesian computation. BayesGen-AI reconstructs the posterior distribution by directly modeling the parameter of interest as a map** (a.k.a. deep learner) from a large simulated dataset. This provides a generator that we can evaluate at the observed data and provide draws from the posterior distribution. This method applies… ▽ More

    Submitted 24 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:2209.02163

  6. arXiv:2305.03158  [pdf, other

    stat.CO stat.ME

    Quantile Importance Sampling

    Authors: Jyotishka Datta, Nicholas G. Polson

    Abstract: In Bayesian inference, the approximation of integrals of the form $ψ= \mathbb{E}_{F}{l(X)} = \int_χ l(\mathbf{x}) d F(\mathbf{x})$ is a fundamental challenge. Such integrals are crucial for evidence estimation, which is important for various purposes, including model selection and numerical analysis. The existing strategies for evidence estimation are classified into four categories: deterministic… ▽ More

    Submitted 25 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    MSC Class: 65C05; 62F15

  7. arXiv:2208.09563  [pdf, other

    stat.AP math.PR

    On the Probability of Magnus Carlsen reaching 2900

    Authors: Sohan Bendre, Shiva Maharaj, Nick Polson, Vadim Sokolov

    Abstract: How likely is it that Magnus Carlsen will achieve an Elo rating of $2900$? This has been a goal of Magnus and is of great current interest to the chess community. Our paper uses probabilistic methods to address this question. The probabilistic properties of Elo's rating system have long been studied, and we provide an application of such methods. By applying a Brownian motion model of Stern as a s… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

  8. arXiv:2208.08068  [pdf, other

    stat.ML cs.LG math.QA stat.CO

    Quantum Bayesian Computation

    Authors: Nick Polson, Vadim Sokolov, Jianeng Xu

    Abstract: Quantum Bayesian Computation (QBC) is an emerging field that levers the computational gains available from quantum computers to provide an exponential speed-up in Bayesian computation. Our paper adds to the literature in two ways. First, we show how von Neumann quantum measurement can be used to simulate machine learning algorithms such as Markov chain Monte Carlo (MCMC) and Deep Learning (DL) tha… ▽ More

    Submitted 4 March, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

  9. arXiv:2207.02612  [pdf, other

    stat.ME

    Deep Partial Least Squares for Instrumental Variable Regression

    Authors: Maria Nareklishvili, Nicholas Polson, Vadim Sokolov

    Abstract: In this paper, we propose deep partial least squares for the estimation of high-dimensional nonlinear instrumental variable regression. As a precursor to a flexible deep neural network architecture, our methodology uses partial least squares for dimension reduction and feature selection from the set of instruments and covariates. A central theoretical result, due to Brillinger (2012) shows that th… ▽ More

    Submitted 2 June, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

  10. arXiv:2206.10014  [pdf, other

    q-fin.PR cs.LG q-fin.PM q-fin.RM stat.ML

    Deep Partial Least Squares for Empirical Asset Pricing

    Authors: Matthew F. Dixon, Nicholas G. Polson, Kemen Goicoechea

    Abstract: We use deep partial least squares (DPLS) to estimate an asset pricing model for individual stock returns that exploits conditioning information in a flexible and dynamic way while attributing excess returns to a small set of statistical risk factors. The novel contribution is to resolve the non-linear factor structure, thus advancing the current paradigm of deep learning in empirical asset pricing… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  11. arXiv:2204.14121  [pdf, other

    stat.ME

    Inverse Probability Weighting: from Survey Sampling to Evidence Estimation

    Authors: Jyotishka Datta, Nicholas Polson

    Abstract: We consider the class of inverse probability weight (IPW) estimators, including the popular Horvitz-Thompson and Hajek estimators used routinely in survey sampling, causal inference and evidence estimation for Bayesian computation. We focus on the 'weak paradoxes' for these estimators due to two counterexamples by Basu [1988] and Wasserman [2004] and investigate the two natural Bayesian answers to… ▽ More

    Submitted 14 November, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 25 pages, 4 figures. Added another simulation study and clarified the assumptions needed for the proof of consistency

    MSC Class: 62F15; 62F12; 62D05; 65C05

  12. arXiv:2110.11561  [pdf, other

    stat.ME cs.LG stat.ML

    Merging Two Cultures: Deep and Statistical Learning

    Authors: Anindya Bhadra, Jyotishka Datta, Nick Polson, Vadim Sokolov, Jianeng Xu

    Abstract: Merging the two cultures of deep and statistical learning provides insights into structured high-dimensional data. Traditional statistical modeling is still a dominant strategy for structured tabular data. Deep learning can be viewed through the lens of generalized linear models (GLMs) with composite link functions. Sufficient dimensionality reduction (SDR) and sparsity performs nonlinear feature… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2106.14085

  13. arXiv:2109.11602  [pdf, other

    cs.AI cs.LG stat.ML

    Chess AI: Competing Paradigms for Machine Intelligence

    Authors: Shiva Maharaj, Nick Polson, Alex Turk

    Abstract: Endgame studies have long served as a tool for testing human creativity and intelligence. We find that they can serve as a tool for testing machine ability as well. Two of the leading chess engines, Stockfish and Leela Chess Zero (LCZero), employ significantly different methods during play. We use Plaskett's Puzzle, a famous endgame study from the late 1970s, to compare the two engines. Our experi… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 15 pages, 8 figures

  14. arXiv:2106.14085  [pdf, other

    stat.ME

    Deep Learning Partial Least Squares

    Authors: Nicholas Polson, Vadim Sokolov, Jianeng Xu

    Abstract: High dimensional data reduction techniques are provided by using partial least squares within deep learning. Our framework provides a nonlinear extension of PLS together with a disciplined approach to feature selection and architecture design in deep learning. This leads to a statistical interpretation of deep learning that is tailor made for predictive problems. We can use the tools of PLS, such… ▽ More

    Submitted 26 June, 2021; originally announced June 2021.

  15. arXiv:2106.01906   

    stat.ME stat.CO stat.ML

    Bayesian Inference for Gamma Models

    Authors: **gyu He, Nicholas Polson, Jianeng Xu

    Abstract: We use the theory of normal variance-mean mixtures to derive a data augmentation scheme for models that include gamma functions. Our methodology applies to many situations in statistics and machine learning, including Multinomial-Dirichlet distributions, Negative binomial regression, Poisson-Gamma hierarchical models, Extreme value models, to name but a few. All of those models include a gamma fun… ▽ More

    Submitted 21 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Duplicate submission of arXiv:1905.12141 Please check arXiv:1905.12141 for future update

  16. arXiv:1905.12141  [pdf, other

    stat.ME stat.CO stat.ML

    Data Augementation with Polya Inverse Gamma

    Authors: **gyu He, Nicholas G. Polson, Jianeng Xu

    Abstract: We use the theory of normal variance-mean mixtures to derive a data augmentation scheme for models that include gamma functions. Our methodology applies to many situations in statistics and machine learning, including Multinomial-Dirichlet distributions, Negative binomial regression, Poisson-Gamma hierarchical models, Extreme value models, to name but a few. All of those models include a gamma fun… ▽ More

    Submitted 1 May, 2022; v1 submitted 28 May, 2019; originally announced May 2019.

  17. arXiv:1904.10939  [pdf, other

    stat.ME stat.ML

    Horseshoe Regularization for Machine Learning in Complex and Deep Models

    Authors: Anindya Bhadra, Jyotishka Datta, Yunfan Li, Nicholas G. Polson

    Abstract: Since the advent of the horseshoe priors for regularization, global-local shrinkage methods have proved to be a fertile ground for the development of Bayesian methodology in machine learning, specifically for high-dimensional regression and classification problems. They have achieved remarkable success in computation, and enjoy strong theoretical support. Most of the existing literature has focuse… ▽ More

    Submitted 22 November, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

  18. arXiv:1903.09668  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Data Augmentation for Bayesian Deep Learning

    Authors: Yuexi Wang, Nicholas G. Polson, Vadim O. Sokolov

    Abstract: Deep Learning (DL) methods have emerged as one of the most powerful tools for functional approximation and prediction. While the representation properties of DL have been well studied, uncertainty quantification remains challenging and largely unexplored. Data augmentation techniques are a natural approach to provide uncertainty quantification and to incorporate stochastic Monte Carlo search into… ▽ More

    Submitted 24 October, 2022; v1 submitted 22 March, 2019; originally announced March 2019.

  19. arXiv:1903.07677  [pdf, other

    stat.ML cs.LG stat.ME

    Deep Fundamental Factor Models

    Authors: Matthew F. Dixon, Nicholas G. Polson

    Abstract: Deep fundamental factor models are developed to automatically capture non-linearity and interaction effects in factor modeling. Uncertainty quantification provides interpretability with interval estimation, ranking of factor importances and estimation of interaction effects. With no hidden layers we recover a linear factor model and for one or more hidden layers, uncertainty bands for the sensitiv… ▽ More

    Submitted 27 August, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

    Journal ref: Forthcoming in SIAM J. Financial Mathematics, 2020

  20. arXiv:1902.06269  [pdf, other

    stat.ME

    Bayesian Regularization: From Tikhonov to Horseshoe

    Authors: Nicholas G. Polson, Vadim Sokolov

    Abstract: Bayesian regularization is a central tool in modern-day statistical and machine learning methods. Many applications involve high-dimensional sparse signal recovery problems. The goal of our paper is to provide a review of the literature on penalty-based regularization approaches, from Tikhonov (Ridge, Lasso) to horseshoe regularization.

    Submitted 17 February, 2019; originally announced February 2019.

  21. arXiv:1808.08618  [pdf, other

    cs.LG stat.CO stat.ML

    Deep Learning: Computational Aspects

    Authors: Nicholas Polson, Vadim Sokolov

    Abstract: In this article we review computational aspects of Deep Learning (DL). Deep learning uses network architectures consisting of hierarchical layers of latent variables to construct predictors for high-dimensional input-output models. Training a deep learning architecture is computationally intensive, and efficient linear algebra libraries is the key for training and inference. Stochastic gradient de… ▽ More

    Submitted 28 August, 2019; v1 submitted 26 August, 2018; originally announced August 2018.

  22. arXiv:1807.07987  [pdf, other

    stat.ML cs.LG

    Deep Learning

    Authors: Nicholas G. Polson, Vadim O. Sokolov

    Abstract: Deep learning (DL) is a high dimensional data reduction technique for constructing high-dimensional predictors in input-output models. DL is a form of machine learning that uses hierarchical layers of latent features. In this article, we review the state-of-the-art of deep learning from a modeling and algorithmic perspective. We provide a list of successful areas of applications in Artificial Inte… ▽ More

    Submitted 3 August, 2018; v1 submitted 20 July, 2018; originally announced July 2018.

    Comments: arXiv admin note: text overlap with arXiv:1602.06561

  23. arXiv:1805.01104  [pdf, other

    stat.ME

    Deep Learning in Characteristics-Sorted Factor Models

    Authors: Guanhao Feng, **gyu He, Nicholas G. Polson, Jianeng Xu

    Abstract: This paper presents an augmented deep factor model that generates latent factors for cross-sectional asset pricing. The conventional security sorting on firm characteristics for constructing long-short factor portfolio weights is nonlinear modeling, while factors are treated as inputs in linear models. We provide a structural deep learning framework to generalize the complete mechanism for fitting… ▽ More

    Submitted 19 July, 2023; v1 submitted 2 May, 2018; originally announced May 2018.

  24. arXiv:1804.09314  [pdf, other

    stat.ML cs.LG econ.EM

    Deep Learning for Predicting Asset Returns

    Authors: Guanhao Feng, **gyu He, Nicholas G. Polson

    Abstract: Deep learning searches for nonlinear factors for predicting asset returns. Predictability is achieved via multiple layers of composite factors as opposed to additive ones. Viewed in this way, asset pricing studies can be revisited using multi-layer deep learners, such as rectified linear units (ReLU) or long-short-term-memory (LSTM) for time-series effects. State-of-the-art algorithms including st… ▽ More

    Submitted 26 April, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

  25. arXiv:1803.09138  [pdf, ps, other

    stat.ML cs.LG

    Posterior Concentration for Sparse Deep Learning

    Authors: Nicholas Polson, Veronika Rockova

    Abstract: Spike-and-Slab Deep Learning (SS-DL) is a fully Bayesian alternative to Dropout for improving generalizability of deep ReLU networks. This new type of regularization enables provable recovery of smooth input-output maps with unknown levels of smoothness. Indeed, we show that the posterior distribution concentrates at the near minimax rate for $α$-Hölder smooth maps, performing as well as if we kne… ▽ More

    Submitted 24 March, 2018; originally announced March 2018.

  26. Weighted Bayesian Bootstrap for Scalable Bayes

    Authors: Michael Newton, Nicholas G. Polson, Jianeng Xu

    Abstract: We develop a weighted Bayesian Bootstrap (WBB) for machine learning and statistics. WBB provides uncertainty quantification by sampling from a high dimensional posterior distribution. WBB is computationally fast and scalable using only off-theshelf optimization software such as TensorFlow. We provide regularity conditions which apply to a wide range of machine learning and statistical models. We i… ▽ More

    Submitted 12 March, 2018; originally announced March 2018.

    Journal ref: Canadian Journal of Statistics 2020

  27. arXiv:1712.03889  [pdf, other

    stat.ME

    Statistical sparsity

    Authors: Peter McCullagh, Nicholas Polson

    Abstract: The main contribution of this paper is a mathematical definition of statistical sparsity, which is expressed as a limiting property of a sequence of probability distributions. The limit is characterized by an exceedance measure~$H$ and a rate parameter~$ρ> 0$, both of which are unrelated to sample size. The definition is sufficient to encompass all sparsity models that have been suggested in the s… ▽ More

    Submitted 23 May, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

    Comments: 21 pages, 6 figures, 1 table

  28. arXiv:1709.00379  [pdf, ps, other

    stat.ML

    Sparse Regularization in Marketing and Economics

    Authors: Guanhao Feng, Nicholas Polson, Yuexi Wang, Jianeng Xu

    Abstract: Sparse alpha-norm regularization has many data-rich applications in Marketing and Economics. Alpha-norm, in contrast to lasso and ridge regularization, jumps to a sparse solution. This feature is attractive for ultra high-dimensional problems that occur in demand estimation and forecasting. The alpha-norm objective is nonconvex and requires coordinate descent and proximal operators to find the spa… ▽ More

    Submitted 5 February, 2018; v1 submitted 1 September, 2017; originally announced September 2017.

  29. arXiv:1706.10179  [pdf, other

    stat.ME

    Lasso Meets Horseshoe : A Survey

    Authors: Anindya Bhadra, Jyotishka Datta, Nicholas G. Polson, Brandon T. Willard

    Abstract: The goal of this paper is to contrast and survey the major advances in two of the most commonly used high-dimensional techniques, namely, the Lasso and horseshoe regularization. Lasso is a gold standard for predictor selection while horseshoe is a state-of-the-art Bayesian estimator for sparse signals. Lasso is fast and scalable and uses convex optimization whilst the horseshoe is non-convex. Our… ▽ More

    Submitted 3 March, 2019; v1 submitted 30 June, 2017; originally announced June 2017.

    Comments: 32 pages, 4 figures

    MSC Class: Primary 62J07; 62J05; Secondary 62H15; 62F03

  30. arXiv:1706.00473  [pdf, other

    stat.ML cs.LG stat.ME

    Deep Learning: A Bayesian Perspective

    Authors: Nicholas Polson, Vadim Sokolov

    Abstract: Deep learning is a form of machine learning for nonlinear high dimensional pattern matching and prediction. By taking a Bayesian probabilistic perspective, we provide a number of insights into more efficient algorithms for optimisation and hyper-parameter tuning. Traditional high-dimensional data reduction techniques, such as principal component analysis (PCA), partial least squares (PLS), reduced… ▽ More

    Submitted 13 November, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

  31. arXiv:1706.00098  [pdf, ps, other

    stat.ML stat.CO

    Bayesian $l_0$-regularized Least Squares

    Authors: Nicholas G. Polson, Lei Sun

    Abstract: Bayesian $l_0$-regularized least squares is a variable selection technique for high dimensional predictors. The challenge is optimizing a non-convex objective function via search over model space consisting of all possible predictor combinations. Spike-and-slab (a.k.a. Bernoulli-Gaussian) priors are the gold standard for Bayesian variable selection, with a caveat of computational speed and scalabi… ▽ More

    Submitted 18 December, 2018; v1 submitted 31 May, 2017; originally announced June 2017.

    Comments: 22 pages, 6 figures, 1 table

    MSC Class: 62-04

  32. arXiv:1705.09851  [pdf, other

    stat.ML

    Deep Learning for Spatio-Temporal Modeling: Dynamic Traffic Flows and High Frequency Trading

    Authors: Matthew F. Dixon, Nicholas G. Polson, Vadim O. Sokolov

    Abstract: Deep learning applies hierarchical layers of hidden variables to construct nonlinear high dimensional predictors. Our goal is to develop and train deep learning architectures for spatio-temporal modeling. Training a deep architecture is achieved by stochastic gradient descent (SGD) and drop-out (DO) for parameter regularization with a goal of minimizing out-of-sample predictive mean squared error.… ▽ More

    Submitted 7 May, 2018; v1 submitted 27 May, 2017; originally announced May 2017.

  33. arXiv:1705.04141  [pdf, ps, other

    stat.ME

    From Least Squares to Signal Processing and Particle Filtering

    Authors: Nozer D. Singpurwalla, Nicholas G. Polson, Refik Soyer

    Abstract: De Facto, signal processing is the interpolation and extrapolation of a sequence of observations viewed as a realization of a stochastic process. Its role in applied statistics ranges from scenarios in forecasting and time series analysis, to image reconstruction, machine learning, and the degradation modeling for reliability assessment. A general solution to the problem of filtering and predictio… ▽ More

    Submitted 11 May, 2017; originally announced May 2017.

  34. arXiv:1702.07400  [pdf, other

    stat.ML stat.CO

    Horseshoe Regularization for Feature Subset Selection

    Authors: Anindya Bhadra, Jyotishka Datta, Nicholas G. Polson, Brandon Willard

    Abstract: Feature subset selection arises in many high-dimensional applications of statistics, such as compressed sensing and genomics. The $\ell_0$ penalty is ideal for this task, the caveat being it requires the NP-hard combinatorial evaluation of all models. A recent area of considerable interest is to develop efficient algorithms to fit models with a non-convex $\ell_γ$ penalty for $γ\in (0,1)$, which r… ▽ More

    Submitted 22 June, 2017; v1 submitted 23 February, 2017; originally announced February 2017.

  35. arXiv:1610.09750  [pdf, other

    stat.AP

    Sequential Bayesian Learning for Merton's Jump Model with Stochastic Volatility

    Authors: Eric Jacquier, Nicholas Polson, Vadim Sokolov

    Abstract: Jump stochastic volatility models are central to financial econometrics for volatility forecasting, portfolio risk management, and derivatives pricing. Markov Chain Monte Carlo (MCMC) algorithms are computationally unfeasible for the sequential learning of volatility state variables and parameters, whereby the investor must update all posterior and predictive densities as new information arrives.… ▽ More

    Submitted 30 October, 2016; originally announced October 2016.

  36. arXiv:1606.01701  [pdf, ps, other

    stat.ME

    Regularizing Bayesian Predictive Regressions

    Authors: Guanhao Feng, Nicholas G. Polson

    Abstract: We show that regularizing Bayesian predictive regressions provides a framework for prior sensitivity analysis. We develop a procedure that jointly regularizes expectations and variance-covariance matrices using a pair of shrinkage priors. Our methodology applies directly to vector autoregressions (VAR) and seemingly unrelated regressions (SUR). The regularization path provides a prior sensitivity… ▽ More

    Submitted 13 September, 2017; v1 submitted 6 June, 2016; originally announced June 2016.

  37. Deep Learning for Short-Term Traffic Flow Prediction

    Authors: Nicholas Polson, Vadim Sokolov

    Abstract: We develop a deep learning model to predict traffic flows. The main contribution is development of an architecture that combines a linear model that is fitted using $\ell_1$ regularization and a sequence of $\tanh$ layers. The challenge of predicting traffic flows are the sharp nonlinearities due to transitions between free flow, breakdown, recovery and congestion. We show that deep learning archi… ▽ More

    Submitted 27 February, 2017; v1 submitted 15 April, 2016; originally announced April 2016.

  38. The Market for English Premier League (EPL) Odds

    Authors: Guanhao Feng, Nicholas G. Polson, Jianeng Xu

    Abstract: This paper employs a Skellam process to represent real-time betting odds for English Premier League (EPL) soccer games. Given a matrix of market odds on all possible score outcomes, we estimate the expected scoring rates for each team. The expected scoring rates then define the implied volatility of an EPL game. As events in the game evolve, we re-estimate the expected scoring rates and our implie… ▽ More

    Submitted 5 January, 2017; v1 submitted 12 April, 2016; originally announced April 2016.

    Journal ref: Journal of Quantitative Analysis in Sports, 12.4 (2017): 167-178

  39. arXiv:1602.01445  [pdf, ps, other

    stat.ME stat.CO

    Sequential Bayesian Analysis of Multivariate Count Data

    Authors: Tevfik Aktekin, Nicholas G. Polson, Refik Soyer

    Abstract: We develop a new class of dynamic multivariate Poisson count models that allow for fast online updating and we refer to these models as multivariate Poisson-scaled beta (MPSB). The MPSB model allows for serial dependence in the counts as well as dependence across multiple series with a random common environment. Other notable features include analytic forms for state propagation and predictive lik… ▽ More

    Submitted 15 September, 2016; v1 submitted 3 February, 2016; originally announced February 2016.

    Comments: 31 pages, 9 figures

  40. arXiv:1511.06750  [pdf, other

    stat.ME

    A deconvolution path for mixtures

    Authors: Oscar Hernan Madrid Padilla, Nicholas G. Polson, James G. Scott

    Abstract: We propose a class of estimators for deconvolution in mixture models based on a simple two-step "bin-and-smooth" procedure applied to histogram counts. The method is both statistically and computationally efficient: by exploiting recent advances in convex optimization, we are able to provide a full deconvolution path that shows the estimate for the mixing distribution across a range of plausible d… ▽ More

    Submitted 25 May, 2017; v1 submitted 20 November, 2015; originally announced November 2015.

    Journal ref: Electronic Journal of Statistics Volume 12, Number 1 (2018), 1717-1751

  41. arXiv:1510.03516  [pdf, ps, other

    stat.ME

    Default Bayesian analysis with global-local shrinkage priors

    Authors: Anindya Bhadra, Jyotishka Datta, Nicholas G. Polson, Brandon T. Willard

    Abstract: We provide a framework for assessing the default nature of a prior distribution using the property of regular variation, which we study for global-local shrinkage priors. In particular, we demonstrate the horseshoe priors, originally designed to handle sparsity, also possess regular variation and thus are appropriate for default Bayesian analysis. To illustrate our methodology, we solve a problem… ▽ More

    Submitted 14 May, 2016; v1 submitted 12 October, 2015; originally announced October 2015.

    Comments: 28 pages, 7 figures, 6 tables

    MSC Class: 62C10; 62F15

  42. arXiv:1509.06061  [pdf, other

    stat.ML

    A Statistical Theory of Deep Learning via Proximal Splitting

    Authors: Nicholas G. Polson, Brandon T. Willard, Massoud Heidari

    Abstract: In this paper we develop a statistical theory and an implementation of deep learning models. We show that an elegant variable splitting scheme for the alternating direction method of multipliers optimises a deep learning objective. We allow for non-smooth non-convex regularisation penalties to induce sparsity in parameter weights. We provide a link between traditional shallow layer statistical mod… ▽ More

    Submitted 20 September, 2015; originally announced September 2015.

  43. arXiv:1502.03175  [pdf, other

    stat.ML cs.LG stat.ME

    Proximal Algorithms in Statistics and Machine Learning

    Authors: Nicholas G. Polson, James G. Scott, Brandon T. Willard

    Abstract: In this paper we develop proximal methods for statistical learning. Proximal point algorithms are useful in statistics and machine learning for obtaining optimization solutions for composite functions. Our approach exploits closed-form solutions of proximal operators and envelope representations based on the Moreau, Forward-Backward, Douglas-Rachford and Half-Quadratic envelopes. Envelope represen… ▽ More

    Submitted 30 May, 2015; v1 submitted 10 February, 2015; originally announced February 2015.

  44. Bayesian Particle Tracking of Traffic Flows

    Authors: Nicholas Polson, Vadim Sokolov

    Abstract: We develop a Bayesian particle filter for tracking traffic flows that is capable of capturing non-linearities and discontinuities present in flow dynamics. Our model includes a hidden state variable that captures sudden regime shifts between traffic free flow, breakdown and recovery. We develop an efficient particle learning algorithm for real time on-line inference of states and parameters. This… ▽ More

    Submitted 15 November, 2015; v1 submitted 18 November, 2014; originally announced November 2014.

    MSC Class: 60K35

  45. Bayesian analysis of traffic flow on interstate I-55: The LWR model

    Authors: Nicholas Polson, Vadim Sokolov

    Abstract: Transportation departments take actions to manage traffic flow and reduce travel times based on estimated current and projected traffic conditions. Travel time estimates and forecasts require information on traffic density which are combined with a model to project traffic flow such as the Lighthill-Whitham-Richards (LWR) model. We develop a particle filtering and learning algorithm to estimate th… ▽ More

    Submitted 29 January, 2016; v1 submitted 21 September, 2014; originally announced September 2014.

    Comments: Published at http://dx.doi.org/10.1214/15-AOAS853 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS853

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 4, 1864-1888

  46. arXiv:1409.3601  [pdf, other

    stat.CO math.ST

    Vertical-likelihood Monte Carlo

    Authors: Nicholas G. Polson, James G. Scott

    Abstract: In this review, we address the use of Monte Carlo methods for approximating definite integrals of the form $Z = \int L(x) d P(x)$, where $L$ is a target function (often a likelihood) and $P$ a finite measure. We present vertical-likelihood Monte Carlo, which is an approach for designing the importance function $g(x)$ used in importance sampling. Our approach exploits a duality between two random v… ▽ More

    Submitted 23 June, 2015; v1 submitted 11 September, 2014; originally announced September 2014.

  47. arXiv:1406.0177  [pdf, other

    stat.ME

    Mixtures, envelopes, and hierarchical duality

    Authors: Nicholas G. Polson, James G. Scott

    Abstract: We develop a connection between mixture and envelope representations of objective functions that arise frequently in statistics. We refer to this connection using the term "hierarchical duality." Our results suggest an interesting and previously under-exploited relationship between marginalization and profiling, or equivalently between the Fenchel--Moreau theorem for convex functions and the Berns… ▽ More

    Submitted 22 February, 2015; v1 submitted 1 June, 2014; originally announced June 2014.

  48. arXiv:1405.0506  [pdf, other

    stat.CO

    Sampling Polya-Gamma random variates: alternate and approximate techniques

    Authors: Jesse Windle, Nicholas G. Polson, James G. Scott

    Abstract: Efficiently sampling from the Pólya-Gamma distribution, ${PG}(b,z)$, is an essential element of Pólya-Gamma data augmentation. Polson et. al (2013) show how to efficiently sample from the ${PG}(1,z)$ distribution. We build two new samplers that offer improved performance when sampling from the ${PG}(b,z)$ distribution and $b$ is not unity.

    Submitted 2 May, 2014; originally announced May 2014.

  49. arXiv:1212.2135  [pdf, other

    math.OC stat.CO

    Optimisation via Slice Sampling

    Authors: John R. Birge, Nicholas G. Polson

    Abstract: In this paper, we develop a simulation-based approach to optimisation with multi-modal functions using slice sampling. Our method specifies the objective function as an energy potential in a Boltzmann distribution and then we use auxiliary exponential slice variables to provide samples for a variety of energy levels. Our slice sampler draws uniformly over the augmented slice region. We identify th… ▽ More

    Submitted 10 December, 2012; originally announced December 2012.

    Comments: 22 pages, 6 figures

    MSC Class: 46N10

  50. arXiv:1212.0534  [pdf, other

    stat.CO

    Split Sampling: Expectations, Normalisation and Rare Events

    Authors: John R. Birge, Changgee Chang, Nicholas G. Polson

    Abstract: In this paper we develop a methodology that we call split sampling methods to estimate high dimensional expectations and rare event probabilities. Split sampling uses an auxiliary variable MCMC simulation and expresses the expectation of interest as an integrated set of rare event probabilities. We derive our estimator from a Rao-Blackwellised estimate of a marginal auxiliary variable distribution… ▽ More

    Submitted 31 October, 2013; v1 submitted 3 December, 2012; originally announced December 2012.

    MSC Class: 65C05; 65C40; 65C60