Skip to main content

Showing 1–15 of 15 results for author: Tan, L S L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2306.02813  [pdf, other

    stat.ME

    Variational inference based on a subclass of closed skew normals

    Authors: Linda S. L. Tan

    Abstract: Gaussian distributions are widely used in Bayesian variational inference to approximate intractable posterior densities, but the ability to accommodate skewness can improve approximation accuracy significantly, especially when data or prior information is scarce. We study the properties of a subclass of closed skew normals constructed using affine transformation of independent standardized univari… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: keywords: Closed skew normal; Gaussian variational approximation; natural gradient; centered parametrization; LU decomposition

  2. arXiv:2210.10566  [pdf, other

    stat.ME

    Second order stochastic gradient update for Cholesky factor in Gaussian variational approximation from Stein's Lemma

    Authors: Linda S. L. Tan

    Abstract: In stochastic variational inference, use of the reparametrization trick for the multivariate Gaussian gives rise to efficient updates for the mean and Cholesky factor of the covariance matrix, which depend on the first order derivative of the log joint model density. In this article, we show that an alternative unbiased gradient estimate for the Cholesky factor which depends on the second order de… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 15 pages, 2 figures

  3. arXiv:2109.00375  [pdf, other

    stat.CO

    Analytic natural gradient updates for Cholesky factor in Gaussian variational approximation

    Authors: Linda S. L. Tan

    Abstract: Natural gradients can improve convergence in stochastic variational inference significantly but inverting the Fisher information matrix is daunting in high dimensions. Moreover, in Gaussian variational approximation, natural gradient updates of the precision matrix do not ensure positive definiteness. To tackle this issue, we derive analytic natural gradient updates of the Cholesky factor of the c… ▽ More

    Submitted 19 May, 2024; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: 47 pages, 10 figures

  4. arXiv:1904.09591  [pdf, other

    stat.CO

    Conditionally structured variational Gaussian approximation with importance weights

    Authors: Linda S. L. Tan, Aishwarya Bhaskaran, David J. Nott

    Abstract: We develop flexible methods of deriving variational inference for models with complex latent variable structure. By splitting the variables in these models into "global" parameters and "local" latent variables, we define a class of variational approximations that exploit this partitioning and go beyond Gaussian variational approximation. This approximation is motivated by the fact that in many hie… ▽ More

    Submitted 21 April, 2019; originally announced April 2019.

    Comments: 18 pages, 7 figures

  5. arXiv:1811.04249  [pdf, other

    stat.CO

    Bayesian variational inference for exponential random graph models

    Authors: Linda S. L. Tan, Nial Friel

    Abstract: Deriving Bayesian inference for exponential random graph models (ERGMs) is a challenging "doubly intractable" problem as the normalizing constants of the likelihood and posterior density are both intractable. Markov chain Monte Carlo (MCMC) methods which yield Bayesian inference for ERGMs, such as the exchange algorithm, are asymptotically exact but computationally intensive, as a network has to b… ▽ More

    Submitted 23 November, 2019; v1 submitted 10 November, 2018; originally announced November 2018.

    Comments: 45 pages

  6. arXiv:1805.07267  [pdf, ps, other

    stat.ME

    Use of model reparametrization to improve variational Bayes

    Authors: Linda S. L. Tan

    Abstract: We propose using model reparametrization to improve variational Bayes inference for hierarchical models whose variables can be classified as global (shared across observations) or local (observation specific). Posterior dependence between local and global variables is minimized by applying an invertible affine transformation on the local variables. The functional form of this transformation is ded… ▽ More

    Submitted 7 March, 2020; v1 submitted 18 May, 2018; originally announced May 2018.

    Journal ref: JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020

  7. arXiv:1712.08887  [pdf, other

    stat.ME

    Efficient data augmentation techniques for some classes of state space models

    Authors: Linda S. L. Tan

    Abstract: Data augmentation improves the convergence of iterative algorithms, such as the EM algorithm and Gibbs sampler by introducing carefully designed latent variables. In this article, we first propose a data augmentation scheme for the first-order autoregression plus noise model, where optimal values of working parameters introduced for recentering and rescaling of the latent states, can be derived an… ▽ More

    Submitted 4 July, 2022; v1 submitted 24 December, 2017; originally announced December 2017.

    Comments: Keywords: Data augmentation, State space model, Stochastic volatility model, EM algorithm, Reparametrization, Markov chain Monte Carlo, Ancillarity-sufficiency interweaving strategy

  8. Dynamic degree-corrected blockmodels for social networks: a nonparametric approach

    Authors: Linda S. L. Tan, Maria De Iorio

    Abstract: A nonparametric approach to the modeling of social networks using degree-corrected stochastic blockmodels is proposed. The model for static network consists of a stochastic blockmodel using a probit regression formulation and popularity parameters are incorporated to account for degree heterogeneity. Dirichlet processes are used to detect community structure as well as induce clustering in the pop… ▽ More

    Submitted 25 May, 2017; originally announced May 2017.

    Journal ref: Statistical Modelling (2019), 19, 386-411

  9. Gaussian variational approximation with sparse precision matrices

    Authors: Linda S. L. Tan, David J. Nott

    Abstract: We consider the problem of learning a Gaussian variational approximation to the posterior distribution for a high-dimensional parameter, where we impose sparsity in the precision matrix to reflect appropriate conditional independence structure in the model. Incorporating sparsity in the precision matrix allows the Gaussian variational distribution to be both flexible and parsimonious, and the spar… ▽ More

    Submitted 12 April, 2017; v1 submitted 18 May, 2016; originally announced May 2016.

    Comments: 18 pages, 9 figures

    Journal ref: Statistics and Computing 28 (2018) 259-275

  10. Bayesian inference for multiple Gaussian graphical models with application to metabolic association networks

    Authors: Linda S. L. Tan, Ajay Jasra, Maria De Iorio, Timothy M. D. Ebbels

    Abstract: We investigate the effect of cadmium (a toxic environmental pollutant) on the correlation structure of a number of urinary metabolites using Gaussian graphical models (GGMs). The inferred metabolic associations can provide important information on the physiological state of a metabolic system and insights on complex metabolic relationships. Using the fitted GGMs, we construct differential networks… ▽ More

    Submitted 13 April, 2017; v1 submitted 21 March, 2016; originally announced March 2016.

    Journal ref: Ann. Appl. Stat. 11 (2017) 2222-2251

  11. arXiv:1502.07190  [pdf, other

    stat.ML cs.LG

    Topic-adjusted visibility metric for scientific articles

    Authors: Linda S. L. Tan, Aik Hui Chan, Tian Zheng

    Abstract: Measuring the impact of scientific articles is important for evaluating the research output of individual scientists, academic institutions and journals. While citations are raw data for constructing impact measures, there exist biases and potential issues if factors affecting citation patterns are not properly accounted for. In this work, we address the problem of field variation and introduce an… ▽ More

    Submitted 16 October, 2015; v1 submitted 25 February, 2015; originally announced February 2015.

    Journal ref: Annals of Applied Statistics, Volume 10, Number 1 (2016), 1-31

  12. Stochastic variational inference for large-scale discrete choice models using adaptive batch sizes

    Authors: Linda S. L. Tan

    Abstract: Discrete choice models describe the choices made by decision makers among alternatives and play an important role in transportation planning, marketing research and other applications. The mixed multinomial logit (MMNL) model is a popular discrete choice model that captures heterogeneity in the preferences of decision makers through random coefficients. While Markov chain Monte Carlo methods provi… ▽ More

    Submitted 8 October, 2015; v1 submitted 21 May, 2014; originally announced May 2014.

    Journal ref: Statistics and Computing (2017) 27 pp 237-257

  13. Variational inference for sparse spectrum Gaussian process regression

    Authors: Linda S. L. Tan, Victor M. H. Ong, David J. Nott, Ajay Jasra

    Abstract: We develop a fast variational approximation scheme for Gaussian process (GP) regression, where the spectrum of the covariance function is subjected to a sparse approximation. Our approach enables uncertainty in covariance function hyperparameters to be treated without using Monte Carlo methods and is robust to overfitting. Our article makes three contributions. First, we present a variational Baye… ▽ More

    Submitted 26 January, 2015; v1 submitted 9 June, 2013; originally announced June 2013.

    Comments: 20 pages, 11 figures, 1 table

    Journal ref: Statistics and Computing (2016) 26 pp 1243-1261

  14. A stochastic variational framework for fitting and diagnosing generalized linear mixed models

    Authors: Linda S. L. Tan, David J. Nott

    Abstract: In stochastic variational inference, the variational Bayes objective function is optimized using stochastic gradient approximation, where gradients computed on small random subsets of data are used to approximate the true gradient over the whole data set. This enables complex models to be fit to large data sets as data can be processed in mini-batches. In this article, we extend stochastic variati… ▽ More

    Submitted 28 March, 2014; v1 submitted 24 August, 2012; originally announced August 2012.

    Comments: 42 pages, 13 figures, 9 tables

    Journal ref: Bayesian Analysis (2014), 9, 963-1004

  15. arXiv:1205.3906  [pdf, ps, other

    stat.CO stat.ME

    Variational Inference for Generalized Linear Mixed Models Using Partially Noncentered Parametrizations

    Authors: Linda S. L. Tan, David J. Nott

    Abstract: The effects of different parametrizations on the convergence of Bayesian computational algorithms for hierarchical models are well explored. Techniques such as centering, noncentering and partial noncentering can be used to accelerate convergence in MCMC and EM algorithms but are still not well studied for variational Bayes (VB) methods. As a fast deterministic approach to posterior approximation,… ▽ More

    Submitted 11 June, 2013; v1 submitted 17 May, 2012; originally announced May 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-STS418 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS418

    Journal ref: Statistical Science 2013, Vol. 28, No. 2, 168-188