Skip to main content

Showing 1–18 of 18 results for author: Liu, J S

Searching in archive math. Search in all archives.
.
  1. arXiv:2309.16855  [pdf, other

    stat.ME math.ST

    A Variational Spike-and-Slab Approach for Group Variable Selection

    Authors: Buyu Lin, Changhao Ge, Jun S. Liu

    Abstract: We introduce a class of generic spike-and-slab priors for high-dimensional linear regression with grouped variables and present a Coordinate-ascent Variational Inference (CAVI) algorithm for obtaining an optimal variational Bayes approximation. Using parameter expansion for a specific, yet comprehensive, family of slab distributions, we obtain a further gain in computational efficiency. The method… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 64 pages, 6 figures

  2. arXiv:2307.02777  [pdf, other

    math.ST

    On the Optimality of Functional Sliced Inverse Regression

    Authors: Rui Chen, Songtao Tian, Dongming Huang, Qian Lin, Jun S. Liu

    Abstract: In this paper, we prove that functional sliced inverse regression (FSIR) achieves the optimal (minimax) rate for estimating the central space in functional sufficient dimension reduction problems. First, we provide a concentration inequality for the FSIR estimator of the covariance of the conditional mean, i.e., $\var(\E[\boldsymbol{X}\mid Y])$. Based on this inequality, we establish the root-$n$… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

  3. arXiv:2112.08641  [pdf, ps, other

    stat.CO math.PR

    On Gibbs Sampling for Structured Bayesian Models Discussion of paper by Zanella and Roberts

    Authors: Xiaodong Yang, Jun S. Liu

    Abstract: This article is a discussion of Zanella and Roberts' paper: Multilevel linear models, gibbs samplers and multigrid decompositions. We consider several extensions in which the multigrid decomposition would bring us interesting insights, including vector hierarchical models, linear mixed effects models and partial centering parametrizations.

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: 18 pages

  4. arXiv:2111.15084  [pdf, other

    stat.CO math.ST

    Convergence Rate of Multiple-try Metropolis Independent sampler

    Authors: Xiaodong Yang, Jun S. Liu

    Abstract: The Multiple-try Metropolis (MTM) method is an interesting extension of the classical Metropolis-Hastings algorithm. However, theoretical understandings of its convergence behavior as well as whether and how it may help are still unknown. This paper derives the exact convergence rate for Multiple-try Metropolis Independent sampler (MTM-IS) via an explicit eigen analysis. As a by-product, we prove… ▽ More

    Submitted 3 February, 2023; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: 34 pages; 7 figures

  5. arXiv:2010.08132  [pdf, other

    math.ST

    Power of Knockoff: The Impact of Ranking Algorithm, Augmented Design, and Symmetric Statistic

    Authors: Zheng Tracy Ke, Jun S. Liu, Yucong Ma

    Abstract: The knockoff filter is a recent false discovery rate (FDR) control method for high-dimensional linear models. We point out that knockoff has three key components: ranking algorithm, augmented design, and symmetric statistic, and each component admits multiple choices. By considering various combinations of the three components, we obtain a collection of variants of knockoff. All these variants gua… ▽ More

    Submitted 13 February, 2024; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 67 pages, 13 figures

    Journal ref: Journal of Machine Learning Research, 2024

  6. arXiv:2004.01975  [pdf, other

    stat.ME math.ST stat.CO

    Stratification and Optimal Resampling for Sequential Monte Carlo

    Authors: Yichao Li, Wenshuo Wang, Ke Deng, Jun S Liu

    Abstract: Sequential Monte Carlo (SMC), also known as particle filters, has been widely accepted as a powerful computational tool for making inference with dynamical systems. A key step in SMC is resampling, which plays the role of steering the algorithm towards the future dynamics. Several strategies have been proposed and used in practice, including multinomial resampling, residual resampling (Liu and Che… ▽ More

    Submitted 7 December, 2020; v1 submitted 4 April, 2020; originally announced April 2020.

  7. arXiv:1911.02171  [pdf, other

    stat.ME math.ST stat.AP stat.ML

    Minimax Nonparametric Two-sample Test under Smoothing

    Authors: Xin Xing, Zuofeng Shang, Pang Du, ** Ma, Wenxuan Zhong, Jun S. Liu

    Abstract: We consider the problem of comparing probability densities between two groups. A new probabilistic tensor product smoothing spline framework is developed to model the joint density of two variables. Under such a framework, the probability density comparison is equivalent to testing the presence/absence of interactions. We propose a penalized likelihood ratio test for such interaction testing and s… ▽ More

    Submitted 11 January, 2021; v1 submitted 5 November, 2019; originally announced November 2019.

  8. arXiv:1907.11985  [pdf, other

    stat.CO cs.LG math.OC

    The Wang-Landau Algorithm as Stochastic Optimization and Its Acceleration

    Authors: Chenguang Dai, Jun S. Liu

    Abstract: We show that the Wang-Landau algorithm can be formulated as a stochastic gradient descent algorithm minimizing a smooth and convex objective function, of which the gradient is estimated using Markov chain Monte Carlo iterations. The optimization formulation provides us a new way to establish the convergence rate of the Wang-Landau algorithm, by exploiting the fact that almost surely, the density e… ▽ More

    Submitted 2 February, 2020; v1 submitted 27 July, 2019; originally announced July 2019.

    Comments: 10 pages, 3 figures

    Journal ref: Phys. Rev. E 101, 033301 (2020)

  9. arXiv:1805.01820  [pdf, ps, other

    math.ST

    Global testing under the sparse alternatives for single index models

    Authors: Qian Lin, Zhigen Zhao, Jun S. Liu

    Abstract: For the single index model $y=f(β^τx,ε)$ with Gaussian design, %satisfying that rank $var(\mathbb{E}[x\mid y])=1$ where $f$ is unknown and $β$ is a sparse $p$-dimensional unit vector with at most $s$ nonzero entries, we are interested in testing the null hypothesis that $β$, when viewed as a whole vector, is zero against the alternative that some entries of $β$ is nonzero. Assuming that… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: 22 pages, 4 figures

  10. arXiv:1701.06009  [pdf, other

    math.ST

    On the optimality of sliced inverse regression in high dimensions

    Authors: Qian Lin, Xinran Li, Dongming Huang, Jun S. Liu

    Abstract: The central subspace of a pair of random variables $(y,x) \in \mathbb{R}^{p+1}$ is the minimal subspace $\mathcal{S}$ such that $y \perp \hspace{-2mm} \perp x\mid P_{\mathcal{S}}x$. In this paper, we consider the minimax rate of estimating the central space of the multiple index models $y=f(β_{1}^τx,β_{2}^τx,...,β_{d}^τx,ε)$ with at most $s$ active predictors where $x \sim N(0,I_{p})$. We first in… ▽ More

    Submitted 23 January, 2017; v1 submitted 21 January, 2017; originally announced January 2017.

    Comments: 40 pages, 2 figures

  11. arXiv:1611.06655  [pdf, ps, other

    math.ST stat.ME

    Sparse Sliced Inverse Regression Via Lasso

    Authors: Qian Lin, Zhigen Zhao, Jun S. Liu

    Abstract: For multiple index models, it has recently been shown that the sliced inverse regression (SIR) is consistent for estimating the sufficient dimension reduction (SDR) space if and only if $ρ=\lim\frac{p}{n}=0$, where $p$ is the dimension and $n$ is the sample size. Thus, when $p$ is of the same or a higher order of $n$, additional assumptions such as sparsity must be imposed in order to ensure consi… ▽ More

    Submitted 17 June, 2018; v1 submitted 21 November, 2016; originally announced November 2016.

    Comments: 41 pages, 2 figures

    MSC Class: 62J02 (Primary); 62H25 (Secondary)

  12. arXiv:1511.08102  [pdf, other

    math.ST stat.ML

    L1-Regularized Least Squares for Support Recovery of High Dimensional Single Index Models with Gaussian Designs

    Authors: Matey Neykov, Jun S. Liu, Tianxi Cai

    Abstract: It is known that for a certain class of single index models (SIMs) $Y = f(\boldsymbol{X}_{p \times 1}^\intercal\boldsymbolβ_0, \varepsilon)$, support recovery is impossible when $\boldsymbol{X} \sim \mathcal{N}(0, \mathbb{I}_{p \times p})$ and a model complexity adjusted sample size is below a critical threshold. Recently, optimal algorithms based on Sliced Inverse Regression (SIR) were suggested.… ▽ More

    Submitted 22 June, 2016; v1 submitted 25 November, 2015; originally announced November 2015.

    Comments: 36 pages; 6 figures; typos corrected; clearer notation introduced

  13. arXiv:1511.02270  [pdf, other

    math.ST stat.ML

    Signed Support Recovery for Single Index Models in High-Dimensions

    Authors: Matey Neykov, Qian Lin, Jun S. Liu

    Abstract: In this paper we study the support recovery problem for single index models $Y=f(\boldsymbol{X}^{\intercal} \boldsymbolβ,\varepsilon)$, where $f$ is an unknown link function, $\boldsymbol{X}\sim N_p(0,\mathbb{I}_{p})$ and $\boldsymbolβ$ is an $s$-sparse unit vector such that $\boldsymbolβ_{i}\in \{\pm\frac{1}{\sqrt{s}},0\}$. In particular, we look into the performance of two computationally inexpe… ▽ More

    Submitted 22 June, 2016; v1 submitted 6 November, 2015; originally announced November 2015.

    Comments: 38 pages, 7 figures; 1 table; data set analysis added; typos corrected

  14. arXiv:1510.08986  [pdf, other

    math.ST stat.ME stat.ML

    A Unified Theory of Confidence Regions and Testing for High Dimensional Estimating Equations

    Authors: Matey Neykov, Yang Ning, Jun S. Liu, Han Liu

    Abstract: We propose a new inferential framework for constructing confidence regions and testing hypotheses in statistical models specified by a system of high dimensional estimating equations. We construct an influence function by projecting the fitted estimating equations to a sparse direction obtained by solving a large-scale linear program. Our main theoretical contribution is to establish a unified Z-e… ▽ More

    Submitted 22 June, 2016; v1 submitted 30 October, 2015; originally announced October 2015.

    Comments: 67 pages, 2 tables, 1 figure

  15. arXiv:1507.03895  [pdf, ps, other

    math.ST

    On consistency and sparsity for sliced inverse regression in high dimensions

    Authors: Qian Lin, Zhigen Zhao, Jun S. Liu

    Abstract: We provide here a framework to analyze the phase transition phenomenon of slice inverse regression (SIR), a supervised dimension reduction technique introduced by \cite{Li:1991}. Under mild conditions, the asymptotic ratio $ρ= \lim p/n$ is the phase transition parameter and the SIR estimator is consistent if and only if $ρ= 0$. When dimension $p$ is greater than $n$, we propose a diagonal threshol… ▽ More

    Submitted 21 November, 2016; v1 submitted 14 July, 2015; originally announced July 2015.

    Comments: 49 pages, 4 figures

    MSC Class: 62J02 (Primary); 62H25 (Secondary)

  16. arXiv:1005.5483  [pdf, ps, other

    math.ST stat.ME

    Model Selection Principles in Misspecified Models

    Authors: **chi Lv, Jun S. Liu

    Abstract: Model selection is of fundamental importance to high dimensional modeling featured in many contemporary applications. Classical principles of model selection include the Kullback-Leibler divergence principle and the Bayesian principle, which lead to the Akaike information criterion and Bayesian information criterion when models are correctly specified. Yet model misspecification is unavoidable whe… ▽ More

    Submitted 11 May, 2016; v1 submitted 29 May, 2010; originally announced May 2010.

    Comments: 25 pages, 6 tables

    MSC Class: 62J12(Primary); 62B10; 62F07; 62F15; 62J07(Secondary)

    Journal ref: Journal of the Royal Statistical Society Series B 76, 141-167 (2014)

  17. Discussion of "Equi-energy sampler" by Kou, Zhou and Wong

    Authors: Yves F. Atchadé, Jun S. Liu

    Abstract: We congratulate Samuel Kou, Qing Zhou and Wing Wong [math.ST/0507080] (referred to subsequently as KZW) for this beautifully written paper, which opens a new direction in Monte Carlo computation. This discussion has two parts. First, we describe a very closely related method, multicanonical sampling (MCS), and report a simulation example that compares the equi-energy (EE) sampler with MCS. Overa… ▽ More

    Submitted 8 November, 2006; originally announced November 2006.

    Comments: Published at http://dx.doi.org/10.1214/009053606000000489 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0088B

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 4, 1620-1628

  18. arXiv:math/0610655  [pdf, ps, other

    math.ST q-bio.GN stat.ME

    Bayesian Clustering of Transcription Factor Binding Motifs

    Authors: Shane T. Jensen, Jun S. Liu

    Abstract: Genes are often regulated in living cells by proteins called transcription factors (TFs) that bind directly to short segments of DNA in close proximity to specific genes. These binding sites have a conserved nucleotide appearance, which is called a motif. Several recent studies of transcriptional regulation require the reduction of a large collection of motifs into clusters based on the similari… ▽ More

    Submitted 21 October, 2006; originally announced October 2006.

    Comments: Submitted to the Journal of the American Statistical Association