Skip to main content

Showing 1–44 of 44 results for author: Lv, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.04317  [pdf, other

    stat.ML cs.LG q-bio.QM

    DeepLINK-T: deep learning inference for time series data using knockoffs and LSTM

    Authors: Wenxuan Zuo, Zifan Zhu, Yuxuan Du, Yi-Chun Yeh, Jed A. Fuhrman, **chi Lv, Yingying Fan, Fengzhu Sun

    Abstract: High-dimensional longitudinal time series data is prevalent across various real-world applications. Many such applications can be modeled as regression problems with high-dimensional time series covariates. Deep learning has been a popular and powerful tool for fitting these regression models. Yet, the development of interpretable and reproducible deep-learning models is challenging and remains un… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  2. arXiv:2309.15032  [pdf, other

    stat.ME math.ST stat.ML

    SOFARI: High-Dimensional Manifold-Based Inference

    Authors: Zemin Zheng, Xin Zhou, Yingying Fan, **chi Lv

    Abstract: Multi-task learning is a widely used technique for harnessing information from various tasks. Recently, the sparse orthogonal factor regression (SOFAR) framework, based on the sparse singular value decomposition (SVD) within the coefficient matrix, was introduced for interpretable multi-task learning, enabling the discovery of meaningful latent feature-response association networks across differen… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 114 pages, 2 figures

  3. arXiv:2307.04400  [pdf, ps, other

    stat.ME math.ST stat.ML

    ARK: Robust Knockoffs Inference with Coupling

    Authors: Yingying Fan, Lan Gao, **chi Lv

    Abstract: We investigate the robustness of the model-X knockoffs framework with respect to the misspecified or estimated feature distribution. We achieve such a goal by theoretically studying the feature selection performance of a practically implemented knockoffs algorithm, which we name as the approximate knockoffs (ARK) procedure, under the measures of the false discovery rate (FDR) and $k$-familywise er… ▽ More

    Submitted 4 June, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 105 pages

  4. arXiv:2211.00128  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    SIMPLE-RC: Group Network Inference with Non-Sharp Nulls and Weak Signals

    Authors: Jianqing Fan, Yingying Fan, **chi Lv, Fan Yang

    Abstract: Large-scale network inference with uncertainty quantification has important applications in natural, social, and medical sciences. The recent work of Fan, Fan, Han and Lv (2022) introduced a general framework of statistical inference on membership profiles in large networks (SIMPLE) for testing the sharp null hypothesis that a pair of given nodes share the same membership profiles. In real applica… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 71 pages, 4 figures

  5. arXiv:2207.01678  [pdf, other

    stat.ML cs.LG math.ST

    FACT: High-Dimensional Random Forests Inference

    Authors: Chien-Ming Chi, Yingying Fan, **chi Lv

    Abstract: Quantifying the usefulness of individual features in random forests learning can greatly enhance its interpretability. Existing studies have shown that some popularly used feature importance measures for random forests suffer from the bias issue. In addition, there lack comprehensive size and power analyses for most of these existing methods. In this paper, we approach the problem via hypothesis t… ▽ More

    Submitted 12 November, 2023; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: 42 pages, 3 figures

  6. arXiv:2204.04567  [pdf, other

    cs.CV cs.LG stat.ML

    Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification

    Authors: Jiangtao Xie, Fei Long, Jiaming Lv, Qilong Wang, Peihua Li

    Abstract: Few-shot classification is a challenging problem as only very few training examples are given for each new task. One of the effective research lines to address this challenge focuses on learning deep representations driven by a similarity measure between a query image and few support images of some class. Statistically, this amounts to measure the dependency of image features, viewed as random vec… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022 as an oral presentation. Equal contribution from first two authors

  7. arXiv:2112.09851  [pdf, other

    stat.ME math.ST

    High-Dimensional Knockoffs Inference for Time Series Data

    Authors: Chien-Ming Chi, Yingying Fan, Ching-Kang Ing, **chi Lv

    Abstract: The model-X knockoffs framework provides a flexible tool for achieving finite-sample false discovery rate (FDR) control in variable selection in arbitrary dimensions without assuming any dependence structure of the response on covariates. It also completely bypasses the use of conventional p-values, making it especially appealing in high-dimensional nonlinear models. Existing works have focused on… ▽ More

    Submitted 19 May, 2023; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: 65 pages, 4 figures

    MSC Class: 62P20 ACM Class: A.0

  8. arXiv:2112.01574  [pdf, other

    stat.ML cs.LG math.ST

    Dimension-Free Average Treatment Effect Inference with Deep Neural Networks

    Authors: Xinze Du, Yingying Fan, **chi Lv, Tianshu Sun, Patrick Vossler

    Abstract: This paper investigates the estimation and inference of the average treatment effect (ATE) using deep neural networks (DNNs) in the potential outcomes framework. Under some regularity conditions, the observed response can be formulated as the response of a mean regression problem with both the confounding variables and the treatment indicator as the independent variables. Using such formulation, w… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: 56 pages, 22 figures

  9. arXiv:2009.08607  [pdf, ps, other

    cs.LG stat.ML

    Compact Learning for Multi-Label Classification

    Authors: Jiaqi Lv, Tianran Wu, Chenglun Peng, Yunpeng Liu, Ning Xu, Xin Geng

    Abstract: Multi-label classification (MLC) studies the problem where each instance is associated with multiple relevant labels, which leads to the exponential growth of output space. MLC encourages a popular framework named label compression (LC) for capturing label dependency with dimension reduction. Nevertheless, most existing LC methods failed to consider the influence of the feature space or misguided… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

  10. arXiv:2009.03816  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    PSO-PS: Parameter Synchronization with Particle Swarm Optimization for Distributed Training of Deep Neural Networks

    Authors: Qing Ye, Yuxuan Han, Yanan sun, JIancheng Lv

    Abstract: Parameter updating is an important stage in parallelism-based distributed deep learning. Synchronous methods are widely used in distributed training the Deep Neural Networks (DNNs). To reduce the communication and synchronization overhead of synchronous methods, decreasing the synchronization frequency (e.g., every $n$ mini-batches) is a straightforward approach. However, it often suffers from poo… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: 7pages

    Journal ref: IJCNN2020

  11. arXiv:2009.02701  [pdf, other

    cs.LG stat.ML

    HPSGD: Hierarchical Parallel SGD With Stale Gradients Featuring

    Authors: Yuhao Zhou, Qing Ye, Hailun Zhang, Jiancheng Lv

    Abstract: While distributed training significantly speeds up the training process of the deep neural network (DNN), the utilization of the cluster is relatively low due to the time-consuming data synchronizing between workers. To alleviate this problem, a novel Hierarchical Parallel SGD (HPSGD) strategy is proposed based on the observation that the data synchronization phase can be paralleled with the local… ▽ More

    Submitted 28 November, 2020; v1 submitted 6 September, 2020; originally announced September 2020.

    Comments: 12 pages, 10 figures, ICONIP2020 under review

  12. arXiv:2007.11831  [pdf, other

    cs.LG cs.DC stat.ML

    DBS: Dynamic Batch Size For Distributed Deep Neural Network Training

    Authors: Qing Ye, Yuhao Zhou, Mingjia Shi, Yanan Sun, Jiancheng Lv

    Abstract: Synchronous strategies with data parallelism, such as the Synchronous StochasticGradient Descent (S-SGD) and the model averaging methods, are widely utilizedin distributed training of Deep Neural Networks (DNNs), largely owing to itseasy implementation yet promising performance. Particularly, each worker ofthe cluster hosts a copy of the DNN and an evenly divided share of the datasetwith the fixed… ▽ More

    Submitted 3 November, 2022; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: The latest version of this article has been accepted by IEEE TETCI

  13. arXiv:2007.08929  [pdf, other

    cs.LG stat.ML

    Provably Consistent Partial-Label Learning

    Authors: Lei Feng, Jiaqi Lv, Bo Han, Miao Xu, Gang Niu, Xin Geng, Bo An, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a multi-class classification problem, where each training example is associated with a set of candidate labels. Even though many practical PLL methods have been proposed in the last two decades, there lacks a theoretical understanding of the consistency of those methods-none of the PLL methods hitherto possesses a generation process of candidate label sets, and then… ▽ More

    Submitted 23 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020 camera-ready version

  14. arXiv:2006.03860  [pdf, other

    stat.ML cs.LG

    Do RNN and LSTM have Long Memory?

    Authors: **gyu Zhao, Feiqing Huang, Jia Lv, Yanjie Duan, Zhen Qin, Guodong Li, Guangjian Tian

    Abstract: The LSTM network was proposed to overcome the difficulty in learning long-term dependence, and has made significant advancements in applications. With its success and drawbacks in mind, this paper raises the question - do RNN and LSTM have long memory? We answer it partially by proving that RNN and LSTM do not have long memory from a statistical perspective. A new definition for long memory networ… ▽ More

    Submitted 10 June, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: Accepted by ICML 2020. Added references, experiments and acknowledgements

  15. arXiv:2002.08053  [pdf, other

    cs.LG stat.ML

    Progressive Identification of True Labels for Partial-Label Learning

    Authors: Jiaqi Lv, Miao Xu, Lei Feng, Gang Niu, Xin Geng, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a typical weakly supervised learning problem, where each training instance is equipped with a set of candidate labels among which only one is the true label. Most existing methods elaborately designed learning objectives as constrained optimizations that must be solved in specific manners, making their computational complexity a bottleneck for scaling up to big data… ▽ More

    Submitted 5 September, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: In Proceedings of the 37th International Conference on Machine Learning (ICML 2020)

  16. arXiv:1912.01201  [pdf, other

    cs.LG cs.CV stat.ML

    Multi-view Subspace Clustering via Partition Fusion

    Authors: Juncheng Lv, Zhao Kang, Boyu Wang, Lu** Ji, Zenglin Xu

    Abstract: Multi-view clustering is an important approach to analyze multi-view data in an unsupervised way. Among various methods, the multi-view subspace clustering approach has gained increasing attention due to its encouraging performance. Basically, it integrates multi-view information into graphs, which are then fed into spectral clustering algorithm for final result. However, its performance may degra… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

  17. arXiv:1910.12970  [pdf, other

    math.ST stat.ME

    Asymptotic Distributions of High-Dimensional Distance Correlation Inference

    Authors: Lan Gao, Yingying Fan, **chi Lv, Qi-Man Shao

    Abstract: Distance correlation has become an increasingly popular tool for detecting the nonlinear dependence between a pair of potentially high-dimensional random vectors. Most existing works have explored its asymptotic distributions under the null hypothesis of independence between the two random vectors when only the sample size or the dimensionality diverges. Yet its asymptotic null distribution for th… ▽ More

    Submitted 20 October, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: 67 pages

    Journal ref: Ann. Statist. 49(4): 1999-2020 (August 2021)

  18. arXiv:1910.01734  [pdf, other

    stat.ME math.ST stat.ML

    SIMPLE: Statistical Inference on Membership Profiles in Large Networks

    Authors: Jianqing Fan, Yingying Fan, Xiao Han, **chi Lv

    Abstract: Network data is prevalent in many contemporary big data applications in which a common interest is to unveil important latent links between different pairs of nodes. Yet a simple fundamental question of how to precisely quantify the statistical uncertainty associated with the identification of latent links still remains largely unexplored. In this paper, we propose the method of statistical infere… ▽ More

    Submitted 29 August, 2021; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: 59 pages, 4 figures; Journal of the Royal Statistical Society Series B, to appear

  19. arXiv:1909.01108  [pdf

    eess.IV cs.LG stat.ML

    Denoising Auto-encoding Priors in Undecimated Wavelet Domain for MR Image Reconstruction

    Authors: Siyuan Wang, Junjie Lv, Yuanyuan Hu, Dong Liang, Minghui Zhang, Qiegen Liu

    Abstract: Compressive sensing is an impressive approach for fast MRI. It aims at reconstructing MR image using only a few under-sampled data in k-space, enhancing the efficiency of the data acquisition. In this study, we propose to learn priors based on undecimated wavelet transform and an iterative image reconstruction algorithm. At the stage of prior learning, transformed feature images obtained by undeci… ▽ More

    Submitted 3 September, 2019; v1 submitted 3 September, 2019; originally announced September 2019.

    Comments: 10 pages, 11 figures, 6 tables

  20. arXiv:1810.04472   

    cs.LG cs.AI stat.ML

    Domain Confusion with Self Ensembling for Unsupervised Adaptation

    Authors: Jiawei Wang, Zhaoshui He, Chengjian Feng, Zhou** Zhu, Qinzhuang Lin, Jun Lv, Shengli Xie

    Abstract: Data collection and annotation are time-consuming in machine learning, expecially for large scale problem. A common approach for this problem is to transfer knowledge from a related labeled domain to a target one. There are two popular ways to achieve this goal: adversarial learning and self training. In this article, we first analyze the training unstablity problem and the mistaken confusion issu… ▽ More

    Submitted 8 July, 2020; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: The expression is ambiguous, which is not convenient for readers to understand, and in today's view, the conclusion of the paper is of little significance, so it is no longer open

  21. arXiv:1809.05032  [pdf, other

    math.ST cs.LG stat.ML

    IPAD: Stable Interpretable Forecasting with Knockoffs Inference

    Authors: Yingying Fan, **chi Lv, Mahrad Sharifvaghefi, Yoshimasa Uematsu

    Abstract: Interpretability and stability are two important features that are desired in many contemporary big data applications arising in economics and finance. While the former is enjoyed to some extent by many existing forecasting approaches, the latter in the sense of controlling the fraction of wrongly discovered features which can enhance greatly the interpretability is still largely underdeveloped in… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.

  22. arXiv:1809.01185  [pdf, other

    cs.LG stat.ML

    DeepPINK: reproducible feature selection in deep neural networks

    Authors: Yang Young Lu, Yingying Fan, **chi Lv, William Stafford Noble

    Abstract: Deep learning has become increasingly popular in both supervised and unsupervised machine learning thanks to its outstanding empirical performance. However, because of their intrinsic complexity, most deep learning methods are largely treated as black box tools with little interpretability. Even though recent attempts have been made to facilitate the interpretability of deep neural networks (DNNs)… ▽ More

    Submitted 6 September, 2018; v1 submitted 4 September, 2018; originally announced September 2018.

  23. arXiv:1808.08469  [pdf, other

    stat.ML cs.LG

    Optimal Nonparametric Inference with Two-Scale Distributional Nearest Neighbors

    Authors: Emre Demirkaya, Yingying Fan, Lan Gao, **chi Lv, Patrick Vossler, **gbo Wang

    Abstract: The weighted nearest neighbors (WNN) estimator has been popularly used as a flexible and easy-to-implement nonparametric tool for mean regression estimation. The bagging technique is an elegant way to form WNN estimators with weights automatically generated to the nearest neighbors; we name the resulting estimator as the distributional nearest neighbors (DNN) for easy reference. Yet, there is a la… ▽ More

    Submitted 17 July, 2022; v1 submitted 25 August, 2018; originally announced August 2018.

    Comments: 99 pages, 2 figures, to appear in Journal of the American Statistical Association

  24. arXiv:1808.07292  [pdf, other

    cs.LG stat.ML

    XAI Beyond Classification: Interpretable Neural Clustering

    Authors: Xi Peng, Yunnan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou

    Abstract: In this paper, we study two challenging problems in explainable AI (XAI) and data clustering. The first is how to directly design a neural network with inherent interpretability, rather than giving post-hoc explanations of a black-box model. The second is implementing discrete $k$-means with a differentiable neural network that embraces the advantages of parallel computing, online clustering, and… ▽ More

    Submitted 22 April, 2022; v1 submitted 22 August, 2018; originally announced August 2018.

    Comments: 28 pages

    Journal ref: Journal of Machine Learning Research, 2022

  25. arXiv:1803.07418  [pdf, other

    stat.ME math.ST stat.AP stat.CO stat.ML

    Large-Scale Model Selection with Misspecification

    Authors: Emre Demirkaya, Yang Feng, Pallavi Basu, **chi Lv

    Abstract: Model selection is crucial to high-dimensional learning and inference for contemporary big data applications in pinpointing the best set of covariates among a sequence of candidate interpretable models. Most existing work assumes implicitly that the models are correctly specified or have fixed dimensionality. Yet both features of model misspecification and high dimensionality are prevalent in prac… ▽ More

    Submitted 16 March, 2018; originally announced March 2018.

    Comments: 38 pages, 2 figures. arXiv admin note: text overlap with arXiv:1412.7468

  26. arXiv:1710.02704  [pdf, ps, other

    stat.ME math.ST stat.ML

    Nonsparse learning with latent variables

    Authors: Zemin Zheng, **chi Lv, Wei Lin

    Abstract: As a popular tool for producing meaningful and interpretable models, large-scale sparse learning works efficiently when the underlying structures are indeed or close to sparse. However, naively applying the existing regularization methods can result in misleading outcomes due to model misspecification. In particular, the direct sparsity assumption on coefficient vectors has been questioned in real… ▽ More

    Submitted 7 October, 2017; originally announced October 2017.

    Comments: 30 pages

    MSC Class: 62J

  27. arXiv:1709.00092  [pdf, ps, other

    math.ST stat.ME stat.ML

    RANK: Large-Scale Inference with Graphical Nonlinear Knockoffs

    Authors: Yingying Fan, Emre Demirkaya, Gaorong Li, **chi Lv

    Abstract: Power and reproducibility are key to enabling refined scientific discoveries in contemporary big data applications with general high-dimensional nonlinear models. In this paper, we provide theoretical foundations on the power and robustness for the model-free knockoffs procedure introduced recently in Candès, Fan, Janson and Lv (2016) in high-dimensional setting when the covariate distribution is… ▽ More

    Submitted 31 August, 2017; originally announced September 2017.

    Comments: 37 pages, 6 tables, 9 pages supplementary material

  28. arXiv:1707.09363  [pdf

    stat.ME

    A Comparative Study of Joint-SNVs Analysis Methods and Detection of Susceptibility Genes for Gastric Cancer in Korean Population

    Authors: **-Xiong Lv, Shikui Tu, Lei Xu

    Abstract: Many joint-SNVs (single-nucleotide variants) analysis methods were proposed to tackle the "missing heritability" problem, which emphasizes that the joint genetic variants can explain more heritability of traits and diseases. However, there is still lack of a systematic comparison and investigation on the relative strengths and weaknesses of these methods. In this paper, we evaluated their performa… ▽ More

    Submitted 28 July, 2017; originally announced July 2017.

    Comments: 10 pages

  29. arXiv:1705.03604  [pdf, other

    stat.ME

    Nonuniformity of P-values Can Occur Early in Diverging Dimensions

    Authors: Yingying Fan, Emre Demirkaya, **chi Lv

    Abstract: Evaluating the joint significance of covariates is of fundamental importance in a wide range of applications. To this end, p-values are frequently employed and produced by algorithms that are powered by classical large-sample asymptotic theory. It is well known that the conventional p-values in Gaussian linear model are valid even when the dimensionality is a non-vanishing fraction of the sample s… ▽ More

    Submitted 10 May, 2017; originally announced May 2017.

    Comments: 23 pages including 8 figures

    MSC Class: 62H15 (primary); 62F03; 62J12 (secondary)

  30. arXiv:1704.08349  [pdf, other

    stat.ME stat.ML

    SOFAR: large-scale association network learning

    Authors: Yoshimasa Uematsu, Yingying Fan, Kun Chen, **chi Lv, Wei Lin

    Abstract: Many modern big data applications feature large scale in both numbers of responses and predictors. Better statistical efficiency and scientific insights can be enabled by understanding the large-scale response-predictor association network structures via layers of sparse latent factors ranked by importance. Yet sparsity and orthogonality have been two largely incompatible goals. To accommodate bot… ▽ More

    Submitted 26 April, 2017; originally announced April 2017.

  31. arXiv:1610.02351  [pdf, other

    stat.ME math.ST stat.AP

    Panning for Gold: Model-X Knockoffs for High-dimensional Controlled Variable Selection

    Authors: Emmanuel Candes, Yingying Fan, Lucas Janson, **chi Lv

    Abstract: Many contemporary large-scale applications involve building interpretable models linking a large set of potential covariates to a response in a nonlinear fashion, such as when the response is binary. Although this modeling problem has been extensively studied, it remains unclear how to effectively control the fraction of false discoveries even in high-dimensional logistic regression, not to mentio… ▽ More

    Submitted 12 December, 2017; v1 submitted 7 October, 2016; originally announced October 2016.

    Comments: 39 pages, 10 figures, 2 tables

  32. arXiv:1608.03686  [pdf, other

    stat.ME

    Scalable Interpretable Multi-Response Regression via SEED

    Authors: Mohammad Taha Bahadori, Zemin Zheng, Yan Liu, **chi Lv

    Abstract: Sparse reduced-rank regression is an important tool to uncover meaningful dependence structure between large numbers of predictors and responses in many big data applications such as genome-wide association studies and social media analysis. Despite the recent theoretical and algorithmic advances, scalable estimation of sparse reduced-rank regression remains largely unexplored. In this paper, we s… ▽ More

    Submitted 12 August, 2016; originally announced August 2016.

    Comments: 31 pages, 7 figures

  33. arXiv:1606.03803  [pdf, other

    stat.ME math.ST stat.ML

    Tuning-Free Heterogeneity Pursuit in Massive Networks

    Authors: Zhao Ren, Yongjian Kang, Yingying Fan, **chi Lv

    Abstract: Heterogeneity is often natural in many contemporary applications involving massive data. While posing new challenges to effective learning, it can play a crucial role in powering meaningful scientific discoveries through the understanding of important differences among subpopulations of interest. In this paper, we exploit multiple networks with Gaussian graphs to encode the connectivity patterns o… ▽ More

    Submitted 12 June, 2016; originally announced June 2016.

    Comments: 29 pages for the main text including 1 figure and 7 tables, 28 pages for the Supplementary Material

  34. arXiv:1605.08933  [pdf, ps, other

    stat.ME stat.ML

    Interaction Pursuit with Feature Screening and Selection

    Authors: Yingying Fan, Yinfei Kong, Daoji Li, **chi Lv

    Abstract: Understanding how features interact with each other is of paramount importance in many scientific discoveries and contemporary applications. Yet interaction identification becomes challenging even for a moderate number of covariates. In this paper, we suggest an efficient and flexible procedure, called the interaction pursuit (IP), for interaction identification in ultra-high dimensions. The sugge… ▽ More

    Submitted 28 May, 2016; originally announced May 2016.

    Comments: 34 pages for the main text including 7 figures, 53 pages for the Supplementary Material

    MSC Class: 62H12; 62J02 (Primary); 62F07; 62F12 (Secondary)

  35. arXiv:1605.03335  [pdf, ps, other

    stat.ME stat.ML

    Asymptotic properties for combined $L_1$ and concave regularization

    Authors: Yingying Fan, **chi Lv

    Abstract: Two important goals of high-dimensional modeling are prediction and variable selection. In this article, we consider regularization with combined $L_1$ and concave penalties, and study the sampling properties of the global optimum of the suggested method in ultra-high dimensional settings. The $L_1$-penalty provides the minimum regularization needed for removing noise variables in order to achieve… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: 16 pages

    MSC Class: 62J07(Primary) 62F07(Secondary)

    Journal ref: Biometrika 101, 57-70

  36. arXiv:1605.03315  [pdf, other

    stat.ME stat.ML

    Interaction pursuit in high-dimensional multi-response regression via distance correlation

    Authors: Yinfei Kong, Daoji Li, Yingying Fan, **chi Lv

    Abstract: Feature interactions can contribute to a large proportion of variation in many prediction models. In the era of big data, the coexistence of high dimensionality in both responses and covariates poses unprecedented challenges in identifying important interactions. In this paper, we suggest a two-stage interaction identification method, called the interaction pursuit via distance correlation (IPDC),… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: to appear in The Annals of Statistics (2016)

    MSC Class: 62H12; 62J02 (Primary); 62F07; 62F12 (Secondary)

  37. arXiv:1605.03313  [pdf, ps, other

    stat.ME stat.ML

    Innovated scalable efficient estimation in ultra-large Gaussian graphical models

    Authors: Yingying Fan, **chi Lv

    Abstract: Large-scale precision matrix estimation is of fundamental importance yet challenging in many contemporary applications for recovering Gaussian graphical models. In this paper, we suggest a new approach of innovated scalable efficient estimation (ISEE) for estimating large precision matrix. Motivated by the innovated transformation, we convert the original problem into that of large covariance matr… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: to appear, The Annals of Statistics (2016)

    MSC Class: 62H12; 62F12 (Primary) 62J05 (Secondary)

  38. arXiv:1605.03311  [pdf, ps, other

    stat.ME stat.ML

    The constrained Dantzig selector with enhanced consistency

    Authors: Yinfei Kong, Zemin Zheng, **chi Lv

    Abstract: The Dantzig selector has received popularity for many applications such as compressed sensing and sparse modeling, thanks to its computational efficiency as a linear programming problem and its nice sampling properties. Existing results show that it can recover sparse signals mimicking the accuracy of the ideal procedure, up to a logarithmic factor of the dimensionality. Such a factor has been sho… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: to appear in Journal of Machine Learning Research

    MSC Class: 62J05(Primary); 62F07; 62J07; 94A20(Secondary)

  39. arXiv:1605.03310  [pdf, ps, other

    stat.ME math.ST stat.ML

    Asymptotic equivalence of regularization methods in thresholded parameter space

    Authors: Yingying Fan, **chi Lv

    Abstract: High-dimensional data analysis has motivated a spectrum of regularization methods for variable selection and sparse modeling, with two popular classes of convex ones and concave ones. A long debate has been on whether one class dominates the other, an important question both in theory and to practitioners. In this paper, we characterize the asymptotic equivalence of regularization methods, with ge… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: 39 pages, 3 figures

    MSC Class: 62J07(Primary) 62F07(Secondary)

    Journal ref: Journal of the American Statistical Association 108, 1044-1061

  40. arXiv:1605.03306  [pdf, ps, other

    stat.ME stat.ML

    High dimensional thresholded regression and shrinkage effect

    Authors: Zemin Zheng, Yingying Fan, **chi Lv

    Abstract: High-dimensional sparse modeling via regularization provides a powerful tool for analyzing large-scale data sets and obtaining meaningful, interpretable models. The use of nonconvex penalty functions shows advantage in selecting important features in high dimensions, but the global optimality of such methods still demands more understanding. In this paper, we consider sparse regression with hard-t… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: 23 pages, 3 figures, 5 tables

    MSC Class: 62J07(Primary) 62F07; 62P10(Secondary)

    Journal ref: Journal of the Royal Statistical Society Series B 76, 627-649

  41. arXiv:1412.7468  [pdf, ps, other

    math.ST stat.ME stat.ML

    Model Selection in High-Dimensional Misspecified Models

    Authors: Pallavi Basu, Yang Feng, **chi Lv

    Abstract: Model selection is indispensable to high-dimensional sparse modeling in selecting the best set of covariates among a sequence of candidate models. Most existing work assumes implicitly that the model is correctly specified or of fixed dimensions. Yet model misspecification and high dimensionality are common in real applications. In this paper, we investigate two classical Kullback-Leibler divergen… ▽ More

    Submitted 23 December, 2014; originally announced December 2014.

    Comments: 43 pages

  42. arXiv:1405.6798  [pdf, ps, other

    math.ST stat.ME

    Discussion: "A significance test for the lasso"

    Authors: **chi Lv, Zemin Zheng

    Abstract: Discussion of "A significance test for the lasso" by Richard Lockhart, Jonathan Taylor, Ryan J. Tibshirani, Robert Tibshirani [arXiv:1301.7161].

    Submitted 27 May, 2014; originally announced May 2014.

    Comments: Published in at http://dx.doi.org/10.1214/13-AOS1175D the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1175D

    Journal ref: Annals of Statistics 2014, Vol. 42, No. 2, 493-500

  43. High-Dimensional Sparse Additive Hazards Regression

    Authors: Wei Lin, **chi Lv

    Abstract: High-dimensional sparse modeling with censored survival data is of great practical importance, as exemplified by modern applications in high-throughput genomic data analysis and credit risk analysis. In this article, we propose a class of regularization methods for simultaneous variable selection and estimation in the additive hazards model, by combining the nonconcave penalized likelihood approac… ▽ More

    Submitted 26 December, 2012; originally announced December 2012.

    Comments: 41 pages, 3 figures, to appear in Journal of the American Statistical Association (http://www.tandfonline.com/r/JASA)

    Journal ref: Journal of the American Statistical Association (2013), 108, 247-264

  44. arXiv:1005.5483  [pdf, ps, other

    math.ST stat.ME

    Model Selection Principles in Misspecified Models

    Authors: **chi Lv, Jun S. Liu

    Abstract: Model selection is of fundamental importance to high dimensional modeling featured in many contemporary applications. Classical principles of model selection include the Kullback-Leibler divergence principle and the Bayesian principle, which lead to the Akaike information criterion and Bayesian information criterion when models are correctly specified. Yet model misspecification is unavoidable whe… ▽ More

    Submitted 11 May, 2016; v1 submitted 29 May, 2010; originally announced May 2010.

    Comments: 25 pages, 6 tables

    MSC Class: 62J12(Primary); 62B10; 62F07; 62F15; 62J07(Secondary)

    Journal ref: Journal of the Royal Statistical Society Series B 76, 141-167 (2014)