Skip to main content

Showing 1–13 of 13 results for author: Bellec, P C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.17856  [pdf, other

    stat.ML cs.LG math.ST stat.CO stat.ME

    Uncertainty quantification for iterative algorithms in linear models with application to early stop**

    Authors: Pierre C. Bellec, Kai Tan

    Abstract: This paper investigates the iterates $\hbb^1,\dots,\hbb^T$ obtained from iterative algorithms in high-dimensional linear regression problems, in the regime where the feature dimension $p$ is comparable with the sample size $n$, i.e., $p \asymp n$. The analysis and proposed estimators are applicable to Gradient Descent (GD), proximal GD and their accelerated variants such as Fast Iterative Soft-Thr… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  2. arXiv:2310.01374  [pdf, other

    math.ST stat.ME stat.ML

    Corrected generalized cross-validation for finite ensembles of penalized estimators

    Authors: Pierre C. Bellec, **-Hong Du, Takuya Koriyama, Pratik Patil, Kai Tan

    Abstract: Generalized cross-validation (GCV) is a widely-used method for estimating the squared out-of-sample prediction risk that employs a scalar degrees of freedom adjustment (in a multiplicative sense) to the squared training error. In this paper, we examine the consistency of GCV for estimating the prediction risk of arbitrary ensembles of penalized least-squares estimators. We show that GCV is inconsi… ▽ More

    Submitted 21 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 91 pages, 34 figures; this version adds general proof outlines (in Sections 4.3 and 5.3), add more experiments with non-Gaussian data (in Sections D and E), relaxes an assumption (in Section A.7), clarifies explanations at several places, and corrects minor typos at several places

  3. arXiv:2305.17825  [pdf, other

    math.ST stat.ME stat.ML

    Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions

    Authors: Kai Tan, Pierre C. Bellec

    Abstract: This paper investigates the asymptotic distribution of the maximum-likelihood estimate (MLE) in multinomial logistic models in the high-dimensional regime where dimension and sample size are of the same order. While classical large-sample theory provides asymptotic normality of the MLE under certain conditions, such classical results are expected to fail in high-dimensions as documented for the bi… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  4. arXiv:2206.07256  [pdf, other

    math.ST stat.ME stat.ML

    Noise Covariance Estimation in Multi-Task High-dimensional Linear Models

    Authors: Kai Tan, Gabriel Romon, Pierre C Bellec

    Abstract: This paper studies the multi-task high-dimensional linear regression models where the noise among different tasks is correlated, in the moderately high dimensional regime where sample size $n$ and dimension $p$ are of the same order. Our goal is to estimate the covariance matrix of the noise random vectors, or equivalently the correlation of the noise variables on any pair of two tasks. Treating t… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  5. arXiv:2204.06990  [pdf, other

    math.ST stat.ML

    Observable adjustments in single-index models for regularized M-estimators

    Authors: Pierre C Bellec

    Abstract: We consider observations $(X,y)$ from single index models with unknown link function, Gaussian covariates and a regularized M-estimator $\hatβ$ constructed from convex loss function and regularizer. In the regime where sample size $n$ and dimension $p$ are both increasing such that $p/n$ has a finite limit, the behavior of the empirical distribution of $\hatβ$ and the predicted values $X\hatβ$ has… ▽ More

    Submitted 3 January, 2024; v1 submitted 14 April, 2022; originally announced April 2022.

  6. arXiv:2107.07828  [pdf, other

    math.ST stat.ML

    Chi-square and normal inference in high-dimensional multi-task regression

    Authors: Pierre C Bellec, Gabriel Romon

    Abstract: The paper proposes chi-square and normal inference methodologies for the unknown coefficient matrix $B^*$ of size $p\times T$ in a Multi-Task (MT) linear model with $p$ covariates, $T$ tasks and $n$ observations under a row-sparse assumption on $B^*$. The row-sparsity $s$, dimension $p$ and number of tasks $T$ are allowed to grow with $n$. In the high-dimensional regime $p\ggg n$, in order to leve… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  7. arXiv:2107.05143  [pdf, other

    math.ST stat.ML

    Derivatives and residual distribution of regularized M-estimators with application to adaptive tuning

    Authors: Pierre C Bellec, Yiwei Shen

    Abstract: This paper studies M-estimators with gradient-Lipschitz loss function regularized with convex penalty in linear models with Gaussian design matrix and arbitrary noise distribution. A practical example is the robust M-estimator constructed with the Huber loss and the Elastic-Net penalty and the noise distribution has heavy-tails. Our main contributions are three-fold. (i) We provide general formula… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

  8. arXiv:2107.03826  [pdf, other

    math.ST stat.ML

    Asymptotic normality of robust $M$-estimators with convex penalty

    Authors: Pierre C Bellec, Yiwei Shen, Cun-Hui Zhang

    Abstract: This paper develops asymptotic normality results for individual coordinates of robust M-estimators with convex penalty in high-dimensions, where the dimension $p$ is at most of the same order as the sample size $n$, i.e, $p/n\leγ$ for some fixed constant $γ>0$. The asymptotic normality requires a bias correction and holds for most coordinates of the M-estimator for a large class of loss functions… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  9. arXiv:2008.11840  [pdf, other

    math.ST stat.ML

    Out-of-sample error estimate for robust M-estimators with convex penalty

    Authors: Pierre C Bellec

    Abstract: A generic out-of-sample error estimate is proposed for robust $M$-estimators regularized with a convex penalty in high-dimensional linear regression where $(X,y)$ is observed and $p,n$ are of the same order. If $ψ$ is the derivative of the robust data-fitting loss $ρ$, the estimate depends on the observed data only through the quantities $\hatψ= ψ(y-X\hatβ)$, $X^\top \hatψ$ and the derivatives… ▽ More

    Submitted 30 March, 2023; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: This version adds simulations for the nuclear norm penalty

  10. arXiv:1910.05480  [pdf, ps, other

    math.ST stat.ML

    First order expansion of convex regularized estimators

    Authors: Pierre C Bellec, Arun K Kuchibhotla

    Abstract: We consider first order expansions of convex penalized estimators in high-dimensional regression problems with random designs. Our setting includes linear regression and logistic regression as special cases. For a given penalty function $h$ and the corresponding penalized estimator $\hatβ$, we construct a quantity $η$, the first order expansion of $\hatβ$, such that the distance between $\hatβ$ an… ▽ More

    Submitted 8 March, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 and published at https://papers.nips.cc/paper/8606-first-order-expansion-of-convex-regularized-estimators . The version here includes the supplementary material

  11. arXiv:1905.12517  [pdf, ps, other

    math.ST stat.ML

    The cost-free nature of optimally tuning Tikhonov regularizers and other ordered smoothers

    Authors: Pierre C Bellec, Dana Yang

    Abstract: We consider the problem of selecting the best estimator among a family of Tikhonov regularized estimators, or, alternatively, to select a linear combination of these regularizers that is as good as the best regularizer in the family. Our theory reveals that if the Tikhonov regularizers share the same penalty matrix with different tuning parameters, a convex procedure based on $Q$-aggregation achie… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  12. arXiv:1902.08885  [pdf, other

    math.ST stat.ML

    De-Biasing The Lasso With Degrees-of-Freedom Adjustment

    Authors: Pierre C. Bellec, Cun-Hui Zhang

    Abstract: This paper studies schemes to de-bias the Lasso in a linear model $y=Xβ+ε$ where the goal is to construct confidence intervals for $a_0^Tβ$ in a direction $a_0$, where $X$ has iid $N(0,Σ)$ rows. We show that previously analyzed propositions to de-bias the Lasso require a modification in order to enjoy efficiency in a full range of sparsity. This modification takes the form of a degrees-of-freedom… ▽ More

    Submitted 8 July, 2021; v1 submitted 23 February, 2019; originally announced February 2019.

  13. arXiv:1606.06179  [pdf, ps, other

    math.ST stat.ML

    On the prediction loss of the lasso in the partially labeled setting

    Authors: Pierre C. Bellec, Arnak S. Dalalyan, Edwin Grappin, Quentin Paris

    Abstract: In this paper we revisit the risk bounds of the lasso estimator in the context of transductive and semi-supervised learning. In other terms, the setting under consideration is that of regression with random design under partial labeling. The main goal is to obtain user-friendly bounds on the off-sample prediction risk. To this end, the simple setting of bounded response variable and bounded (high-… ▽ More

    Submitted 8 November, 2016; v1 submitted 20 June, 2016; originally announced June 2016.

    Comments: 25 pages