Skip to main content

Showing 1–32 of 32 results for author: Bellec, P C

.
  1. arXiv:2404.17856  [pdf, other

    stat.ML cs.LG math.ST stat.CO stat.ME

    Uncertainty quantification for iterative algorithms in linear models with application to early stop**

    Authors: Pierre C. Bellec, Kai Tan

    Abstract: This paper investigates the iterates $\hbb^1,\dots,\hbb^T$ obtained from iterative algorithms in high-dimensional linear regression problems, in the regime where the feature dimension $p$ is comparable with the sample size $n$, i.e., $p \asymp n$. The analysis and proposed estimators are applicable to Gradient Descent (GD), proximal GD and their accelerated variants such as Fast Iterative Soft-Thr… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  2. arXiv:2404.02070  [pdf, other

    math.ST

    Asymptotics of resampling without replacement in robust and logistic regression

    Authors: Pierre C Bellec, Takuya Koriyama

    Abstract: This paper studies the asymptotics of resampling without replacement in the proportional regime where dimension $p$ and sample size $n$ are of the same order. For a given dataset $(X,y)\in \mathbb{R}^{n\times p}\times \mathbb{R}^n$ and fixed subsample ratio $q\in(0,1)$, the practitioner samples independently of $(X,y)$ iid subsets $I_1,...,I_M$ of $\{1,...,n\}$ of size $q n$ and trains estimators… ▽ More

    Submitted 16 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 25 pages, 10 figures

  3. arXiv:2312.13257  [pdf, other

    math.ST

    Error estimation and adaptive tuning for unregularized robust M-estimator

    Authors: Pierre C. Bellec, Takuya Koriyama

    Abstract: We consider unregularized robust M-estimators for linear models under Gaussian design and heavy-tailed noise, in the proportional asymptotics regime where the sample size n and the number of features p are both increasing such that $p/n \to γ\in (0,1)$. An estimator of the out-of-sample error of a robust M-estimator is analysed and proved to be consistent for a large family of loss functions that… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 33 pages, 10 figures

  4. arXiv:2312.13254  [pdf, ps, other

    math.ST

    Existence of solutions to the nonlinear equations characterizing the precise error of M-estimators

    Authors: Pierre C Bellec, Takuya Koriyama

    Abstract: Major progress has been made in the previous decade to characterize the asymptotic behavior of regularized M-estimators in high-dimensional regression problems in the proportional asymptotic regime where the sample size $n$ and the number of features $p$ are increasing simultaneously such that $n/p\to δ\in(0,\infty)$, using powerful tools such as Approximate Message Passing or the Convex Gaussian… ▽ More

    Submitted 3 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  5. arXiv:2310.01374  [pdf, other

    math.ST stat.ME stat.ML

    Corrected generalized cross-validation for finite ensembles of penalized estimators

    Authors: Pierre C. Bellec, **-Hong Du, Takuya Koriyama, Pratik Patil, Kai Tan

    Abstract: Generalized cross-validation (GCV) is a widely-used method for estimating the squared out-of-sample prediction risk that employs a scalar degrees of freedom adjustment (in a multiplicative sense) to the squared training error. In this paper, we examine the consistency of GCV for estimating the prediction risk of arbitrary ensembles of penalized least-squares estimators. We show that GCV is inconsi… ▽ More

    Submitted 21 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 91 pages, 34 figures; this version adds general proof outlines (in Sections 4.3 and 5.3), add more experiments with non-Gaussian data (in Sections D and E), relaxes an assumption (in Section A.7), clarifies explanations at several places, and corrects minor typos at several places

  6. arXiv:2305.17825  [pdf, other

    math.ST stat.ME stat.ML

    Multinomial Logistic Regression: Asymptotic Normality on Null Covariates in High-Dimensions

    Authors: Kai Tan, Pierre C. Bellec

    Abstract: This paper investigates the asymptotic distribution of the maximum-likelihood estimate (MLE) in multinomial logistic models in the high-dimensional regime where dimension and sample size are of the same order. While classical large-sample theory provides asymptotic normality of the MLE under certain conditions, such classical results are expected to fail in high-dimensions as documented for the bi… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

  7. arXiv:2206.07256  [pdf, other

    math.ST stat.ME stat.ML

    Noise Covariance Estimation in Multi-Task High-dimensional Linear Models

    Authors: Kai Tan, Gabriel Romon, Pierre C Bellec

    Abstract: This paper studies the multi-task high-dimensional linear regression models where the noise among different tasks is correlated, in the moderately high dimensional regime where sample size $n$ and dimension $p$ are of the same order. Our goal is to estimate the covariance matrix of the noise random vectors, or equivalently the correlation of the noise variables on any pair of two tasks. Treating t… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  8. arXiv:2204.06990  [pdf, other

    math.ST stat.ML

    Observable adjustments in single-index models for regularized M-estimators

    Authors: Pierre C Bellec

    Abstract: We consider observations $(X,y)$ from single index models with unknown link function, Gaussian covariates and a regularized M-estimator $\hatβ$ constructed from convex loss function and regularizer. In the regime where sample size $n$ and dimension $p$ are both increasing such that $p/n$ has a finite limit, the behavior of the empirical distribution of $\hatβ$ and the predicted values $X\hatβ$ has… ▽ More

    Submitted 3 January, 2024; v1 submitted 14 April, 2022; originally announced April 2022.

  9. arXiv:2107.07828  [pdf, other

    math.ST stat.ML

    Chi-square and normal inference in high-dimensional multi-task regression

    Authors: Pierre C Bellec, Gabriel Romon

    Abstract: The paper proposes chi-square and normal inference methodologies for the unknown coefficient matrix $B^*$ of size $p\times T$ in a Multi-Task (MT) linear model with $p$ covariates, $T$ tasks and $n$ observations under a row-sparse assumption on $B^*$. The row-sparsity $s$, dimension $p$ and number of tasks $T$ are allowed to grow with $n$. In the high-dimensional regime $p\ggg n$, in order to leve… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

  10. arXiv:2107.05143  [pdf, other

    math.ST stat.ML

    Derivatives and residual distribution of regularized M-estimators with application to adaptive tuning

    Authors: Pierre C Bellec, Yiwei Shen

    Abstract: This paper studies M-estimators with gradient-Lipschitz loss function regularized with convex penalty in linear models with Gaussian design matrix and arbitrary noise distribution. A practical example is the robust M-estimator constructed with the Huber loss and the Elastic-Net penalty and the noise distribution has heavy-tails. Our main contributions are three-fold. (i) We provide general formula… ▽ More

    Submitted 11 July, 2021; originally announced July 2021.

  11. arXiv:2107.03826  [pdf, other

    math.ST stat.ML

    Asymptotic normality of robust $M$-estimators with convex penalty

    Authors: Pierre C Bellec, Yiwei Shen, Cun-Hui Zhang

    Abstract: This paper develops asymptotic normality results for individual coordinates of robust M-estimators with convex penalty in high-dimensions, where the dimension $p$ is at most of the same order as the sample size $n$, i.e, $p/n\leγ$ for some fixed constant $γ>0$. The asymptotic normality requires a bias correction and holds for most coordinates of the M-estimator for a large class of loss functions… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  12. arXiv:2008.11840  [pdf, other

    math.ST stat.ML

    Out-of-sample error estimate for robust M-estimators with convex penalty

    Authors: Pierre C Bellec

    Abstract: A generic out-of-sample error estimate is proposed for robust $M$-estimators regularized with a convex penalty in high-dimensional linear regression where $(X,y)$ is observed and $p,n$ are of the same order. If $ψ$ is the derivative of the robust data-fitting loss $ρ$, the estimate depends on the observed data only through the quantities $\hatψ= ψ(y-X\hatβ)$, $X^\top \hatψ$ and the derivatives… ▽ More

    Submitted 30 March, 2023; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: This version adds simulations for the nuclear norm penalty

  13. arXiv:1912.11943  [pdf, other

    math.ST

    De-biasing convex regularized estimators and interval estimation in linear models

    Authors: Pierre C Bellec, Cun-Hui Zhang

    Abstract: New upper bounds are developed for the $L_2$ distance between $ξ/\text{Var}[ξ]^{1/2}$ and linear and quadratic functions of $z\sim N(0,I_n)$ for random variables of the form $ξ=bz^\top f(z) - \text{div} f(z)$. The linear approximation yields a central limit theorem when the squared norm of $f(z)$ dominates the squared Frobenius norm of $\nabla f(z)$ in expectation. Applications of this normal appr… ▽ More

    Submitted 28 September, 2021; v1 submitted 26 December, 2019; originally announced December 2019.

    Comments: Manuscript title was updated; see former title at arXiv:912.11943v3

  14. arXiv:1910.05480  [pdf, ps, other

    math.ST stat.ML

    First order expansion of convex regularized estimators

    Authors: Pierre C Bellec, Arun K Kuchibhotla

    Abstract: We consider first order expansions of convex penalized estimators in high-dimensional regression problems with random designs. Our setting includes linear regression and logistic regression as special cases. For a given penalty function $h$ and the corresponding penalized estimator $\hatβ$, we construct a quantity $η$, the first order expansion of $\hatβ$, such that the distance between $\hatβ$ an… ▽ More

    Submitted 8 March, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019 and published at https://papers.nips.cc/paper/8606-first-order-expansion-of-convex-regularized-estimators . The version here includes the supplementary material

  15. arXiv:1905.12517  [pdf, ps, other

    math.ST stat.ML

    The cost-free nature of optimally tuning Tikhonov regularizers and other ordered smoothers

    Authors: Pierre C Bellec, Dana Yang

    Abstract: We consider the problem of selecting the best estimator among a family of Tikhonov regularized estimators, or, alternatively, to select a linear combination of these regularizers that is as good as the best regularizer in the family. Our theory reveals that if the Tikhonov regularizers share the same penalty matrix with different tuning parameters, a convex procedure based on $Q$-aggregation achie… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  16. arXiv:1902.08885  [pdf, other

    math.ST stat.ML

    De-Biasing The Lasso With Degrees-of-Freedom Adjustment

    Authors: Pierre C. Bellec, Cun-Hui Zhang

    Abstract: This paper studies schemes to de-bias the Lasso in a linear model $y=Xβ+ε$ where the goal is to construct confidence intervals for $a_0^Tβ$ in a direction $a_0$, where $X$ has iid $N(0,Σ)$ rows. We show that previously analyzed propositions to de-bias the Lasso require a modification in order to enjoy efficiency in a full range of sparsity. This modification takes the form of a degrees-of-freedom… ▽ More

    Submitted 8 July, 2021; v1 submitted 23 February, 2019; originally announced February 2019.

  17. arXiv:1901.08736  [pdf, ps, other

    math.ST

    Concentration of quadratic forms under a Bernstein moment assumption

    Authors: Pierre C Bellec

    Abstract: A concentration result for quadratic form of independent subgaussian random variables is derived. If the moments of the random variables satisfy a "Bernstein condition", then the variance term of the Hanson-Wright inequality can be improved. The Bernstein condition is satisfied, for instance, by all log-concave subgaussian distributions.

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: This short note presents a result that initially appeared in arXiv:1410.0346v1 (see Assumption 3.3). The result was later removed from arXiv:1410.0346 and the published version https://projecteuclid.org/euclid.aos/1519268423 due to space constraints

  18. arXiv:1811.04121  [pdf, other

    math.ST

    Second order Stein: SURE for SURE and other applications in high-dimensional inference

    Authors: Pierre C Bellec, Cun-Hui Zhang

    Abstract: Stein's formula states that a random variable of the form $z^\top f(z) - \text{div} f(z)$ is mean-zero for functions $f$ with integrable gradient. Here, $\text{div} f$ is the divergence of the function $f$ and $z$ is a standard normal vector. This paper aims to propose a Second Order Stein formula to characterize the variance of such random variables for all functions $f(z)$ with square integrable… ▽ More

    Submitted 6 February, 2020; v1 submitted 9 November, 2018; originally announced November 2018.

  19. arXiv:1804.01230  [pdf, ps, other

    math.ST

    The noise barrier and the large signal bias of the Lasso and other convex estimators

    Authors: Pierre C Bellec

    Abstract: Convex estimators such as the Lasso, the matrix Lasso and the group Lasso have been studied extensively in the last two decades, demonstrating great success in both theory and practice. Two quantities are introduced, the noise barrier and the large scale bias, that provides insights on the performance of these convex regularized estimators. It is now well understood that the Lasso achieves fast pr… ▽ More

    Submitted 27 October, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: This paper supersedes the previous article arXiv:1703.01332

  20. arXiv:1706.06977  [pdf, other

    math.ST

    A sharp oracle inequality for Graph-Slope

    Authors: Pierre C Bellec, Joseph Salmon, Samuel Vaiter

    Abstract: Following recent success on the analysis of the Slope estimator, we provide a sharp oracle inequality in term of prediction error for Graph-Slope, a generalization of Slope to signals observed over a graph. In addition to improving upon best results obtained so far for the Total Variation denoiser (also referred to as Graph-Lasso or Generalized Lasso), we propose an efficient algorithm to compute… ▽ More

    Submitted 20 November, 2017; v1 submitted 21 June, 2017; originally announced June 2017.

  21. arXiv:1705.10696  [pdf, ps, other

    math.ST

    Localized Gaussian width of $M$-convex hulls with applications to Lasso and convex aggregation

    Authors: Pierre C Bellec

    Abstract: Upper and lower bounds are derived for the Gaussian mean width of the intersection of a convex hull of $M$ points with an Euclidean ball of a given radius. The upper bound holds for any collection of extreme point bounded in Euclidean norm. The upper bound and the lower bound match up to a multiplicative constant whenever the extreme points satisfy a one sided Restricted Isometry Property. This… ▽ More

    Submitted 26 September, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

  22. arXiv:1703.01332  [pdf, ps, other

    math.ST

    Optimistic lower bounds for convex regularized least-squares

    Authors: Pierre C Bellec

    Abstract: Minimax lower bounds are pessimistic in nature: for any given estimator, minimax lower bounds yield the existence of a worst-case target vector $β^*_{worst}$ for which the prediction error of the given estimator is bounded from below. However, minimax lower bounds shed no light on the prediction error of the given estimator for target vectors different than $β^*_{worst}$. A characterization of the… ▽ More

    Submitted 6 October, 2017; v1 submitted 3 March, 2017; originally announced March 2017.

  23. arXiv:1701.09120  [pdf, ps, other

    math.ST

    Towards the study of least squares estimators with convex penalty

    Authors: Pierre C. Bellec, Guillaume Lecué, Alexandre B. Tsybakov

    Abstract: Penalized least squares estimation is a popular technique in high-dimensional statistics. It includes such methods as the LASSO, the group LASSO, and the nuclear norm penalized least squares. The existing theory of these methods is not fully satisfying since it allows one to prove oracle inequalities with fixed high probability only for the estimators depending on this probability. Furthermore, th… ▽ More

    Submitted 7 July, 2017; v1 submitted 31 January, 2017; originally announced January 2017.

  24. arXiv:1609.06675  [pdf, ps, other

    math.ST

    Bounds on the prediction error of penalized least squares estimators with convex penalty

    Authors: Pierre C. Bellec, Alexandre B. Tsybakov

    Abstract: This paper considers the penalized least squares estimator with arbitrary convex penalty. When the observation noise is Gaussian, we show that the prediction error is a subgaussian random variable concentrated around its median. We apply this concentration property to derive sharp oracle inequalities for the prediction error of the LASSO, the group LASSO and the SLOPE estimators, both in probabili… ▽ More

    Submitted 21 September, 2016; originally announced September 2016.

  25. arXiv:1606.06179  [pdf, ps, other

    math.ST stat.ML

    On the prediction loss of the lasso in the partially labeled setting

    Authors: Pierre C. Bellec, Arnak S. Dalalyan, Edwin Grappin, Quentin Paris

    Abstract: In this paper we revisit the risk bounds of the lasso estimator in the context of transductive and semi-supervised learning. In other terms, the setting under consideration is that of regression with random design under partial labeling. The main goal is to obtain user-friendly bounds on the off-sample prediction risk. To this end, the simple setting of bounded response variable and bounded (high-… ▽ More

    Submitted 8 November, 2016; v1 submitted 20 June, 2016; originally announced June 2016.

    Comments: 25 pages

  26. arXiv:1605.08651  [pdf, ps, other

    math.ST

    Slope meets Lasso: improved oracle bounds and optimality

    Authors: Pierre C. Bellec, Guillaume Lecué, Alexandre B. Tsybakov

    Abstract: We show that two polynomial time methods, a Lasso estimator with adaptively chosen tuning parameter and a Slope estimator, adaptively achieve the exact minimax prediction and $\ell_2$ estimation rate $(s/n)\log (p/s)$ in high-dimensional linear regression on the class of $s$-sparse target vectors in $\mathbb R^p$. This is done under the Restricted Eigenvalue (RE) condition for the Lasso and under… ▽ More

    Submitted 24 May, 2017; v1 submitted 27 May, 2016; originally announced May 2016.

  27. arXiv:1602.03427  [pdf, ps, other

    math.ST

    Aggregation of supports along the Lasso path

    Authors: Pierre C. Bellec

    Abstract: In linear regression with fixed design, we propose two procedures that aggregate a data-driven collection of supports. The collection is a subset of the $2^p$ possible supports and both its cardinality and its elements can depend on the data. The procedures satisfy oracle inequalities with no assumption on the design matrix. Then we use these procedures to aggregate the supports that appear on the… ▽ More

    Submitted 31 May, 2016; v1 submitted 10 February, 2016; originally announced February 2016.

  28. arXiv:1601.05766  [pdf, ps, other

    math.ST

    Adaptive confidence sets in shape restricted regression

    Authors: Pierre C. Bellec

    Abstract: A simple construction of adaptive confidence sets is proposed in isotonic, convex and unimodal regression. In univariate isotonic regression, the proposed confidence set enjoys uniform coverage over all non-decreasing regression functions. Furthermore, the diameter of the proposed confidence set automatically adapts to the unknown number of pieces of the true parameter, in the sense that the diame… ▽ More

    Submitted 9 April, 2019; v1 submitted 21 January, 2016; originally announced January 2016.

  29. arXiv:1510.08029  [pdf, ps, other

    math.ST

    Sharp oracle inequalities for Least Squares estimators in shape restricted regression

    Authors: Pierre C. Bellec

    Abstract: The performance of Least Squares (LS) estimators is studied in isotonic, unimodal and convex regression. Our results have the form of sharp oracle inequalities that account for the model misspecification error. In isotonic and unimodal regression, the LS estimator achieves the nonparametric rate $n^{-2/3}$ as well as a parametric rate of order $k/n$ up to logarithmic factors, where $k$ is the numb… ▽ More

    Submitted 7 August, 2016; v1 submitted 27 October, 2015; originally announced October 2015.

  30. arXiv:1506.08724  [pdf, ps, other

    math.ST

    Sharp oracle bounds for monotone and convex regression through aggregation

    Authors: Pierre C. Bellec, Alexandre B. Tsybakov

    Abstract: We derive oracle inequalities for the problems of isotonic and convex regression using the combination of $Q$-aggregation procedure and sparsity pattern aggregation. This improves upon the previous results including the oracle inequalities for the constrained least squares estimator. One of the improvements is that our oracle inequalities are sharp, i.e., with leading constant 1. It allows us to o… ▽ More

    Submitted 30 September, 2015; v1 submitted 29 June, 2015; originally announced June 2015.

  31. arXiv:1410.0346  [pdf, other

    math.ST math.PR

    Optimal bounds for aggregation of affine estimators

    Authors: Pierre C. Bellec

    Abstract: We study the problem of aggregation of estimators when the estimators are not independent of the data used for aggregation and no sample splitting is allowed. If the estimators are deterministic vectors, it is well known that the minimax rate of aggregation is of order $\log(M)$, where $M$ is the number of estimators to aggregate. It is proved that for affine estimators, the minimax rate of aggreg… ▽ More

    Submitted 27 February, 2018; v1 submitted 1 October, 2014; originally announced October 2014.

    Comments: Published at https://projecteuclid.org/euclid.aos/1519268423 in the Annals of Statistics (http://imstat.org/aos/ ) by the Institute of Mathematical Statistics (http://imstat.org/ )

    Journal ref: Ann. Statist. Volume 46, Number 1 (2018), 30-59

  32. Optimal exponential bounds for aggregation of density estimators

    Authors: Pierre C. Bellec

    Abstract: We consider the problem of model selection type aggregation in the context of density estimation. We first show that empirical risk minimization is sub-optimal for this problem and it shares this property with the exponential weights aggregate, empirical risk minimization over the convex hull of the dictionary functions, and all selectors. Using a penalty inspired by recent works on the $Q$-aggreg… ▽ More

    Submitted 28 September, 2016; v1 submitted 15 May, 2014; originally announced May 2014.

    Comments: Published at http://dx.doi.org/10.3150/15-BEJ742 in the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm)

    Report number: IMS-BEJ-BEJ742

    Journal ref: Bernoulli 2017, Vol. 23, No. 1, 219-248