Skip to main content

Showing 1–12 of 12 results for author: Sur, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13944  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Generalization error of min-norm interpolators in transfer learning

    Authors: Yanke Song, Sohom Bhattacharya, Pragya Sur

    Abstract: This paper establishes the generalization error of pooled min-$\ell_2$-norm interpolation in transfer learning where data from diverse distributions are available. Min-norm interpolators emerge naturally as implicit regularized limits of modern machine learning algorithms. Previous work characterized their out-of-distribution risk when samples from the test distribution are unavailable during trai… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 53 pages, 2 figures

  2. arXiv:2406.11666  [pdf, other

    math.ST cs.LG stat.ML

    ROTI-GCV: Generalized Cross-Validation for right-ROTationally Invariant Data

    Authors: Kevin Luo, Yufan Li, Pragya Sur

    Abstract: Two key tasks in high-dimensional regularized regression are tuning the regularization strength for good predictions and estimating the out-of-sample risk. It is known that the standard approach -- $k$-fold cross-validation -- is inconsistent in modern high-dimensional settings. While leave-one-out and generalized cross-validation remain consistent in some high-dimensional cases, they become incon… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 25 pages, 3 figures

  3. arXiv:2403.16336  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Predictive Inference in Multi-environment Scenarios

    Authors: John C. Duchi, Suyash Gupta, Kuanhao Jiang, Pragya Sur

    Abstract: We address the challenge of constructing valid confidence intervals and sets in problems of prediction across multiple environments. We investigate two types of coverage suitable for these problems, extending the jackknife and split-conformal methods to show how to obtain distribution-free coverage in such non-traditional, hierarchical data-generating scenarios. Our contributions also include exte… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  4. arXiv:2309.07810  [pdf, other

    math.ST cs.IT stat.ME stat.ML

    Spectrum-Aware Adjustment: A New Debiasing Framework with Applications to Principal Component Regression

    Authors: Yufan Li, Pragya Sur

    Abstract: We introduce a new debiasing framework for high-dimensional linear regression that bypasses the restrictions on covariate distributions imposed by modern debiasing technology. We study the prevalent setting where the number of features and samples are both large and comparable. In this context, state-of-the-art debiasing technology uses a degrees-of-freedom correction to remove the shrinkage bias… ▽ More

    Submitted 16 October, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

  5. arXiv:2210.12082  [pdf, other

    stat.ML cs.LG math.ST

    A Non-Asymptotic Moreau Envelope Theory for High-Dimensional Generalized Linear Models

    Authors: Lijia Zhou, Frederic Koehler, Pragya Sur, Danica J. Sutherland, Nathan Srebro

    Abstract: We prove a new generalization bound that shows for any class of linear predictors in Gaussian space, the Rademacher complexity of the class and the training error under any continuous loss $\ell$ can control the test error under all Moreau envelopes of the loss $\ell$. We use our finite-sample bound to directly recover the "optimistic rate" of Zhou et al. (2021) for linear regression with the squa… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: As published at NeurIPS 2022

  6. arXiv:2207.04588  [pdf, other

    stat.ML cs.LG

    Multi-Study Boosting: Theoretical Considerations for Merging vs. Ensembling

    Authors: Cathy Shyr, Pragya Sur, Giovanni Parmigiani, Prasad Patil

    Abstract: Cross-study replicability is a powerful model evaluation criterion that emphasizes generalizability of predictions. When training cross-study replicable prediction models, it is critical to decide between merging and treating the studies separately. We study boosting algorithms in the presence of potential heterogeneity in predictor-outcome relationships across studies and compare two multi-study… ▽ More

    Submitted 12 July, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

  7. arXiv:2204.04476  [pdf, other

    math.ST cs.LG math.PR stat.ML

    High-dimensional Asymptotics of Langevin Dynamics in Spiked Matrix Models

    Authors: Tengyuan Liang, Subhabrata Sen, Pragya Sur

    Abstract: We study Langevin dynamics for recovering the planted signal in the spiked matrix model. We provide a "path-wise" characterization of the overlap between the output of the Langevin algorithm and the planted signal. This overlap is characterized in terms of a self-consistent system of integro-differential equations, usually referred to as the Crisanti-Horner-Sommers-Cugliandolo-Kurchan (CHSCK) equa… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: 26 pages

    Journal ref: Information and Inference: A Journal of the IMA, 12(4):2720-2752, 2023

  8. arXiv:2006.11478  [pdf, ps, other

    cs.LG stat.ML

    Representation via Representations: Domain Generalization via Adversarially Learned Invariant Representations

    Authors: Zhun Deng, Frances Ding, Cynthia Dwork, Rachel Hong, Giovanni Parmigiani, Prasad Patil, Pragya Sur

    Abstract: We investigate the power of censoring techniques, first developed for learning {\em fair representations}, to address domain generalization. We examine {\em adversarial} censoring techniques for learning invariant representations from multiple "studies" (or domains), where each study is drawn according to a distribution on domains. The map** is used at test time to classify instances from a new… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

  9. arXiv:2004.01840  [pdf, other

    cs.LG stat.ML

    Abstracting Fairness: Oracles, Metrics, and Interpretability

    Authors: Cynthia Dwork, Christina Ilvento, Guy N. Rothblum, Pragya Sur

    Abstract: It is well understood that classification algorithms, for example, for deciding on loan applications, cannot be evaluated for fairness without taking context into account. We examine what can be learned from a fairness oracle equipped with an underlying understanding of ``true'' fairness. The oracle takes as input a (context, classifier) pair satisfying an arbitrary fairness definition, and accept… ▽ More

    Submitted 3 April, 2020; originally announced April 2020.

    Comments: 17 pages, 1 figure

  10. arXiv:2002.01586  [pdf, other

    math.ST cs.IT cs.LG stat.ML

    A Precise High-Dimensional Asymptotic Theory for Boosting and Minimum-$\ell_1$-Norm Interpolated Classifiers

    Authors: Tengyuan Liang, Pragya Sur

    Abstract: This paper establishes a precise high-dimensional asymptotic theory for boosting on separable data, taking statistical and computational perspectives. We consider a high-dimensional setting where the number of features (weak learners) $p$ scales with the sample size $n$, in an overparametrized regime. Under a class of statistical models, we provide an exact analysis of the generalization error of… ▽ More

    Submitted 22 July, 2021; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: 68 pages, 4 figures

    Journal ref: The Annals of Statistics, 50(3):1669-1695, 2022

  11. arXiv:1706.01191  [pdf, other

    math.ST cs.IT math.PR stat.ML

    The Likelihood Ratio Test in High-Dimensional Logistic Regression Is Asymptotically a Rescaled Chi-Square

    Authors: Pragya Sur, Yuxin Chen, Emmanuel J. Candès

    Abstract: Logistic regression is used thousands of times a day to fit data, predict future outcomes, and assess the statistical significance of explanatory variables. When used for the purpose of statistical inference, logistic models produce p-values for the regression coefficients by using an approximation to the distribution of the likelihood-ratio test. Indeed, Wilks' theorem asserts that whenever we ha… ▽ More

    Submitted 5 June, 2017; originally announced June 2017.

    Comments: 58 pages, 7 figures

  12. arXiv:1403.2508  [pdf, other

    cs.DC

    Heuristic-based Optimal Resource Provisioning in Application-centric Cloud

    Authors: Sunirmal Khatua, Preetam K. Sur, Rajib K. Das, Nandini Mukherjee

    Abstract: Cloud Service Providers (CSPs) adapt different pricing models for their offered services. Some of the models are suitable for short term requirement while others may be suitable for the Cloud Service User's (CSU) long term requirement. In this paper, we look at the problem of finding the amount of resources to be reserved to satisfy the CSU's long term demands with the aim of minimizing the total… ▽ More

    Submitted 11 March, 2014; originally announced March 2014.