Skip to main content

Showing 1–4 of 4 results for author: Stromberg, N

.
  1. arXiv:2406.09561  [pdf, other

    cs.LG cs.AI

    Label Noise Robustness for Domain-Agnostic Fair Corrections via Nearest Neighbors Label Spreading

    Authors: Nathan Stromberg, Rohan Ayyagari, Sanmi Koyejo, Richard Nock, Lalitha Sankar

    Abstract: Last-layer retraining methods have emerged as an efficient framework for correcting existing base models. Within this framework, several methods have been proposed to deal with correcting models for subgroup fairness with and without group membership information. Importantly, prior work has demonstrated that many methods are susceptible to noisy labels. To this end, we propose a drop-in correction… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2405.05934  [pdf, other

    cs.LG cs.CV cs.IT stat.ML

    Theoretical Guarantees of Data Augmented Last Layer Retraining Methods

    Authors: Monica Welfert, Nathan Stromberg, Lalitha Sankar

    Abstract: Ensuring fair predictions across many distinct subpopulations in the training data can be prohibitive for large models. Recently, simple linear last layer retraining strategies, in combination with data augmentation methods such as upweighting, downsampling and mixup, have been shown to achieve state-of-the-art performance for worst-group accuracy, which quantifies accuracy for the least prevalent… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Extended version of a paper accepted to ISIT 2024. arXiv admin note: text overlap with arXiv:2402.11039

  3. arXiv:2402.11039  [pdf, other

    cs.LG stat.ML

    Robustness to Subpopulation Shift with Domain Label Noise via Regularized Annotation of Domains

    Authors: Nathan Stromberg, Rohan Ayyagari, Monica Welfert, Sanmi Koyejo, Richard Nock, Lalitha Sankar

    Abstract: Existing methods for last layer retraining that aim to optimize worst-group accuracy (WGA) rely heavily on well-annotated groups in the training data. We show, both in theory and practice, that annotation-based data augmentations using either downsampling or upweighting for WGA are susceptible to domain annotation noise, and in high-noise regimes approach the WGA of a model trained with vanilla em… ▽ More

    Submitted 26 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Generalized Gaussian assumption

  4. arXiv:2302.09114  [pdf, other

    cs.LG cs.IT

    Smoothly Giving up: Robustness for Simple Models

    Authors: Tyler Sypherd, Nathan Stromberg, Richard Nock, Visar Berisha, Lalitha Sankar

    Abstract: There is a growing need for models that are interpretable and have reduced energy and computational cost (e.g., in health care analytics and federated learning). Examples of algorithms to train such models include logistic regression and boosting. However, one challenge facing these algorithms is that they provably suffer from label noise; this has been attributed to the joint interaction between… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: To appear in AISTATS 2023