Skip to main content

Showing 1–8 of 8 results for author: Springer, J M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20439  [pdf, other

    cs.LG

    Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning

    Authors: Jacob Mitchell Springer, Vaishnavh Nagarajan, Aditi Raghunathan

    Abstract: Sharpness-Aware Minimization (SAM) has emerged as a promising alternative optimizer to stochastic gradient descent (SGD). The originally-proposed motivation behind SAM was to bias neural networks towards flatter minima that are believed to generalize better. However, recent studies have shown conflicting evidence on the relationship between flatness and generalization, suggesting that flatness doe… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 25 pages, 10 figures, 2 tables

  2. arXiv:2402.15449  [pdf, other

    cs.CL cs.LG

    Repetition Improves Language Model Embeddings

    Authors: Jacob Mitchell Springer, Suhas Kotha, Daniel Fried, Graham Neubig, Aditi Raghunathan

    Abstract: Recent approaches to improving the extraction of text embeddings from autoregressive large language models (LLMs) have largely focused on improvements to data, backbone pretrained language models, or improving task-differentiation via instructions. In this work, we address an architectural limitation of autoregressive models: token embeddings cannot contain information from tokens that appear late… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 36 pages, 11 figures, 16 tables

  3. arXiv:2309.10105  [pdf, other

    cs.CL cs.LG

    Understanding Catastrophic Forgetting in Language Models via Implicit Inference

    Authors: Suhas Kotha, Jacob Mitchell Springer, Aditi Raghunathan

    Abstract: We lack a systematic understanding of the effects of fine-tuning (via methods such as instruction-tuning or reinforcement learning from human feedback), particularly on tasks outside the narrow fine-tuning distribution. In a simplified scenario, we demonstrate that improving performance on tasks within the fine-tuning data distribution comes at the expense of capabilities on other tasks. We hypoth… ▽ More

    Submitted 13 April, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: ICLR 2024

  4. arXiv:2106.02105  [pdf, other

    cs.LG cs.CR

    A Little Robustness Goes a Long Way: Leveraging Robust Features for Targeted Transfer Attacks

    Authors: Jacob M. Springer, Melanie Mitchell, Garrett T. Kenyon

    Abstract: Adversarial examples for neural network image classifiers are known to be transferable: examples optimized to be misclassified by a source classifier are often misclassified as well by classifiers with different architectures. However, targeted adversarial examples -- optimized to be classified as a chosen target class -- tend to be less transferable between architectures. While prior research on… ▽ More

    Submitted 25 October, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: NeurIPS '21

  5. arXiv:2102.05110  [pdf, other

    cs.LG

    Adversarial Perturbations Are Not So Weird: Entanglement of Robust and Non-Robust Features in Neural Network Classifiers

    Authors: Jacob M. Springer, Melanie Mitchell, Garrett T. Kenyon

    Abstract: Neural networks trained on visual data are well-known to be vulnerable to often imperceptible adversarial perturbations. The reasons for this vulnerability are still being debated in the literature. Recently Ilyas et al. (2019) showed that this vulnerability arises, in part, because neural network classifiers rely on highly predictive but brittle "non-robust" features. In this paper we extend the… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 20 pages, 14 figures, 6 tables

  6. arXiv:2009.13562  [pdf, other

    cs.LG cs.CR stat.ML

    STRATA: Simple, Gradient-Free Attacks for Models of Code

    Authors: Jacob M. Springer, Bryn Marie Reinstadler, Una-May O'Reilly

    Abstract: Neural networks are well-known to be vulnerable to imperceptible perturbations in the input, called adversarial examples, that result in misclassification. Generating adversarial examples for source code poses an additional challenge compared to the domains of images and natural language, because source code perturbations must retain the functional meaning of the code. We identify a striking relat… ▽ More

    Submitted 19 August, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: KDD'21 AdvML Workshop

  7. arXiv:2009.01398  [pdf, other

    cs.LG stat.ML

    It's Hard for Neural Networks To Learn the Game of Life

    Authors: Jacob M. Springer, Garrett T. Kenyon

    Abstract: Efforts to improve the learning abilities of neural networks have focused mostly on the role of optimization methods rather than on weight initializations. Recent findings, however, suggest that neural networks rely on lucky random initial weights of subnetworks called "lottery tickets" that converge quickly to a solution. To investigate how weight initializations affect performance, we examine sm… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: 12 pages, 6 figures

  8. arXiv:1811.07211  [pdf, other

    cs.LG cs.CR cs.CV cs.NE stat.ML

    Classifiers Based on Deep Sparse Coding Architectures are Robust to Deep Learning Transferable Examples

    Authors: Jacob M. Springer, Charles S. Strauss, Austin M. Thresher, Edward Kim, Garrett T. Kenyon

    Abstract: Although deep learning has shown great success in recent years, researchers have discovered a critical flaw where small, imperceptible changes in the input to the system can drastically change the output classification. These attacks are exploitable in nearly all of the existing deep learning classification frameworks. However, the susceptibility of deep sparse coding models to adversarial example… ▽ More

    Submitted 20 November, 2018; v1 submitted 17 November, 2018; originally announced November 2018.

    Comments: 8 pages, 8 figures, fixed typos