Showing 1–3 of 3 results for author: Bair, A

  1. arXiv:2307.04317  [pdf, other]

    cs.CV cs.LG

    Text Descriptions are Compressive and Invariant Representations for Visual Learning

    Authors: Zhili Feng, Anna Bair, J. Zico Kolter

    Abstract: Modern image classification is based upon directly predicting classes via large discriminative networks, which do not directly contain information about the intuitive visual features that may constitute a classification decision. Recently, work in vision-language models (VLM) such as CLIP has provided ways to specify natural language descriptions of image classes, but typically focuses on providin…

    Submitted 30 October, 2023; v1 submitted 9 July, 2023; originally announced July 2023.

  2. arXiv:2306.14306  [pdf, other]

    cs.LG cs.CV

    Adaptive Sharpness-Aware Pruning for Robust Sparse Networks

    Authors: Anna Bair, Hongxu Yin, Maying Shen, Pavlo Molchanov, Jose Alvarez

    Abstract: Robustness and compactness are two essential attributes of deep learning models that are deployed in the real world. The goals of robustness and compactness may seem to be at odds, since robustness requires generalization across domains, while the process of compression exploits specificity in one domain. We introduce Adaptive Sharpness-Aware Pruning (AdaSAP), which unifies these goals through the…

    Submitted 13 March, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

  3. arXiv:2306.11695  [pdf, other]

    cs.CL cs.AI cs.LG

    A Simple and Effective Pruning Approach for Large Language Models

    Authors: Mingjie Sun, Zhuang Liu, Anna Bair, J. Zico Kolter

    Abstract: As their size increases, Large Language Models (LLMs) are natural candidates for network pruning methods: approaches that drop a subset of network weights while striving to preserve performance. Existing methods, however, require either retraining, which is rarely affordable for billion-scale LLMs, or solving a weight reconstruction problem reliant on second-order information, which may also be c…

    Submitted 6 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: ICLR 2024. Website at https://eric-mingjie.github.io/wanda/home.html