Showing 1–8 of 8 results for author: Ben-Baruch, E

  1. arXiv:2403.07854  [pdf, other]

    cs.CV cs.LG

    Distilling the Knowledge in Data Pruning

    Authors: Emanuel Ben-Baruch, Adam Botach, Igor Kviatkovsky, Manoj Aggarwal, Gérard Medioni

    Abstract: With the increasing size of datasets used for training neural networks, data pruning becomes an attractive field of research. However, most current data pruning algorithms are limited in their ability to preserve accuracy compared to models trained on the full data, especially in high pruning regimes. In this paper we explore the application of data pruning while incorporating knowledge distillati…

    Submitted 12 March, 2024; originally announced March 2024.

  2. arXiv:2204.03475  [pdf, other]

    cs.CV cs.LG

    Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results

    Authors: Tal Ridnik, Hussam Lawen, Emanuel Ben-Baruch, Asaf Noy

    Abstract: ImageNet serves as the primary dataset for evaluating the quality of computer-vision models. The common practice today is training each architecture with a tailor-made scheme, designed and tuned by an expert. In this paper, we present a unified scheme for training any backbone on ImageNet. The scheme, named USI (Unified Scheme for ImageNet), is based on knowledge distillation and modern tricks. It…

    Submitted 12 May, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  3. arXiv:2201.06945  [pdf, ps, other]

    cs.CV

    It's All in the Head: Representation Knowledge Distillation through Classifier Sharing

    Authors: Emanuel Ben-Baruch, Matan Karklinsky, Yossi Biton, Avi Ben-Cohen, Hussam Lawen, Nadav Zamir

    Abstract: Representation knowledge distillation aims at transferring rich information from one model to another. Common approaches for representation distillation mainly focus on the direct minimization of distance metrics between the models' embedding vectors. Such direct methods may be limited in transferring high-order dependencies embedded in the representation vectors, or in handling the capacity gap b…

    Submitted 5 April, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

  4. arXiv:2111.12933  [pdf, other]

    cs.CV cs.LG

    ML-Decoder: Scalable and Versatile Classification Head

    Authors: Tal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben-Baruch, Asaf Noy

    Abstract: In this paper, we introduce ML-Decoder, a new attention-based classification head. ML-Decoder predicts the existence of class labels via queries, and enables better utilization of spatial data compared to global average pooling. By redesigning the decoder architecture, and using a novel group-decoding scheme, ML-Decoder is highly efficient, and can scale well to thousands of classes. Compared to u…

    Submitted 31 December, 2021; v1 submitted 25 November, 2021; originally announced November 2021.

  5. arXiv:2110.10955  [pdf, ps, other]

    cs.CV

    Multi-label Classification with Partial Annotations using Class-aware Selective Loss

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different properties on the model and impact its accuracy. In this work, we analyze the partial labeling problem, then propose a solution based on two key ideas. First, un…

    Submitted 21 October, 2021; originally announced October 2021.

  6. arXiv:2109.12499  [pdf, other]

    cs.CV cs.LG

    PETA: Photo Albums Event Recognition using Transformers Attention

    Authors: Tamar Glaser, Emanuel Ben-Baruch, Gilad Sharir, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: In recent years the amounts of personal photos captured increased significantly, giving rise to new challenges in multi-image understanding and high-level image understanding. Event recognition in personal photo albums presents one challenging scenario where life events are recognized from a disordered collection of images, including both relevant and irrelevant images. Event recognition in images…

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 8 pages, 10 including references, 3 figures, was submitted to WACV 2022

  7. arXiv:2104.10972  [pdf, ps, other]

    cs.CV cs.LG

    ImageNet-21K Pretraining for the Masses

    Authors: Tal Ridnik, Emanuel Ben-Baruch, Asaf Noy, Lihi Zelnik-Manor

    Abstract: ImageNet-1K serves as the primary dataset for pretraining deep learning models for computer vision tasks. ImageNet-21K dataset, which is bigger and more diverse, is used less frequently for pretraining, mainly due to its complexity, low accessibility, and underestimation of its added value. This paper aims to close this gap, and make high-quality efficient pretraining on ImageNet-21K available for…

    Submitted 5 August, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to NeurIPS 2021 (Datasets and Benchmarks)

  8. arXiv:2009.14119  [pdf, ps, other]

    cs.CV cs.LG

    Asymmetric Loss For Multi-Label Classification

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor

    Abstract: In a typical multi-label setting, a picture contains on average few positive labels, and many negative ones. This positive-negative imbalance dominates the optimization process, and can lead to under-emphasizing gradients from positive labels during training, resulting in poor accuracy. In this paper, we introduce a novel asymmetric loss ("ASL"), which operates differently on positive and negative…

    Submitted 29 July, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to ICCV 2021

    ACM Class: I.2.6; I.2.10; I.0; I.4.0