
Showing 1–4 of 4 results for author: Lawen, H

  1. arXiv:2204.03475

    cs.CV cs.LG

    Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results

    Authors: Tal Ridnik, Hussam Lawen, Emanuel Ben-Baruch, Asaf Noy

    Abstract: ImageNet serves as the primary dataset for evaluating the quality of computer-vision models. The common practice today is training each architecture with a tailor-made scheme, designed and tuned by an expert. In this paper, we present a unified scheme for training any backbone on ImageNet. The scheme, named USI (Unified Scheme for ImageNet), is based on knowledge distillation and modern tricks. It…

    Submitted 12 May, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  2. arXiv:2201.06945

    cs.CV

    It's All in the Head: Representation Knowledge Distillation through Classifier Sharing

    Authors: Emanuel Ben-Baruch, Matan Karklinsky, Yossi Biton, Avi Ben-Cohen, Hussam Lawen, Nadav Zamir

    Abstract: Representation knowledge distillation aims at transferring rich information from one model to another. Common approaches for representation distillation mainly focus on the direct minimization of distance metrics between the models' embedding vectors. Such direct methods may be limited in transferring high-order dependencies embedded in the representation vectors, or in handling the capacity gap b…

    Submitted 5 April, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

  3. arXiv:2003.13630

    cs.CV cs.LG eess.IV

    TResNet: High Performance GPU-Dedicated Architecture

    Authors: Tal Ridnik, Hussam Lawen, Asaf Noy, Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman

    Abstract: Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPS count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work…

    Submitted 27 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: 11 pages, 5 figures

  4. Compact Network Training for Person ReID

    Authors: Hussam Lawen, Avi Ben-Cohen, Matan Protter, Itamar Friedman, Lihi Zelnik-Manor

    Abstract: The task of person re-identification (ReID) has attracted growing attention in recent years leading to improved performance, albeit with little focus on real-world applications. Most SotA methods are based on heavy pre-trained models, e.g. ResNet50 (~25M parameters), which makes them less practical and more tedious to explore architecture modifications. In this study, we focus on a small-sized ran…

    Submitted 9 April, 2020; v1 submitted 15 October, 2019; originally announced October 2019.