Skip to main content

Showing 1–18 of 18 results for author: Noy, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.11479  [pdf, other

    cs.SD cs.CV eess.AS

    End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network

    Authors: Avi Gazneli, Gadi Zimerman, Tal Ridnik, Gilad Sharir, Asaf Noy

    Abstract: While efficient architectures and a plethora of augmentations for end-to-end image classification tasks have been suggested and heavily investigated, state-of-the-art techniques for audio classifications still rely on numerous representations of the audio signal together with large architectures, fine-tuned from large datasets. By utilizing the inherited lightweight nature of audio and novel audio… ▽ More

    Submitted 5 July, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

  2. arXiv:2204.09134  [pdf, other

    cs.CV cs.LG

    Diverse Imagenet Models Transfer Better

    Authors: Niv Nayman, Avram Golbert, Asaf Noy, Tan **, Lihi Zelnik-Manor

    Abstract: A commonly accepted hypothesis is that models with higher accuracy on Imagenet perform better on other downstream tasks, leading to much research dedicated to optimizing Imagenet accuracy. Recently this hypothesis has been challenged by evidence showing that self-supervised models transfer better than their supervised counterparts, despite their inferior Imagenet accuracy. This calls for identifyi… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    MSC Class: 68T07; 68T10; 68T45 ACM Class: I.2.10; I.2.6; I.4.10

  3. arXiv:2204.03475  [pdf, other

    cs.CV cs.LG

    Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results

    Authors: Tal Ridnik, Hussam Lawen, Emanuel Ben-Baruch, Asaf Noy

    Abstract: ImageNet serves as the primary dataset for evaluating the quality of computer-vision models. The common practice today is training each architecture with a tailor-made scheme, designed and tuned by an expert. In this paper, we present a unified scheme for training any backbone on ImageNet. The scheme, named USI (Unified Scheme for ImageNet), is based on knowledge distillation and modern tricks. It… ▽ More

    Submitted 12 May, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  4. arXiv:2111.12933  [pdf, other

    cs.CV cs.LG

    ML-Decoder: Scalable and Versatile Classification Head

    Authors: Tal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben-Baruch, Asaf Noy

    Abstract: In this paper, we introduce ML-Decoder, a new attention-based classification head. ML-Decoder predicts the existence of class labels via queries, and enables better utilization of spatial data compared to global average pooling. By redesigning the decoder architecture, and using a novel group-decoding scheme, ML-Decoder is highly efficient, and can scale well to thousands of classes. Compared to u… ▽ More

    Submitted 31 December, 2021; v1 submitted 25 November, 2021; originally announced November 2021.

  5. arXiv:2110.12399  [pdf, other

    cs.LG cs.AI cs.CV math.OC stat.ML

    BINAS: Bilinear Interpretable Neural Architecture Search

    Authors: Niv Nayman, Yonathan Aflalo, Asaf Noy, Rong **, Lihi Zelnik-Manor

    Abstract: Practical use of neural networks often involves requirements on latency, energy and memory among others. A popular approach to find networks under such requirements is through constrained Neural Architecture Search (NAS). However, previous methods use complicated predictors for the accuracy of the network. Those predictors are hard to interpret and sensitive to many hyperparameters to be tuned, he… ▽ More

    Submitted 27 April, 2022; v1 submitted 24 October, 2021; originally announced October 2021.

    Comments: The full code is released at https://github.com/Alibaba-MIIL/BINAS

    MSC Class: 68T09; 68T45 ACM Class: G.1.6; G.3; I.2.8; I.2.10; I.5.1

  6. arXiv:2110.10955  [pdf, ps, other

    cs.CV

    Multi-label Classification with Partial Annotations using Class-aware Selective Loss

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different properties on the model and impact its accuracy. In this work, we analyze the partial labeling problem, then propose a solution based on two key ideas. First, un… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  7. arXiv:2109.12499  [pdf, other

    cs.CV cs.LG

    PETA: Photo Albums Event Recognition using Transformers Attention

    Authors: Tamar Glaser, Emanuel Ben-Baruch, Gilad Sharir, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: In recent years the amounts of personal photos captured increased significantly, giving rise to new challenges in multi-image understanding and high-level image understanding. Event recognition in personal photo albums presents one challenging scenario where life events are recognized from a disordered collection of images, including both relevant and irrelevant images. Event recognition in images… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 8 pages, 10 including references, 3 figures, was submitted to WACV 2022

  8. arXiv:2104.10972  [pdf, ps, other

    cs.CV cs.LG

    ImageNet-21K Pretraining for the Masses

    Authors: Tal Ridnik, Emanuel Ben-Baruch, Asaf Noy, Lihi Zelnik-Manor

    Abstract: ImageNet-1K serves as the primary dataset for pretraining deep learning models for computer vision tasks. ImageNet-21K dataset, which is bigger and more diverse, is used less frequently for pretraining, mainly due to its complexity, low accessibility, and underestimation of its added value. This paper aims to close this gap, and make high-quality efficient pretraining on ImageNet-21K available for… ▽ More

    Submitted 5 August, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to NeurIPS 2021 (Datasets and Benchmarks)

  9. arXiv:2103.13915  [pdf, other

    cs.CV

    An Image is Worth 16x16 Words, What is a Video Worth?

    Authors: Gilad Sharir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Leading methods in the domain of action recognition try to distill information from both the spatial and temporal dimensions of an input video. Methods that reach State of the Art (SotA) accuracy, usually make use of 3D convolution layers as a way to abstract the temporal information from video frames. The use of such convolutions requires sampling short clips from the input video, where each clip… ▽ More

    Submitted 27 May, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  10. arXiv:2102.11646  [pdf, other

    cs.LG cs.AI cs.CV math.OC stat.ML

    HardCoRe-NAS: Hard Constrained diffeRentiable Neural Architecture Search

    Authors: Niv Nayman, Yonathan Aflalo, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Realistic use of neural networks often requires adhering to multiple constraints on latency, energy and memory among others. A popular approach to find fitting networks is through constrained Neural Architecture Search (NAS), however, previous methods enforce the constraint only softly. Therefore, the resulting networks do not exactly adhere to the resource constraint and their accuracy is harmed.… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Niv Nayman and Yonathan Aflalo contributed equally. An implementation of HardCoRe-NAS is available at: https://github.com/Alibaba-MIIL/HardCoReNAS

    MSC Class: 68T09; 68T45 ACM Class: G.1.6; G.3; I.2.8; I.2.10; I.5.1

  11. arXiv:2101.04243  [pdf, other

    cs.LG math.OC stat.ML

    A Convergence Theory Towards Practical Over-parameterized Deep Neural Networks

    Authors: Asaf Noy, Yi Xu, Yonathan Aflalo, Lihi Zelnik-Manor, Rong **

    Abstract: Deep neural networks' remarkable ability to correctly fit training data when optimized by gradient-based algorithms is yet to be fully understood. Recent theoretical results explain the convergence for ReLU networks that are wider than those used in practice by orders of magnitude. In this work, we take a step towards closing the gap between theory and practice by significantly improving the known… ▽ More

    Submitted 8 February, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

  12. arXiv:2010.01267  [pdf, ps, other

    cs.LG cs.CV math.OC stat.ML

    WeMix: How to Better Utilize Data Augmentation

    Authors: Yi Xu, Asaf Noy, Ming Lin, Qi Qian, Hao Li, Rong **

    Abstract: Data augmentation is a widely used training trick in deep learning to improve the network generalization ability. Despite many encouraging results, several recent studies did point out limitations of the conventional data augmentation scheme in certain scenarios, calling for a better theoretical understanding of data augmentation. In this work, we develop a comprehensive analysis that reveals pros… ▽ More

    Submitted 2 October, 2020; originally announced October 2020.

  13. arXiv:2009.14119  [pdf, ps, other

    cs.CV cs.LG

    Asymmetric Loss For Multi-Label Classification

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor

    Abstract: In a typical multi-label setting, a picture contains on average few positive labels, and many negative ones. This positive-negative imbalance dominates the optimization process, and can lead to under-emphasizing gradients from positive labels during training, resulting in poor accuracy. In this paper, we introduce a novel asymmetric loss ("ASL"), which operates differently on positive and negative… ▽ More

    Submitted 29 July, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to ICCV 2021

    ACM Class: I.2.6; I.2.10; I.0; I.4.0

  14. arXiv:2003.13630  [pdf, other

    cs.CV cs.LG eess.IV

    TResNet: High Performance GPU-Dedicated Architecture

    Authors: Tal Ridnik, Hussam Lawen, Asaf Noy, Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman

    Abstract: Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPS count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work… ▽ More

    Submitted 27 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: 11 pages, 5 figures

  15. arXiv:2002.08258  [pdf, ps, other

    cs.LG stat.ML

    Knapsack Pruning with Inner Distillation

    Authors: Yonathan Aflalo, Asaf Noy, Ming Lin, Itamar Friedman, Lihi Zelnik

    Abstract: Neural network pruning reduces the computational cost of an over-parameterized network to improve its efficiency. Popular methods vary from $\ell_1$-norm sparsification to Neural Architecture Search (NAS). In this work, we propose a novel pruning method that optimizes the final accuracy of the pruned network and distills knowledge from the over-parameterized parent network's inner layers. To enabl… ▽ More

    Submitted 3 June, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

  16. arXiv:1910.00975  [pdf, other

    cs.CR

    Eradicating Attacks on the Internal Network with Internal Network Policy

    Authors: Yehuda Afek, Anat Bremler-Barr, Alon Noy

    Abstract: In this paper we present three attacks on private internal networks behind a NAT and a corresponding new protection mechanism, Internal Network Policy, to mitigate a wide range of attacks that penetrate internal networks behind a NAT. In the attack scenario, a victim is tricked to visit the attacker's website, which contains a malicious script that lets the attacker access the victim's internal ne… ▽ More

    Submitted 3 October, 2019; v1 submitted 2 October, 2019; originally announced October 2019.

  17. arXiv:1906.08031  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    XNAS: Neural Architecture Search with Expert Advice

    Authors: Niv Nayman, Asaf Noy, Tal Ridnik, Itamar Friedman, Rong **, Lihi Zelnik-Manor

    Abstract: This paper introduces a novel optimization method for differential neural architecture search, based on the theory of prediction with expert advice. Its optimization criterion is well fitted for an architecture-selection, i.e., it minimizes the regret incurred by a sub-optimal selection of operations. Unlike previous search relaxations, that require hard pruning of architectures, our method is des… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  18. arXiv:1904.04123  [pdf, other

    stat.ML cs.LG

    ASAP: Architecture Search, Anneal and Prune

    Authors: Asaf Noy, Niv Nayman, Tal Ridnik, Nadav Zamir, Sivan Doveh, Itamar Friedman, Raja Giryes, Lihi Zelnik-Manor

    Abstract: Automatic methods for Neural Architecture Search (NAS) have been shown to produce state-of-the-art network models. Yet, their main drawback is the computational complexity of the search process. As some primal methods optimized over a discrete search space, thousands of days of GPU were required for convergence. A recent approach is based on constructing a differentiable search space that enables… ▽ More

    Submitted 10 October, 2019; v1 submitted 8 April, 2019; originally announced April 2019.