Skip to main content

Showing 1–10 of 10 results for author: Ridnik, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.08500  [pdf, other

    cs.LG cs.CL cs.SE

    Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering

    Authors: Tal Ridnik, Dedy Kredo, Itamar Friedman

    Abstract: Code generation problems differ from common natural language problems - they require matching the exact syntax of the target language, identifying happy paths and edge cases, paying attention to numerous small details in the problem spec, and addressing other code-specific issues and requirements. Hence, many of the optimizations and tricks that have been successful in natural language generation… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  2. arXiv:2204.11479  [pdf, other

    cs.SD cs.CV eess.AS

    End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network

    Authors: Avi Gazneli, Gadi Zimerman, Tal Ridnik, Gilad Sharir, Asaf Noy

    Abstract: While efficient architectures and a plethora of augmentations for end-to-end image classification tasks have been suggested and heavily investigated, state-of-the-art techniques for audio classifications still rely on numerous representations of the audio signal together with large architectures, fine-tuned from large datasets. By utilizing the inherited lightweight nature of audio and novel audio… ▽ More

    Submitted 5 July, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

  3. arXiv:2204.03475  [pdf, other

    cs.CV cs.LG

    Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results

    Authors: Tal Ridnik, Hussam Lawen, Emanuel Ben-Baruch, Asaf Noy

    Abstract: ImageNet serves as the primary dataset for evaluating the quality of computer-vision models. The common practice today is training each architecture with a tailor-made scheme, designed and tuned by an expert. In this paper, we present a unified scheme for training any backbone on ImageNet. The scheme, named USI (Unified Scheme for ImageNet), is based on knowledge distillation and modern tricks. It… ▽ More

    Submitted 12 May, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  4. arXiv:2111.12933  [pdf, other

    cs.CV cs.LG

    ML-Decoder: Scalable and Versatile Classification Head

    Authors: Tal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben-Baruch, Asaf Noy

    Abstract: In this paper, we introduce ML-Decoder, a new attention-based classification head. ML-Decoder predicts the existence of class labels via queries, and enables better utilization of spatial data compared to global average pooling. By redesigning the decoder architecture, and using a novel group-decoding scheme, ML-Decoder is highly efficient, and can scale well to thousands of classes. Compared to u… ▽ More

    Submitted 31 December, 2021; v1 submitted 25 November, 2021; originally announced November 2021.

  5. arXiv:2110.10955  [pdf, ps, other

    cs.CV

    Multi-label Classification with Partial Annotations using Class-aware Selective Loss

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different properties on the model and impact its accuracy. In this work, we analyze the partial labeling problem, then propose a solution based on two key ideas. First, un… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  6. arXiv:2104.10972  [pdf, ps, other

    cs.CV cs.LG

    ImageNet-21K Pretraining for the Masses

    Authors: Tal Ridnik, Emanuel Ben-Baruch, Asaf Noy, Lihi Zelnik-Manor

    Abstract: ImageNet-1K serves as the primary dataset for pretraining deep learning models for computer vision tasks. ImageNet-21K dataset, which is bigger and more diverse, is used less frequently for pretraining, mainly due to its complexity, low accessibility, and underestimation of its added value. This paper aims to close this gap, and make high-quality efficient pretraining on ImageNet-21K available for… ▽ More

    Submitted 5 August, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to NeurIPS 2021 (Datasets and Benchmarks)

  7. arXiv:2009.14119  [pdf, ps, other

    cs.CV cs.LG

    Asymmetric Loss For Multi-Label Classification

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor

    Abstract: In a typical multi-label setting, a picture contains on average few positive labels, and many negative ones. This positive-negative imbalance dominates the optimization process, and can lead to under-emphasizing gradients from positive labels during training, resulting in poor accuracy. In this paper, we introduce a novel asymmetric loss ("ASL"), which operates differently on positive and negative… ▽ More

    Submitted 29 July, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to ICCV 2021

    ACM Class: I.2.6; I.2.10; I.0; I.4.0

  8. arXiv:2003.13630  [pdf, other

    cs.CV cs.LG eess.IV

    TResNet: High Performance GPU-Dedicated Architecture

    Authors: Tal Ridnik, Hussam Lawen, Asaf Noy, Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman

    Abstract: Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPS count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work… ▽ More

    Submitted 27 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: 11 pages, 5 figures

  9. arXiv:1906.08031  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    XNAS: Neural Architecture Search with Expert Advice

    Authors: Niv Nayman, Asaf Noy, Tal Ridnik, Itamar Friedman, Rong **, Lihi Zelnik-Manor

    Abstract: This paper introduces a novel optimization method for differential neural architecture search, based on the theory of prediction with expert advice. Its optimization criterion is well fitted for an architecture-selection, i.e., it minimizes the regret incurred by a sub-optimal selection of operations. Unlike previous search relaxations, that require hard pruning of architectures, our method is des… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  10. arXiv:1904.04123  [pdf, other

    stat.ML cs.LG

    ASAP: Architecture Search, Anneal and Prune

    Authors: Asaf Noy, Niv Nayman, Tal Ridnik, Nadav Zamir, Sivan Doveh, Itamar Friedman, Raja Giryes, Lihi Zelnik-Manor

    Abstract: Automatic methods for Neural Architecture Search (NAS) have been shown to produce state-of-the-art network models. Yet, their main drawback is the computational complexity of the search process. As some primal methods optimized over a discrete search space, thousands of days of GPU were required for convergence. A recent approach is based on constructing a differentiable search space that enables… ▽ More

    Submitted 10 October, 2019; v1 submitted 8 April, 2019; originally announced April 2019.