
Showing 1–12 of 12 results for author: Friedman, I

  1. arXiv:2405.00505  [pdf, other]

    cs.IR cs.LG

    KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents

    Authors: Oshri Naparstek, Roi Pony, Inbar Shapira, Foad Abo Dahood, Ophir Azulai, Yevgeny Yaroker, Nadav Rubinstein, Maksym Lysak, Peter Staar, Ahmed Nassar, Nikolaos Livathinos, Christoph Auer, Elad Amrani, Idan Friedman, Orit Prince, Yevgeny Burshtein, Adi Raz Goldfarb, Udi Barzelay

    Abstract: In recent years, the challenge of extracting information from business documents has emerged as a critical task, finding applications across numerous domains. This effort has attracted substantial interest from both industry and academia, highlighting its significance in the current technological landscape. Most datasets in this area are primarily focused on Key Information Extraction (KIE), where…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted to ICDAR 2024

  2. arXiv:2401.08500  [pdf, other]

    cs.LG cs.CL cs.SE

    Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering

    Authors: Tal Ridnik, Dedy Kredo, Itamar Friedman

    Abstract: Code generation problems differ from common natural language problems - they require matching the exact syntax of the target language, identifying happy paths and edge cases, paying attention to numerous small details in the problem spec, and addressing other code-specific issues and requirements. Hence, many of the optimizations and tricks that have been successful in natural language generation…

    Submitted 16 January, 2024; originally announced January 2024.

  3. arXiv:2110.10955  [pdf, ps, other]

    cs.CV

    Multi-label Classification with Partial Annotations using Class-aware Selective Loss

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different properties on the model and impact its accuracy. In this work, we analyze the partial labeling problem, then propose a solution based on two key ideas. First, un…

    Submitted 21 October, 2021; originally announced October 2021.

  4. arXiv:2105.05926  [pdf, other]

    cs.CV

    Semantic Diversity Learning for Zero-Shot Multi-label Classification

    Authors: Avi Ben-Cohen, Nadav Zamir, Emanuel Ben Baruch, Itamar Friedman, Lihi Zelnik-Manor

    Abstract: Training a neural network model for recognizing multiple labels associated with an image, including identifying unseen labels, is challenging, especially for images that portray numerous semantically diverse labels. As challenging as this task is, it is an essential task to tackle since it represents many real-world cases, such as image retrieval of natural images. We argue that using a single emb…

    Submitted 12 May, 2021; originally announced May 2021.

  5. arXiv:2009.14119  [pdf, ps, other]

    cs.CV cs.LG

    Asymmetric Loss For Multi-Label Classification

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor

    Abstract: In a typical multi-label setting, a picture contains on average few positive labels, and many negative ones. This positive-negative imbalance dominates the optimization process, and can lead to under-emphasizing gradients from positive labels during training, resulting in poor accuracy. In this paper, we introduce a novel asymmetric loss ("ASL"), which operates differently on positive and negative…

    Submitted 29 July, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to ICCV 2021

    ACM Class: I.2.6; I.2.10; I.0; I.4.0

  6. arXiv:2003.13630  [pdf, other]

    cs.CV cs.LG eess.IV

    TResNet: High Performance GPU-Dedicated Architecture

    Authors: Tal Ridnik, Hussam Lawen, Asaf Noy, Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman

    Abstract: Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPs count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work…

    Submitted 27 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: 11 pages, 5 figures

  7. arXiv:2002.08258  [pdf, ps, other]

    cs.LG stat.ML

    Knapsack Pruning with Inner Distillation

    Authors: Yonathan Aflalo, Asaf Noy, Ming Lin, Itamar Friedman, Lihi Zelnik

    Abstract: Neural network pruning reduces the computational cost of an over-parameterized network to improve its efficiency. Popular methods vary from $\ell_1$-norm sparsification to Neural Architecture Search (NAS). In this work, we propose a novel pruning method that optimizes the final accuracy of the pruned network and distills knowledge from the over-parameterized parent network's inner layers. To enabl…

    Submitted 3 June, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

  8. arXiv:1912.11850  [pdf, other]

    cs.CV

    Graph Embedded Pose Clustering for Anomaly Detection

    Authors: Amir Markovitz, Gilad Sharir, Itamar Friedman, Lihi Zelnik-Manor, Shai Avidan

    Abstract: We propose a new method for anomaly detection of human actions. Our method works directly on human pose graphs that can be computed from an input video sequence. This makes the analysis independent of nuisance parameters such as viewpoint or illumination. We map these graphs to a latent space and cluster them. Each action is then represented by its soft-assignment to each of the clusters. This giv…

    Submitted 10 April, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

    Comments: Code is available at https://github.com/amirmk89/gepc. CVPR 2020

  9. Compact Network Training for Person ReID

    Authors: Hussam Lawen, Avi Ben-Cohen, Matan Protter, Itamar Friedman, Lihi Zelnik-Manor

    Abstract: The task of person re-identification (ReID) has attracted growing attention in recent years leading to improved performance, albeit with little focus on real-world applications. Most SotA methods are based on heavy pre-trained models, e.g. ResNet50 (~25M parameters), which makes them less practical and more tedious to explore architecture modifications. In this study, we focus on a small-sized ran…

    Submitted 9 April, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

  10. arXiv:1906.08031  [pdf, other]

    cs.LG cs.CV math.OC stat.ML

    XNAS: Neural Architecture Search with Expert Advice

    Authors: Niv Nayman, Asaf Noy, Tal Ridnik, Itamar Friedman, Rong Jin, Lihi Zelnik-Manor

    Abstract: This paper introduces a novel optimization method for differential neural architecture search, based on the theory of prediction with expert advice. Its optimization criterion is well fitted for an architecture-selection, i.e., it minimizes the regret incurred by a sub-optimal selection of operations. Unlike previous search relaxations, that require hard pruning of architectures, our method is des…

    Submitted 19 June, 2019; originally announced June 2019.

  11. arXiv:1904.04123  [pdf, other]

    stat.ML cs.LG

    ASAP: Architecture Search, Anneal and Prune

    Authors: Asaf Noy, Niv Nayman, Tal Ridnik, Nadav Zamir, Sivan Doveh, Itamar Friedman, Raja Giryes, Lihi Zelnik-Manor

    Abstract: Automatic methods for Neural Architecture Search (NAS) have been shown to produce state-of-the-art network models. Yet, their main drawback is the computational complexity of the search process. As some primal methods optimized over a discrete search space, thousands of days of GPU were required for convergence. A recent approach is based on constructing a differentiable search space that enables…

    Submitted 10 October, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

  12. arXiv:1707.06545  [pdf, other]

    cs.CV

    Video Object Segmentation using Tracked Object Proposals

    Authors: Gilad Sharir, Eddie Smolyansky, Itamar Friedman

    Abstract: We present an approach to semi-supervised video object segmentation, in the context of the DAVIS 2017 challenge. Our approach combines category-based object detection, category-independent object appearance segmentation and temporal object tracking. We are motivated by the fact that the object's semantic category tends not to change throughout the video while its appearance and location can vary co…

    Submitted 20 July, 2017; originally announced July 2017.

    Comments: All authors contributed equally, CVPR-2017 workshop, DAVIS-2017 Challenge