Skip to main content

Showing 1–28 of 28 results for author: Zelnik-Manor, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.08344  [pdf, other

    cs.CV cs.GR cs.LG

    STMPL: Human Soft-Tissue Simulation

    Authors: Anton Agafonov, Lihi Zelnik-Manor

    Abstract: In various applications, such as virtual reality and gaming, simulating the deformation of soft tissues in the human body during interactions with external objects is essential. Traditionally, Finite Element Methods (FEM) have been employed for this purpose, but they tend to be slow and resource-intensive. In this paper, we propose a unified representation of human body shape and soft tissue with… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  2. arXiv:2204.09134  [pdf, other

    cs.CV cs.LG

    Diverse Imagenet Models Transfer Better

    Authors: Niv Nayman, Avram Golbert, Asaf Noy, Tan **, Lihi Zelnik-Manor

    Abstract: A commonly accepted hypothesis is that models with higher accuracy on Imagenet perform better on other downstream tasks, leading to much research dedicated to optimizing Imagenet accuracy. Recently this hypothesis has been challenged by evidence showing that self-supervised models transfer better than their supervised counterparts, despite their inferior Imagenet accuracy. This calls for identifyi… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    MSC Class: 68T07; 68T10; 68T45 ACM Class: I.2.10; I.2.6; I.4.10

  3. arXiv:2110.12399  [pdf, other

    cs.LG cs.AI cs.CV math.OC stat.ML

    BINAS: Bilinear Interpretable Neural Architecture Search

    Authors: Niv Nayman, Yonathan Aflalo, Asaf Noy, Rong **, Lihi Zelnik-Manor

    Abstract: Practical use of neural networks often involves requirements on latency, energy and memory among others. A popular approach to find networks under such requirements is through constrained Neural Architecture Search (NAS). However, previous methods use complicated predictors for the accuracy of the network. Those predictors are hard to interpret and sensitive to many hyperparameters to be tuned, he… ▽ More

    Submitted 27 April, 2022; v1 submitted 24 October, 2021; originally announced October 2021.

    Comments: The full code is released at https://github.com/Alibaba-MIIL/BINAS

    MSC Class: 68T09; 68T45 ACM Class: G.1.6; G.3; I.2.8; I.2.10; I.5.1

  4. arXiv:2110.10955  [pdf, ps, other

    cs.CV

    Multi-label Classification with Partial Annotations using Class-aware Selective Loss

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Itamar Friedman, Avi Ben-Cohen, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Large-scale multi-label classification datasets are commonly, and perhaps inevitably, partially annotated. That is, only a small subset of labels are annotated per sample. Different methods for handling the missing labels induce different properties on the model and impact its accuracy. In this work, we analyze the partial labeling problem, then propose a solution based on two key ideas. First, un… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  5. arXiv:2109.12499  [pdf, other

    cs.CV cs.LG

    PETA: Photo Albums Event Recognition using Transformers Attention

    Authors: Tamar Glaser, Emanuel Ben-Baruch, Gilad Sharir, Nadav Zamir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: In recent years the amounts of personal photos captured increased significantly, giving rise to new challenges in multi-image understanding and high-level image understanding. Event recognition in personal photo albums presents one challenging scenario where life events are recognized from a disordered collection of images, including both relevant and irrelevant images. Event recognition in images… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

    Comments: 8 pages, 10 including references, 3 figures, was submitted to WACV 2022

  6. arXiv:2105.05926  [pdf, other

    cs.CV

    Semantic Diversity Learning for Zero-Shot Multi-label Classification

    Authors: Avi Ben-Cohen, Nadav Zamir, Emanuel Ben Baruch, Itamar Friedman, Lihi Zelnik-Manor

    Abstract: Training a neural network model for recognizing multiple labels associated with an image, including identifying unseen labels, is challenging, especially for images that portray numerous semantically diverse labels. As challenging as this task is, it is an essential task to tackle since it represents many real-world cases, such as image retrieval of natural images. We argue that using a single emb… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  7. arXiv:2104.10972  [pdf, ps, other

    cs.CV cs.LG

    ImageNet-21K Pretraining for the Masses

    Authors: Tal Ridnik, Emanuel Ben-Baruch, Asaf Noy, Lihi Zelnik-Manor

    Abstract: ImageNet-1K serves as the primary dataset for pretraining deep learning models for computer vision tasks. ImageNet-21K dataset, which is bigger and more diverse, is used less frequently for pretraining, mainly due to its complexity, low accessibility, and underestimation of its added value. This paper aims to close this gap, and make high-quality efficient pretraining on ImageNet-21K available for… ▽ More

    Submitted 5 August, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted to NeurIPS 2021 (Datasets and Benchmarks)

  8. arXiv:2103.13915  [pdf, other

    cs.CV

    An Image is Worth 16x16 Words, What is a Video Worth?

    Authors: Gilad Sharir, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Leading methods in the domain of action recognition try to distill information from both the spatial and temporal dimensions of an input video. Methods that reach State of the Art (SotA) accuracy, usually make use of 3D convolution layers as a way to abstract the temporal information from video frames. The use of such convolutions requires sampling short clips from the input video, where each clip… ▽ More

    Submitted 27 May, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  9. arXiv:2102.11646  [pdf, other

    cs.LG cs.AI cs.CV math.OC stat.ML

    HardCoRe-NAS: Hard Constrained diffeRentiable Neural Architecture Search

    Authors: Niv Nayman, Yonathan Aflalo, Asaf Noy, Lihi Zelnik-Manor

    Abstract: Realistic use of neural networks often requires adhering to multiple constraints on latency, energy and memory among others. A popular approach to find fitting networks is through constrained Neural Architecture Search (NAS), however, previous methods enforce the constraint only softly. Therefore, the resulting networks do not exactly adhere to the resource constraint and their accuracy is harmed.… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Niv Nayman and Yonathan Aflalo contributed equally. An implementation of HardCoRe-NAS is available at: https://github.com/Alibaba-MIIL/HardCoReNAS

    MSC Class: 68T09; 68T45 ACM Class: G.1.6; G.3; I.2.8; I.2.10; I.5.1

  10. arXiv:2101.04243  [pdf, other

    cs.LG math.OC stat.ML

    A Convergence Theory Towards Practical Over-parameterized Deep Neural Networks

    Authors: Asaf Noy, Yi Xu, Yonathan Aflalo, Lihi Zelnik-Manor, Rong **

    Abstract: Deep neural networks' remarkable ability to correctly fit training data when optimized by gradient-based algorithms is yet to be fully understood. Recent theoretical results explain the convergence for ReLU networks that are wider than those used in practice by orders of magnitude. In this work, we take a step towards closing the gap between theory and practice by significantly improving the known… ▽ More

    Submitted 8 February, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

  11. arXiv:2009.14119  [pdf, ps, other

    cs.CV cs.LG

    Asymmetric Loss For Multi-Label Classification

    Authors: Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor

    Abstract: In a typical multi-label setting, a picture contains on average few positive labels, and many negative ones. This positive-negative imbalance dominates the optimization process, and can lead to under-emphasizing gradients from positive labels during training, resulting in poor accuracy. In this paper, we introduce a novel asymmetric loss ("ASL"), which operates differently on positive and negative… ▽ More

    Submitted 29 July, 2021; v1 submitted 29 September, 2020; originally announced September 2020.

    Comments: Accepted to ICCV 2021

    ACM Class: I.2.6; I.2.10; I.0; I.4.0

  12. arXiv:1912.11850  [pdf, other

    cs.CV

    Graph Embedded Pose Clustering for Anomaly Detection

    Authors: Amir Markovitz, Gilad Sharir, Itamar Friedman, Lihi Zelnik-Manor, Shai Avidan

    Abstract: We propose a new method for anomaly detection of human actions. Our method works directly on human pose graphs that can be computed from an input video sequence. This makes the analysis independent of nuisance parameters such as viewpoint or illumination. We map these graphs to a latent space and cluster them. Each action is then represented by its soft-assignment to each of the clusters. This giv… ▽ More

    Submitted 10 April, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

    Comments: Code is available at https://github.com/amirmk89/gepc. CVPR 2020

  13. Compact Network Training for Person ReID

    Authors: Hussam Lawen, Avi Ben-Cohen, Matan Protter, Itamar Friedman, Lihi Zelnik-Manor

    Abstract: The task of person re-identification (ReID) has attracted growing attention in recent years leading to improved performance, albeit with little focus on real-world applications. Most SotA methods are based on heavy pre-trained models, e.g. ResNet50 (~25M parameters), which makes them less practical and more tedious to explore architecture modifications. In this study, we focus on a small-sized ran… ▽ More

    Submitted 9 April, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

  14. arXiv:1906.08031  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    XNAS: Neural Architecture Search with Expert Advice

    Authors: Niv Nayman, Asaf Noy, Tal Ridnik, Itamar Friedman, Rong **, Lihi Zelnik-Manor

    Abstract: This paper introduces a novel optimization method for differential neural architecture search, based on the theory of prediction with expert advice. Its optimization criterion is well fitted for an architecture-selection, i.e., it minimizes the regret incurred by a sub-optimal selection of operations. Unlike previous search relaxations, that require hard pruning of architectures, our method is des… ▽ More

    Submitted 19 June, 2019; originally announced June 2019.

  15. arXiv:1904.04123  [pdf, other

    stat.ML cs.LG

    ASAP: Architecture Search, Anneal and Prune

    Authors: Asaf Noy, Niv Nayman, Tal Ridnik, Nadav Zamir, Sivan Doveh, Itamar Friedman, Raja Giryes, Lihi Zelnik-Manor

    Abstract: Automatic methods for Neural Architecture Search (NAS) have been shown to produce state-of-the-art network models. Yet, their main drawback is the computational complexity of the search process. As some primal methods optimized over a discrete search space, thousands of days of GPU were required for convergence. A recent approach is based on constructing a differentiable search space that enables… ▽ More

    Submitted 10 October, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

  16. arXiv:1901.11420  [pdf, other

    cs.CV

    Is Image Memorability Prediction Solved?

    Authors: Shay Perera, Ayellet Tal, Lihi Zelnik-Manor

    Abstract: This paper deals with the prediction of the memorability of a given image. We start by proposing an algorithm that reaches human-level performance on the LaMem dataset - the only large scale benchmark for memorability prediction. The suggested algorithm is based on three observations we make regarding convolutional neural networks (CNNs) that affect memorability prediction. Having reached human-le… ▽ More

    Submitted 31 January, 2019; originally announced January 2019.

  17. arXiv:1811.08760  [pdf, other

    cs.CV

    Dynamic-Net: Tuning the Objective Without Re-training for Synthesis Tasks

    Authors: Alon Shoshan, Roey Mechrez, Lihi Zelnik-Manor

    Abstract: One of the key ingredients for successful optimization of modern CNNs is identifying a suitable objective. To date, the objective is fixed a-priori at training time, and any variation to it requires re-training a new network. In this paper we present a first attempt at alleviating the need for re-training. Rather than fixing the network at training time, we train a "Dynamic-Net" that can be modifi… ▽ More

    Submitted 25 August, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: version update

  18. arXiv:1811.08126  [pdf, other

    cs.CV

    Adversarial Feedback Loop

    Authors: Firas Shama, Roey Mechrez, Alon Shoshan, Lihi Zelnik-Manor

    Abstract: Thanks to their remarkable generative capabilities, GANs have gained great popularity, and are used abundantly in state-of-the-art methods and applications. In a GAN based model, a discriminator is trained to learn the real data distribution. To date, it has been used only for training purposes, where it's utilized to train the generator to provide real-looking outputs. In this paper we propose a… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

  19. arXiv:1809.07517  [pdf, other

    cs.CV

    The 2018 PIRM Challenge on Perceptual Image Super-resolution

    Authors: Yochai Blau, Roey Mechrez, Radu Timofte, Tomer Michaeli, Lihi Zelnik-Manor

    Abstract: This paper reports on the 2018 PIRM challenge on perceptual super-resolution (SR), held in conjunction with the Perceptual Image Restoration and Manipulation (PIRM) workshop at ECCV 2018. In contrast to previous SR challenges, our evaluation methodology jointly quantifies accuracy and perceptual quality, therefore enabling perceptual-driven methods to compete alongside algorithms that target PSNR… ▽ More

    Submitted 31 January, 2019; v1 submitted 20 September, 2018; originally announced September 2018.

    Comments: Workshop and Challenge on Perceptual Image Restoration and Manipulation in conjunction with ECCV 2018 webpage: https://www.pirm2018.org/

    Journal ref: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018

  20. arXiv:1803.04626  [pdf, other

    cs.CV

    Maintaining Natural Image Statistics with the Contextual Loss

    Authors: Roey Mechrez, Itamar Talmi, Firas Shama, Lihi Zelnik-Manor

    Abstract: Maintaining natural image statistics is a crucial factor in restoration and generation of realistic looking images. When training CNNs, photorealism is usually attempted by adversarial training (GAN), that pushes the output images to lie on the manifold of natural images. GANs are very powerful, but not perfect. They are hard to train and the results still often suffer from artifacts. In this pape… ▽ More

    Submitted 18 July, 2018; v1 submitted 13 March, 2018; originally announced March 2018.

  21. arXiv:1803.02077  [pdf, other

    cs.CV cs.LG

    The Contextual Loss for Image Transformation with Non-Aligned Data

    Authors: Roey Mechrez, Itamar Talmi, Lihi Zelnik-Manor

    Abstract: Feed-forward CNNs trained for image transformation problems rely on loss functions that measure the similarity between the generated image and a target image. Most of the common loss functions assume that these images are spatially aligned and compare pixels at corresponding locations. However, for many tasks, aligned training pairs of images will not be available. We present an alternative loss f… ▽ More

    Submitted 18 July, 2018; v1 submitted 6 March, 2018; originally announced March 2018.

    Comments: ECCV Oral. Paper web page: http://cgm.technion.ac.il/Computer-Graphics-Multimedia/Software/contextual/

  22. arXiv:1709.09828  [pdf, other

    cs.CV

    Photorealistic Style Transfer with Screened Poisson Equation

    Authors: Roey Mechrez, Eli Shechtman, Lihi Zelnik-Manor

    Abstract: Recent work has shown impressive success in transferring painterly style to images. These approaches, however, fall short of photorealistic style transfer. Even when both the input and reference images are photographs, the output still exhibits distortions reminiscent of a painting. In this paper we propose an approach that takes as input a stylized image and makes it more photorealistic. It relie… ▽ More

    Submitted 28 September, 2017; originally announced September 2017.

    Comments: presented in BMVC 2017

  23. arXiv:1612.02190  [pdf, other

    cs.CV

    Template Matching with Deformable Diversity Similarity

    Authors: Itamar Talmi, Roey Mechrez, Lihi Zelnik-Manor

    Abstract: We propose a novel measure for template matching named Deformable Diversity Similarity -- based on the diversity of feature matches between a target image window and the template. We rely on both local appearance and geometric information that jointly lead to a powerful approach for matching. Our key contribution is a similarity measure, that is robust to complex deformations, significant backgrou… ▽ More

    Submitted 18 April, 2017; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: accepted to CVPR2017 (spotlight)

  24. arXiv:1612.02184  [pdf, other

    cs.CV

    Saliency Driven Image Manipulation

    Authors: Roey Mechrez, Eli Shechtman, Lihi Zelnik-Manor

    Abstract: Have you ever taken a picture only to find out that an unimportant background object ended up being overly salient? Or one of those team sports photos where your favorite player blends with the rest? Wouldn't it be nice if you could tweak these pictures just a little bit so that the distractor would be attenuated and your favorite player will stand-out among her peers? Manipulating images in order… ▽ More

    Submitted 17 January, 2018; v1 submitted 7 December, 2016; originally announced December 2016.

    Comments: to appear in WACV'18

  25. arXiv:1508.07953  [pdf, other

    cs.CV

    Approximate Nearest Neighbor Fields in Video

    Authors: Nir Ben-Zrihem, Lihi Zelnik-Manor

    Abstract: We introduce RIANN (Ring Intersection Approximate Nearest Neighbor search), an algorithm for matching patches of a video to a set of reference patches in real-time. For each query, RIANN finds potential matches by intersecting rings around key points in appearance space. Its search complexity is reversely correlated to the amount of temporal change, making it a good fit for videos, where typically… ▽ More

    Submitted 31 August, 2015; originally announced August 2015.

    Comments: A CVPR 2015 oral paper

  26. arXiv:1204.3367  [pdf, other

    cs.SI cs.HC

    Crowdsourcing Gaze Data Collection

    Authors: Dmitry Rudoy, Dan B. Goldman, Eli Shechtman, Lihi Zelnik-Manor

    Abstract: Knowing where people look is a useful tool in many various image and video applications. However, traditional gaze tracking hardware is expensive and requires local study participants, so acquiring gaze location data from a large number of participants is very problematic. In this work we propose a crowdsourced method for acquisition of gaze direction data from a virtually unlimited number of part… ▽ More

    Submitted 16 April, 2012; originally announced April 2012.

    Comments: Presented at Collective Intelligence conference, 2012 (arXiv:1204.2991)

    Report number: CollectiveIntelligence/2012/106

  27. arXiv:1009.1533  [pdf, ps, other

    cs.IT

    Sensing Matrix Optimization for Block-Sparse Decoding

    Authors: Kevin Rosenblum, Lihi Zelnik-Manor, Yonina C. Eldar

    Abstract: Recent work has demonstrated that using a carefully designed sensing matrix rather than a random one, can improve the performance of compressed sensing. In particular, a well-designed sensing matrix can reduce the coherence between the atoms of the equivalent dictionary, and as a consequence, reduce the reconstruction error. In some applications, the signals of interest can be well approximated by… ▽ More

    Submitted 8 September, 2010; originally announced September 2010.

  28. arXiv:1005.0202  [pdf, ps, other

    cs.IT

    Dictionary Optimization for Block-Sparse Representations

    Authors: Kevin Rosenblum, Lihi Zelnik-Manor, Yonina C. Eldar

    Abstract: Recent work has demonstrated that using a carefully designed dictionary instead of a predefined one, can improve the sparsity in jointly representing a class of signals. This has motivated the derivation of learning methods for designing a dictionary which leads to the sparsest representation for a given set of signals. In some applications, the signals of interest can have further structure, so t… ▽ More

    Submitted 3 May, 2010; originally announced May 2010.

    Comments: submitted to IEEE Transactions on Signal Processing