Skip to main content

Showing 1–5 of 5 results for author: Baruch, E B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.11120  [pdf, other

    cs.CV

    TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing

    Authors: Sherry X. Chen, Yaron Vaxman, Elad Ben Baruch, David Asulin, Aviad Moreshet, Kuo-Chin Lien, Misha Sra, Pradeep Sen

    Abstract: Despite many attempts to leverage pre-trained text-to-image models (T2I) like Stable Diffusion (SD) for controllable image editing, producing good predictable results remains a challenge. Previous approaches have focused on either fine-tuning pre-trained T2I models on specific datasets to generate certain kinds of images (e.g., with a specific object or person), or on optimizing the weights, text… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Conference on Computer Vision and Pattern Recognition (CVPR) 2024

  2. arXiv:2310.19024  [pdf, other

    cs.CV

    FPGAN-Control: A Controllable Fingerprint Generator for Training with Synthetic Data

    Authors: Alon Shoshan, Nadav Bhonker, Emanuel Ben Baruch, Ori Nizan, Igor Kviatkovsky, Joshua Engelsma, Manoj Aggarwal, Gerard Medioni

    Abstract: Training fingerprint recognition models using synthetic data has recently gained increased attention in the biometric community as it alleviates the dependency on sensitive personal data. Existing approaches for fingerprint generation are limited in their ability to generate diverse impressions of the same finger, a key property for providing effective data for training recognition models. To addr… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

  3. arXiv:2105.05926  [pdf, other

    cs.CV

    Semantic Diversity Learning for Zero-Shot Multi-label Classification

    Authors: Avi Ben-Cohen, Nadav Zamir, Emanuel Ben Baruch, Itamar Friedman, Lihi Zelnik-Manor

    Abstract: Training a neural network model for recognizing multiple labels associated with an image, including identifying unseen labels, is challenging, especially for images that portray numerous semantically diverse labels. As challenging as this task is, it is an essential task to tackle since it represents many real-world cases, such as image retrieval of natural images. We argue that using a single emb… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  4. arXiv:2003.13630  [pdf, other

    cs.CV cs.LG eess.IV

    TResNet: High Performance GPU-Dedicated Architecture

    Authors: Tal Ridnik, Hussam Lawen, Asaf Noy, Emanuel Ben Baruch, Gilad Sharir, Itamar Friedman

    Abstract: Many deep learning models, developed in recent years, reach higher ImageNet accuracy than ResNet50, with fewer or comparable FLOPS count. While FLOPs are often seen as a proxy for network efficiency, when measuring actual GPU training and inference throughput, vanilla ResNet50 is usually significantly faster than its recent competitors, offering better throughput-accuracy trade-off. In this work… ▽ More

    Submitted 27 August, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: 11 pages, 5 figures

  5. arXiv:1810.12941  [pdf, other

    cs.CV

    Joint detection and matching of feature points in multimodal images

    Authors: Elad Ben Baruch, Yosi Keller

    Abstract: In this work, we propose a novel Convolutional Neural Network (CNN) architecture for the joint detection and matching of feature points in images acquired by different sensors using a single forward pass. The resulting feature detector is tightly coupled with the feature descriptor, in contrast to classical approaches (SIFT, etc.), where the detection phase precedes and differs from computing the… ▽ More

    Submitted 16 June, 2021; v1 submitted 30 October, 2018; originally announced October 2018.