Skip to main content

Showing 1–9 of 9 results for author: Shaharabany, T

.
  1. arXiv:2309.03884  [pdf, other

    cs.SD cs.CL eess.AS

    Zero-Shot Audio Captioning via Audibility Guidance

    Authors: Tal Shaharabany, Ariel Shaulov, Lior Wolf

    Abstract: The task of audio captioning is similar in essence to tasks such as image and video captioning. However, it has received much less attention. We propose three desiderata for captioning audio -- (i) fluency of the generated text, (ii) faithfulness of the generated text to the input audio, and the somewhat related (iii) audibility, which is the quality of being able to be perceived based only on aud… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  2. arXiv:2309.03874  [pdf, other

    cs.CV

    Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks

    Authors: Eyal Gomel, Tal Shaharabany, Lior Wolf

    Abstract: It has been established that training a box-based detector network can enhance the localization performance of weakly supervised and unsupervised methods. Moreover, we extend this understanding by demonstrating that these detectors can be utilized to improve the original network, paving the way for further advancements. To accomplish this, we train the detectors on top of the network output instea… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  3. arXiv:2306.09004  [pdf, other

    eess.IV cs.CV

    Annotator Consensus Prediction for Medical Image Segmentation with Diffusion Models

    Authors: Tomer Amit, Shmuel Shichrur, Tal Shaharabany, Lior Wolf

    Abstract: A major challenge in the segmentation of medical images is the large inter- and intra-observer variability in annotations provided by multiple experts. To address this challenge, we propose a novel method for multi-expert prediction using diffusion models. Our method leverages the diffusion-based approach to incorporate information from multiple annotations and fuse it into a unified segmentation… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2112.00390

  4. arXiv:2306.06370  [pdf, other

    cs.CV

    AutoSAM: Adapting SAM to Medical Images by Overloading the Prompt Encoder

    Authors: Tal Shaharabany, Aviad Dahan, Raja Giryes, Lior Wolf

    Abstract: The recently introduced Segment Anything Model (SAM) combines a clever architecture and large quantities of training data to obtain remarkable image segmentation capabilities. However, it fails to reproduce such results for Out-Of-Distribution (OOD) domains such as medical images. Moreover, while SAM is conditioned on either a mask or a set of points, it may be desirable to have a fully automatic… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

  5. arXiv:2206.09358  [pdf, other

    cs.CV

    What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text Inputs

    Authors: Tal Shaharabany, Yoad Tewel, Lior Wolf

    Abstract: Given an input image, and nothing else, our method returns the bounding boxes of objects in the image and phrases that describe the objects. This is achieved within an open world paradigm, in which the objects in the input image may not have been encountered during the training of the localization mechanism. Moreover, training takes place in a weakly supervised setting, where no bounding boxes are… ▽ More

    Submitted 27 June, 2022; v1 submitted 19 June, 2022; originally announced June 2022.

  6. arXiv:2112.02535  [pdf, other

    cs.CV

    End-to-End Segmentation via Patch-wise Polygons Prediction

    Authors: Tal Shaharabany, Lior Wolf

    Abstract: The leading segmentation methods represent the output map as a pixel grid. We study an alternative representation in which the object edges are modeled, per image patch, as a polygon with $k$ vertices that is coupled with per-patch label probabilities. The vertices are optimized by employing a differentiable neural renderer to create a raster image. The delineated region is then compared with the… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

  7. arXiv:2111.14131  [pdf, other

    cs.CV

    Learning a Weight Map for Weakly-Supervised Localization

    Authors: Tal Shaharabany, Lior Wolf

    Abstract: In the weakly supervised localization setting, supervision is given as an image-level label. We propose to employ an image classifier $f$ and to train a generative network $g$ that outputs, given the input image, a per-pixel weight map that indicates the location of the object within the image. Network $g$ is trained by minimizing the discrepancy between the output of the classifier $f$ on the ori… ▽ More

    Submitted 28 November, 2021; originally announced November 2021.

  8. arXiv:2103.13677  [pdf, other

    eess.IV cs.CV cs.LG

    Explainability Guided Multi-Site COVID-19 CT Classification

    Authors: Ameen Ali, Tal Shaharabany, Lior Wolf

    Abstract: Radiologist examination of chest CT is an effective way for screening COVID-19 cases. In this work, we overcome three challenges in the automation of this process: (i) the limited number of supervised positive cases, (ii) the lack of region-based supervision, and (iii) the variability across acquisition sites. These challenges are met by incorporating a recent augmentation solution called SnapMix,… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

  9. arXiv:1912.00367  [pdf, other

    cs.CV

    End to End Trainable Active Contours via Differentiable Rendering

    Authors: Shir Gur, Tal Shaharabany, Lior Wolf

    Abstract: We present an image segmentation method that iteratively evolves a polygon. At each iteration, the vertices of the polygon are displaced based on the local value of a 2D shift map that is inferred from the input image via an encoder-decoder architecture. The main training loss that is used is the difference between the polygon shape and the ground truth segmentation mask. The network employs a neu… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.