Skip to main content

Showing 1–2 of 2 results for author: Barazani, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17423  [pdf, other

    cs.CV cs.CL

    Privacy-Aware Visual Language Models

    Authors: Laurens Samson, Nimrod Barazani, Sennay Ghebreab, Yuki M. Asano

    Abstract: This paper aims to advance our understanding of how Visual Language Models (VLMs) handle privacy-sensitive information, a crucial concern as these technologies become integral to everyday life. To this end, we introduce a new benchmark PrivBench, which contains images from 8 sensitive categories such as passports, or fingerprints. We evaluate 10 state-of-the-art VLMs on this benchmark and observe… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: preprint

  2. arXiv:2402.08657  [pdf, other

    cs.CV

    PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs

    Authors: Michael Dorkenwald, Nimrod Barazani, Cees G. M. Snoek, Yuki M. Asano

    Abstract: Vision-Language Models (VLMs), such as Flamingo and GPT-4V, have shown immense potential by integrating large language models with vision systems. Nevertheless, these models face challenges in the fundamental computer vision task of object localisation, due to their training on multimodal data containing mostly captions without explicit spatial grounding. While it is possible to construct custom,… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.