Skip to main content

Showing 1–1 of 1 results for author: Parolari, L

.
  1. arXiv:2305.10913  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.MM

    Weakly-Supervised Visual-Textual Grounding with Semantic Prior Refinement

    Authors: Davide Rigoni, Luca Parolari, Luciano Serafini, Alessandro Sperduti, Lamberto Ballan

    Abstract: Using only image-sentence pairs, weakly-supervised visual-textual grounding aims to learn region-phrase correspondences of the respective entity mentions. Compared to the supervised approach, learning is more difficult since bounding boxes and textual phrases correspondences are unavailable. In light of this, we propose the Semantic Prior Refinement Model (SPRM), whose predictions are obtained by… ▽ More

    Submitted 26 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.