Skip to main content

Showing 1–1 of 1 results for author: Madvil, N

.
  1. arXiv:2307.04532  [pdf, other

    cs.CV cs.AI cs.CL eess.AS

    Read, Look or Listen? What's Needed for Solving a Multimodal Dataset

    Authors: Netta Madvil, Yonatan Bitton, Roy Schwartz

    Abstract: The prevalence of large-scale multimodal datasets presents unique challenges in assessing dataset quality. We propose a two-step method to analyze multimodal datasets, which leverages a small seed of human annotation to map each multimodal instance to the modalities required to process it. Our method sheds light on the importance of different modalities in datasets, as well as the relationship bet… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.