Skip to main content

Showing 1–4 of 4 results for author: Viridiano, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (50 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  2. arXiv:2205.11840  [pdf, other

    cs.CL

    Lutma: a Frame-Making Tool for Collaborative FrameNet Development

    Authors: Tiago Timponi Torrent, Arthur Lorenzi, Ely Edison da Silva Matos, Frederico Belcavello, Marcelo Viridiano, Maucha Andrade Gamonal

    Abstract: This paper presents Lutma, a collaborative, semi-constrained, tutorial-based tool for contributing frames and lexical units to the Global FrameNet initiative. The tool parameterizes the process of frame creation, avoiding consistency violations and promoting the integration of frames contributed by the community with existing frames. Lutma is structured in a wizard-like fashion so as to provide us… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted submission for the 1st Workshop on Perspectivist Approaches to NLP (NLPerspectives)

  3. arXiv:2205.11836  [pdf, other

    cs.CL

    Charon: a FrameNet Annotation Tool for Multimodal Corpora

    Authors: Frederico Belcavello, Marcelo Viridiano, Ely Edison Matos, Tiago Timponi Torrent

    Abstract: This paper presents Charon, a web tool for annotating multimodal corpora with FrameNet categories. Annotation can be made for corpora containing both static images and video sequences paired - or not - with text sequences. The pipeline features, besides the annotation interface, corpus import and pre-processing tools.

    Submitted 24 May, 2022; originally announced May 2022.

    Comments: Accepted submission for the The Sixteenth Linguistic Annotation Workshop (LAW-XVI 2022)

  4. arXiv:2205.10902  [pdf, other

    cs.CL

    The Case for Perspective in Multimodal Datasets

    Authors: Marcelo Viridiano, Tiago Timponi Torrent, Oliver Czulo, Arthur Lorenzi Almeida, Ely Edison da Silva Matos, Frederico Belcavello

    Abstract: This paper argues in favor of the adoption of annotation practices for multimodal datasets that recognize and represent the inherently perspectivized nature of multimodal communication. To support our claim, we present a set of annotation experiments in which FrameNet annotation is applied to the Multi30k and the Flickr 30k Entities datasets. We assess the cosine similarity between the semantic re… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: Accepted submission for the 1st Workshop on Perspectivist Approaches to NLP (NLPerspectives)