
Showing 1–8 of 8 results for author: Ferret, O

  1. arXiv:2401.05736  [pdf, other]

    cs.CL cs.IR

    Cross-modal Retrieval for Knowledge-based Visual Question Answering

    Authors: Paul Lerner, Olivier Ferret, Camille Guinaudeau

    Abstract: Knowledge-based Visual Question Answering about Named Entities is a challenging task that requires retrieving information from a multimodal Knowledge Base. Named entities have diverse visual representations and are therefore difficult to recognize. We argue that cross-modal retrieval may help bridge the semantic gap between an entity and its depictions, and is foremost complementary with mono-moda…

    Submitted 11 January, 2024; originally announced January 2024.

  2. arXiv:2312.09670  [pdf, other]

    cs.CL cs.IR

    Probing Pretrained Language Models with Hierarchy Properties

    Authors: Jesús Lovón-Melgarejo, Jose G. Moreno, Romaric Besançon, Olivier Ferret, Lynda Tamine

    Abstract: Since Pretrained Language Models (PLMs) are the cornerstone of the most recent Information Retrieval (IR) models, the way they encode semantic knowledge is particularly important. However, little attention has been given to studying the PLMs' capability to capture hierarchical semantic knowledge. Traditionally, evaluating such knowledge encoded in PLMs relies on their performance on a task-depende…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at ECIR 2024

  3. arXiv:2307.05134  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    TIAM -- A Metric for Evaluating Alignment in Text-to-Image Generation

    Authors: Paul Grimal, Hervé Le Borgne, Olivier Ferret, Julien Tourille

    Abstract: The progress in the generation of synthetic images has made it crucial to assess their quality. While several metrics have been proposed to assess the rendering of images, it is crucial for Text-to-Image (T2I) models, which generate images based on a prompt, to consider additional aspects such as to which extent the generated image matches the important content of the prompt. Moreover, although th…

    Submitted 2 January, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 2890-2899

  4. arXiv:2301.04366  [pdf, other]

    cs.CL cs.IR cs.LG cs.MM

    Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering

    Authors: Paul Lerner, Olivier Ferret, Camille Guinaudeau

    Abstract: We present a new pre-training method, Multimodal Inverse Cloze Task, for Knowledge-based Visual Question Answering about named Entities (KVQAE). KVQAE is a recently introduced task that consists in answering questions about named entities grounded in a visual context using a Knowledge Base. Therefore, the interaction between the modalities is paramount to retrieve information and must be captured…

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted at ECIR 2023

  5. arXiv:2111.12174  [pdf, other]

    cs.CL

    Using Distributional Principles for the Semantic Study of Contextual Language Models

    Authors: Olivier Ferret

    Abstract: Many studies were recently done for investigating the properties of contextual language models but surprisingly, only a few of them consider the properties of these models in terms of semantic similarity. In this article, we first focus on these properties for English by exploiting the distributional principle of substitution as a probing mechanism in the controlled context of SemCor and WordNet p…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: PACLIC 35

  6. Multimodal Entity Linking for Tweets

    Authors: Omar Adjali, Romaric Besançon, Olivier Ferret, Herve Le Borgne, Brigitte Grau

    Abstract: In many information extraction applications, entity linking (EL) has emerged as a crucial task that allows leveraging information about named entities from a knowledge base. In this paper, we address the task of multimodal entity linking (MEL), an emerging research field in which textual and visual information is used to map an ambiguous mention to an entity in a knowledge base (KB). First, we pro…

    Submitted 7 April, 2021; originally announced April 2021.

    Journal ref: In: Jose J. et al. (eds) Advances in Information Retrieval. ECIR 2020. Lecture Notes in Computer Science, vol 12035. Springer, Cham

  7. arXiv:2010.10392  [pdf, other]

    cs.CL

    CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters

    Authors: Hicham El Boukkouri, Olivier Ferret, Thomas Lavergne, Hiroshi Noji, Pierre Zweigenbaum, Junichi Tsujii

    Abstract: Due to the compelling improvements brought by BERT, many recent representation models adopted the Transformer architecture as their main building block, consequently inheriting the wordpiece tokenization system despite it not being intrinsically linked to the notion of Transformers. While this system is thought to achieve a good balance between the flexibility of characters and the efficiency of f…

    Submitted 31 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: 13 pages, 8 figures and 3 tables. Accepted at COLING 2020

  8. arXiv:1904.05557  [pdf, other]

    cs.CL

    Searching News Articles Using an Event Knowledge Graph Leveraged by Wikidata

    Authors: Charlotte Rudnik, Thibault Ehrhart, Olivier Ferret, Denis Teyssou, Raphaël Troncy, Xavier Tannier

    Abstract: News agencies produce thousands of multimedia stories describing events happening in the world that are either scheduled such as sports competitions, political summits and elections, or breaking events such as military conflicts, terrorist attacks, natural disasters, etc. When writing up those stories, journalists refer to contextual background and to compare with past similar events. However, sea…

    Submitted 11 April, 2019; originally announced April 2019.

    Journal ref: WikiWorkshop at The Web Conference 2019