Skip to main content

Showing 1–16 of 16 results for author: de Lacalle, O L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07302  [pdf, ps, other

    cs.CL cs.AI cs.LG

    BertaQA: How Much Do Language Models Know About Local Culture?

    Authors: Julen Etxaniz, Gorka Azkune, Aitor Soroa, Oier Lopez de Lacalle, Mikel Artetxe

    Abstract: Large Language Models (LLMs) exhibit extensive knowledge about the world, but most evaluations have been limited to global or anglocentric subjects. This raises the question of how well these models perform on topics relevant to other cultures, whose presence on the web is not that prominent. To address this gap, we introduce BertaQA, a multiple-choice trivia dataset that is parallel in English an… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2404.06392  [pdf, other

    cs.CL cs.AI

    Event Extraction in Basque: Typologically motivated Cross-Lingual Transfer-Learning Analysis

    Authors: Mikel Zubillaga, Oscar Sainz, Ainara Estarrona, Oier Lopez de Lacalle, Eneko Agirre

    Abstract: Cross-lingual transfer-learning is widely used in Event Extraction for low-resource languages and involves a Multilingual Language Model that is trained in a source language and applied to the target language. This paper studies whether the typological similarity between source and target languages impacts the performance of cross-lingual transfer, an under-explored topic. We first focus on Basque… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted at LREC-Coling 2024

  3. arXiv:2403.00587  [pdf, other

    cs.CV cs.AI

    Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset

    Authors: Ander Salaberria, Gorka Azkune, Oier Lopez de Lacalle, Aitor Soroa, Eneko Agirre, Frank Keller

    Abstract: Existing work has observed that current text-to-image systems do not accurately reflect explicit spatial relations between objects such as 'left of' or 'below'. We hypothesize that this is because explicit spatial relations rarely appear in the image captions used to train these models. We propose an automatic method that, given existing images, generates synthetic captions that contain 14 explici… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 12 pages and 5 figures

  4. arXiv:2310.18018  [pdf, other

    cs.CL

    NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark

    Authors: Oscar Sainz, Jon Ander Campos, Iker García-Ferrero, Julen Etxaniz, Oier Lopez de Lacalle, Eneko Agirre

    Abstract: In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is in trouble. The worst kind of data contamination happens when a Large Language Model (LLM) is trained on the test split of a benchmark, and then evaluated in the same benchmark. The extent of the problem is unknown, as it is not straightforward to measure. Contami… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP2024-Findings

  5. arXiv:2310.03668  [pdf, other

    cs.CL

    GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

    Authors: Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, Eneko Agirre

    Abstract: Large Language Models (LLMs) combined with instruction tuning have made significant progress when generalizing to unseen tasks. However, they have been less successful in Information Extraction (IE), lagging behind task-specific models. Typically, IE tasks are characterized by complex annotation guidelines that describe the task and give examples to humans. Previous attempts to leverage such infor… ▽ More

    Submitted 6 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: The Twelfth International Conference on Learning Representations - ICLR 2024

  6. arXiv:2308.01223  [pdf, other

    cs.CL cs.AI cs.LG

    Do Multilingual Language Models Think Better in English?

    Authors: Julen Etxaniz, Gorka Azkune, Aitor Soroa, Oier Lopez de Lacalle, Mikel Artetxe

    Abstract: Translate-test is a popular technique to improve the performance of multilingual language models. This approach works by translating the input into English using an external machine translation system, and running inference over the translated input. However, these improvements can be attributed to the use of a separate translation system, which is typically trained on large amounts of parallel da… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  7. arXiv:2302.03353  [pdf, other

    cs.CL

    What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories

    Authors: Oscar Sainz, Oier Lopez de Lacalle, Eneko Agirre, German Rigau

    Abstract: Language Models are the core for almost any Natural Language Processing system nowadays. One of their particularities is their contextualized representations, a game changer feature when a disambiguation between word senses is necessary. In this paper we aim to explore to what extent language models are capable of discerning among senses at inference time. We performed this analysis by prompting c… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: Presented at GWC2023

  8. arXiv:2205.01376  [pdf, other

    cs.CL

    Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning

    Authors: Oscar Sainz, Itziar Gonzalez-Dios, Oier Lopez de Lacalle, Bonan Min, Eneko Agirre

    Abstract: Recent work has shown that NLP tasks such as Relation Extraction (RE) can be recasted as Textual Entailment tasks using verbalizations, with strong performance in zero-shot and few-shot settings thanks to pre-trained entailment models. The fact that relations in current RE datasets are easily verbalized casts doubts on whether entailment would be effective in more complex tasks. In this work we sh… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: Accepted as Findings of NAACL2022

  9. arXiv:2203.13602  [pdf, other

    cs.CL

    ZS4IE: A toolkit for Zero-Shot Information Extraction with simple Verbalizations

    Authors: Oscar Sainz, Haoling Qiu, Oier Lopez de Lacalle, Eneko Agirre, Bonan Min

    Abstract: The current workflow for Information Extraction (IE) analysts involves the definition of the entities/relations of interest and a training corpus with annotated examples. In this demonstration we introduce a new workflow where the analyst directly verbalizes the entities/relations, which are then used by a Textual Entailment model to perform zero-shot IE. We present the design and implementation o… ▽ More

    Submitted 3 May, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: Accepted at NAACL2022 Demo track

  10. Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering

    Authors: Ander Salaberria, Gorka Azkune, Oier Lopez de Lacalle, Aitor Soroa, Eneko Agirre

    Abstract: Integrating outside knowledge for reasoning in visio-linguistic tasks such as visual question answering (VQA) is an open problem. Given that pretrained language models have been shown to include world knowledge, we propose to use a unimodal (text-only) train and inference procedure based on automatic off-the-shelf captioning of images and pretrained language models. Our results on a visual questio… ▽ More

    Submitted 25 March, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: Under review. 25 pages with 4 figures

    Journal ref: Expert Systems with Applications, Volume 212, 2023, 118669

  11. arXiv:2109.03659  [pdf, other

    cs.CL

    Label Verbalization and Entailment for Effective Zero- and Few-Shot Relation Extraction

    Authors: Oscar Sainz, Oier Lopez de Lacalle, Gorka Labaka, Ander Barrena, Eneko Agirre

    Abstract: Relation extraction systems require large amounts of labeled examples which are costly to annotate. In this work we reformulate relation extraction as an entailment task, with simple, hand-made, verbalizations of relations produced in less than 15 min per relation. The system relies on a pretrained textual entailment engine which is run as-is (no training examples, zero-shot) or further fine-tuned… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP2021

  12. Inferring spatial relations from textual descriptions of images

    Authors: Aitzol Elu, Gorka Azkune, Oier Lopez de Lacalle, Ignacio Arganda-Carreras, Aitor Soroa, Eneko Agirre

    Abstract: Generating an image from its textual description requires both a certain level of language understanding and common sense knowledge about the spatial relations of the physical entities being described. In this work, we focus on inferring the spatial relation between entities, a key step in the process of composing scenes based on text. More specifically, given a caption containing a mention to a s… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted in Pattern Recognition

    Journal ref: Pattern Recognition, Volume 113, 2021, 107847

  13. arXiv:2004.01894  [pdf, other

    cs.CL

    Evaluating Multimodal Representations on Visual Semantic Textual Similarity

    Authors: Oier Lopez de Lacalle, Ander Salaberria, Aitor Soroa, Gorka Azkune, Eneko Agirre

    Abstract: The combination of visual and textual representations has produced excellent results in tasks such as image captioning and visual question answering, but the inference capabilities of multimodal representations are largely untested. In the case of textual representations, inference tasks such as Textual Entailment and Semantic Textual Similarity have been often used to benchmark the quality of tex… ▽ More

    Submitted 4 April, 2020; originally announced April 2020.

    Comments: Accepted in ECAI-2020, 8 pages, 6 tables, 6 figures

  14. arXiv:1809.03695  [pdf, other

    cs.CL cs.AI

    Evaluating Multimodal Representations on Sentence Similarity: vSTS, Visual Semantic Textual Similarity Dataset

    Authors: Oier Lopez de Lacalle, Aitor Soroa, Eneko Agirre

    Abstract: In this paper we introduce vSTS, a new dataset for measuring textual similarity of sentences using multimodal information. The dataset is comprised by images along with its respectively textual captions. We describe the dataset both quantitatively and qualitatively, and claim that it is a valid gold standard for measuring automatic multimodal textual similarity systems. We also describe the initia… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Journal ref: ICCV17: second workshop on Closing the Loop Between Vision and Language. Venice, Italy. 2017

  15. arXiv:1805.04277  [pdf, ps, other

    cs.CL

    The risk of sub-optimal use of Open Source NLP Software: UKB is inadvertently state-of-the-art in knowledge-based WSD

    Authors: Eneko Agirre, Oier López de Lacalle, Aitor Soroa

    Abstract: UKB is an open source collection of programs for performing, among other tasks, knowledge-based Word Sense Disambiguation (WSD). Since it was released in 2009 it has been often used out-of-the-box in sub-optimal settings. We show that nine years later it is the state-of-the-art on knowledge-based WSD. This case shows the pitfalls of releasing open source NLP software without optimal default settin… ▽ More

    Submitted 11 May, 2018; originally announced May 2018.

  16. arXiv:1507.03462  [pdf, other

    cs.CL

    Supervised Hierarchical Classification for Student Answer Scoring

    Authors: Itziar Aldabe, Oier Lopez de Lacalle, Iñigo Lopez-Gazpio, Montse Maritxalar

    Abstract: This paper describes a hierarchical system that predicts one label at a time for automated student response analysis. For the task, we build a classification binary tree that delays more easily confused labels to later stages using hierarchical processes. In particular, the paper describes how the hierarchical classifier has been built and how the classification task has been broken down into bina… ▽ More

    Submitted 13 July, 2015; originally announced July 2015.

    Comments: 5 pages with references