Skip to main content

Showing 1–10 of 10 results for author: Garcia-Ferrero, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.07613  [pdf, other

    cs.CL cs.AI cs.LG

    Medical mT5: An Open-Source Multilingual Text-to-Text LLM for The Medical Domain

    Authors: Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata, Andrea Zaninello

    Abstract: Research on language technology for the development of medical applications is currently a hot topic in Natural Language Understanding and Generation. Thus, a number of large language models (LLMs) have recently been adapted to the medical domain, so that they can be used as a tool for mediating in human-AI interaction. While these LLMs display competitive performance on automated medical texts be… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: LREC-COLING 2024

  2. arXiv:2404.07611  [pdf, other

    cs.CL cs.AI

    NoticIA: A Clickbait Article Summarization Dataset in Spanish

    Authors: Iker García-Ferrero, Begoña Altuna

    Abstract: We present NoticIA, a dataset consisting of 850 Spanish news articles featuring prominent clickbait headlines, each paired with high-quality, single-sentence generative summarizations written by humans. This task demands advanced text understanding and summarization abilities, challenging the models' capacity to infer and connect diverse pieces of information to meet the user's informational needs… ▽ More

    Submitted 31 May, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Accepted in the journal Procesamiento del Lenguaje Natural

  3. arXiv:2310.18018  [pdf, other

    cs.CL

    NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark

    Authors: Oscar Sainz, Jon Ander Campos, Iker García-Ferrero, Julen Etxaniz, Oier Lopez de Lacalle, Eneko Agirre

    Abstract: In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is in trouble. The worst kind of data contamination happens when a Large Language Model (LLM) is trained on the test split of a benchmark, and then evaluated in the same benchmark. The extent of the problem is unknown, as it is not straightforward to measure. Contami… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP2024-Findings

  4. arXiv:2310.15941  [pdf, other

    cs.CL

    This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

    Authors: Iker García-Ferrero, Begoña Altuna, Javier Álvez, Itziar Gonzalez-Dios, German Rigau

    Abstract: Although large language models (LLMs) have apparently acquired a certain level of grammatical knowledge and the ability to make generalizations, they fail to interpret negation, a crucial step in Natural Language Processing. We try to clarify the reasons for the sub-optimal performance of LLMs understanding negation. We introduce a large semi-automatically generated dataset of circa 400,000 descri… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted in the The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)

  5. arXiv:2310.03668  [pdf, other

    cs.CL

    GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

    Authors: Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, Eneko Agirre

    Abstract: Large Language Models (LLMs) combined with instruction tuning have made significant progress when generalizing to unseen tasks. However, they have been less successful in Information Extraction (IE), lagging behind task-specific models. Typically, IE tasks are characterized by complex annotation guidelines that describe the task and give examples to humans. Previous attempts to leverage such infor… ▽ More

    Submitted 6 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: The Twelfth International Conference on Learning Representations - ICLR 2024

  6. arXiv:2306.06029  [pdf, other

    cs.CL cs.AI

    HiTZ@Antidote: Argumentation-driven Explainable Artificial Intelligence for Digital Medicine

    Authors: Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau, Anar Yeginbergenova

    Abstract: Providing high quality explanations for AI predictions based on machine learning is a challenging and complex task. To work well it requires, among other factors: selecting a proper level of generality/specificity of the explanation; considering assumptions about the familiarity of the explanation beneficiary with the AI task under consideration; referring to specific elements that have contribute… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: To appear: In SEPLN 2023: 39th International Conference of the Spanish Society for Natural Language Processing

  7. arXiv:2304.10637  [pdf, other

    cs.CL

    IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases

    Authors: Iker García-Ferrero, Jon Ander Campos, Oscar Sainz, Ander Salaberria, Dan Roth

    Abstract: Named Entity Recognition (NER) is a core natural language processing task in which pre-trained language models have shown remarkable performance. However, standard benchmarks like CoNLL 2003 do not address many of the challenges that deployed NER systems face, such as having to classify emerging or complex entities in a fine-grained way. In this paper we present a novel NER cascade approach compri… ▽ More

    Submitted 27 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: SemEval 2023

  8. arXiv:2212.10548  [pdf, other

    cs.CL

    T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks

    Authors: Iker García-Ferrero, Rodrigo Agerri, German Rigau

    Abstract: In the absence of readily available labeled data for a given sequence labeling task and language, annotation projection has been proposed as one of the possible strategies to automatically generate annotated data. Annotation projection has often been formulated as the task of transporting, on parallel corpora, the labels pertaining to a given span in the source language into its corresponding span… ▽ More

    Submitted 24 October, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Findings of the EMNLP 2023

  9. arXiv:2210.12623  [pdf, other

    cs.CL

    Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings

    Authors: Iker García-Ferrero, Rodrigo Agerri, German Rigau

    Abstract: Zero-resource cross-lingual transfer approaches aim to apply supervised models from a source language to unlabelled target languages. In this paper we perform an in-depth study of the two main techniques employed so far for cross-lingual zero-resource sequence labelling, based either on data or model transfer. Although previous research has proposed translation and annotation projection (data-base… ▽ More

    Submitted 27 April, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Findings of the Association for Computational Linguistics: EMNLP 2022

    Journal ref: Findings of the Association for Computational Linguistics EMNLP 2022, 6403-6416

  10. arXiv:2001.06381  [pdf, other

    cs.CL

    A Common Semantic Space for Monolingual and Cross-Lingual Meta-Embeddings

    Authors: Iker García-Ferrero, Rodrigo Agerri, German Rigau

    Abstract: This paper presents a new technique for creating monolingual and cross-lingual meta-embeddings. Our method integrates multiple word embeddings created from complementary techniques, textual sources, knowledge bases and languages. Existing word vectors are projected to a common semantic space using linear transformations and averaging. With our method the resulting meta-embeddings maintain the dime… ▽ More

    Submitted 8 September, 2021; v1 submitted 17 January, 2020; originally announced January 2020.