Skip to main content

Showing 1–4 of 4 results for author: Barrena, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.07373  [pdf, other

    cs.CL

    EriBERTa: A Bilingual Pre-Trained Language Model for Clinical Natural Language Processing

    Authors: Iker de la Iglesia, Aitziber Atutxa, Koldo Gojenola, Ander Barrena

    Abstract: The utilization of clinical reports for various secondary purposes, including health research and treatment monitoring, is crucial for enhancing patient care. Natural Language Processing (NLP) tools have emerged as valuable assets for extracting and processing relevant information from these reports. However, the availability of specialized language models for the clinical domain in Spanish has be… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  2. arXiv:2109.03659  [pdf, other

    cs.CL

    Label Verbalization and Entailment for Effective Zero- and Few-Shot Relation Extraction

    Authors: Oscar Sainz, Oier Lopez de Lacalle, Gorka Labaka, Ander Barrena, Eneko Agirre

    Abstract: Relation extraction systems require large amounts of labeled examples which are costly to annotate. In this work we reformulate relation extraction as an entailment task, with simple, hand-made, verbalizations of relations produced in less than 15 min per relation. The system relies on a pretrained textual entailment engine which is run as-is (no training examples, zero-shot) or further fine-tuned… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP2021

  3. arXiv:2004.00033  [pdf, ps, other

    cs.CL

    Give your Text Representation Models some Love: the Case for Basque

    Authors: Rodrigo Agerri, IƱaki San Vicente, Jon Ander Campos, Ander Barrena, Xabier Saralegi, Aitor Soroa, Eneko Agirre

    Abstract: Word embeddings and pre-trained language models allow to build rich representations of text and have enabled improvements across most NLP tasks. Unfortunately they are very expensive to train, and many small companies and research groups tend to use models that have been pre-trained and made available by third parties, rather than building their own. This is suboptimal as, for many languages, the… ▽ More

    Submitted 2 April, 2020; v1 submitted 31 March, 2020; originally announced April 2020.

    Comments: Accepted at LREC 2020; 8 pages, 7 tables

  4. arXiv:1503.01655  [pdf, other

    cs.CL

    Studying the Wikipedia Hyperlink Graph for Relatedness and Disambiguation

    Authors: Eneko Agirre, Ander Barrena, Aitor Soroa

    Abstract: Hyperlinks and other relations in Wikipedia are a extraordinary resource which is still not fully understood. In this paper we study the different types of links in Wikipedia, and contrast the use of the full graph with respect to just direct links. We apply a well-known random walk algorithm on two tasks, word relatedness and named-entity disambiguation. We show that using the full graph is more… ▽ More

    Submitted 12 March, 2015; v1 submitted 5 March, 2015; originally announced March 2015.