Skip to main content

Showing 1–4 of 4 results for author: Jiménez, Á B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.02199  [pdf, other

    cs.CL cs.AI cs.LG

    A Survey of Spanish Clinical Language Models

    Authors: Guillem García Subies, Álvaro Barbero Jiménez, Paloma Martínez Fernández

    Abstract: This survey focuses in encoder Language Models for solving tasks in the clinical domain in the Spanish language. We review the contributions of 17 corpora focused mainly in clinical tasks, then list the most relevant Spanish Language Models and Spanish Clinical Language models. We perform a thorough comparison of these models by benchmarking them over a curated subset of the available corpora, in… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  2. arXiv:2305.14115  [pdf, other

    cs.LG

    RLBoost: Boosting Supervised Models using Deep Reinforcement Learning

    Authors: Eloy Anguiano Batanero, Ángela Fernández Pascual, Álvaro Barbero Jiménez

    Abstract: Data quality or data evaluation is sometimes a task as important as collecting a large volume of data when it comes to generating accurate artificial intelligence models. In fact, being able to evaluate the data can lead to a larger database that is better suited to a particular problem because we have the ability to filter out data obtained automatically of dubious quality. In this paper we prese… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 25 pages, 14 figures

  3. arXiv:2302.02412  [pdf, other

    cs.CV cs.AI cs.LG

    Mixture of Diffusers for scene composition and high resolution image generation

    Authors: Álvaro Barbero Jiménez

    Abstract: Diffusion methods have been proven to be very effective to generate images while conditioning on a text prompt. However, and although the quality of the generated images is unprecedented, these methods seem to struggle when trying to generate specific image compositions. In this paper we present Mixture of Diffusers, an algorithm that builds over existing diffusion models to provide a more detaile… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

    ACM Class: I.2.6

  4. arXiv:2205.10233  [pdf, other

    cs.CL

    RigoBERTa: A State-of-the-Art Language Model For Spanish

    Authors: Alejandro Vaca Serrano, Guillem Garcia Subies, Helena Montoro Zamorano, Nuria Aldama Garcia, Doaa Samy, David Betancur Sanchez, Antonio Moreno Sandoval, Marta Guerrero Nieto, Alvaro Barbero Jimenez

    Abstract: This paper presents RigoBERTa, a State-of-the-Art Language Model for Spanish. RigoBERTa is trained over a well-curated corpus formed up from different subcorpora with key features. It follows the DeBERTa architecture, which has several advantages over other architectures of similar size as BERT or RoBERTa. RigoBERTa performance is assessed over 13 NLU tasks in comparison with other available Spani… ▽ More

    Submitted 3 June, 2022; v1 submitted 27 April, 2022; originally announced May 2022.