Skip to main content

Showing 1–10 of 10 results for author: Huertas-Tato, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.09874  [pdf, other

    cs.CL

    Camouflage is all you need: Evaluating and Enhancing Language Model Robustness Against Camouflage Adversarial Attacks

    Authors: Álvaro Huertas-García, Alejandro Martín, Javier Huertas-Tato, David Camacho

    Abstract: Adversarial attacks represent a substantial challenge in Natural Language Processing (NLP). This study undertakes a systematic exploration of this challenge in two distinct phases: vulnerability evaluation and resilience enhancement of Transformer-based models under adversarial attacks. In the evaluation phase, we assess the susceptibility of three Transformer configurations, encoder-decoder, en… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 19 pages, 8 figures, 5 tables

  2. arXiv:2310.11081  [pdf, other

    cs.CL cs.SI

    Understanding writing style in social media with a supervised contrastively pre-trained transformer

    Authors: Javier Huertas-Tato, Alejandro Martin, David Camacho

    Abstract: Online Social Networks serve as fertile ground for harmful behavior, ranging from hate speech to the dissemination of disinformation. Malicious actors now have unprecedented freedom to misbehave, leading to severe societal unrest and dire consequences, as exemplified by events such as the Capitol assault during the US presidential election and the Antivaxx movement during the COVID-19 pandemic. Un… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  3. arXiv:2306.05045  [pdf, other

    cs.CV cs.AI eess.IV

    Spain on Fire: A novel wildfire risk assessment model based on image satellite processing and atmospheric information

    Authors: Helena Liz-López, Javier Huertas-Tato, Jorge Pérez-Aracil, Carlos Casanova-Mateo, Julia Sanz-Justo, David Camacho

    Abstract: Each year, wildfires destroy larger areas of Spain, threatening numerous ecosystems. Humans cause 90% of them (negligence or provoked) and the behaviour of individuals is unpredictable. However, atmospheric and environmental variables affect the spread of wildfires, and they can be analysed by using deep learning. In order to mitigate the damage of these events we proposed the novel Wildfire Asses… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  4. arXiv:2209.15373  [pdf, other

    cs.CL

    PART: Pre-trained Authorship Representation Transformer

    Authors: Javier Huertas-Tato, Alvaro Huertas-Garcia, Alejandro Martin, David Camacho

    Abstract: Authors writing documents imprint identifying information within their texts: vocabulary, registry, punctuation, misspellings, or even emoji usage. Finding these details is very relevant to profile authors, relating back to their gender, occupation, age, and so on. But most importantly, repeating writing patterns can help attributing authorship to a text. Previous works use hand-crafted features o… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

  5. arXiv:2207.14408  [pdf, other

    eess.IV cs.CV cs.LG

    Deep learning for understanding multilabel imbalanced Chest X-ray datasets

    Authors: Helena Liz, Javier Huertas-Tato, Manuel Sánchez-Montañés, Javier Del Ser, David Camacho

    Abstract: Over the last few years, convolutional neural networks (CNNs) have dominated the field of computer vision thanks to their ability to extract features and their outstanding performance in classification problems, for example in the automatic analysis of X-rays. Unfortunately, these neural networks are considered black-box algorithms, i.e. it is impossible to understand how the algorithm has achieve… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  6. arXiv:2204.08415  [pdf, other

    cs.CL

    Exploring Dimensionality Reduction Techniques in Multilingual Transformers

    Authors: Álvaro Huertas-García, Alejandro Martín, Javier Huertas-Tato, David Camacho

    Abstract: Both in scientific literature and in industry,, Semantic and context-aware Natural Language Processing-based solutions have been gaining importance in recent years. The possibilities and performance shown by these models when dealing with complex Language Understanding tasks is unquestionable, from conversational agents to the fight against disinformation in social networks. In addition, considera… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 22 pages, 4 figures and 8 tables

  7. arXiv:2204.03465  [pdf, other

    cs.CL cs.LG

    BERTuit: Understanding Spanish language in Twitter through a native transformer

    Authors: Javier Huertas-Tato, Alejandro Martin, David Camacho

    Abstract: The appearance of complex attention-based language models such as BERT, Roberta or GPT-3 has allowed to address highly complex tasks in a plethora of scenarios. However, when applied to specific domains, these models encounter considerable difficulties. This is the case of Social Networks such as Twitter, an ever-changing stream of information written with informal and complex language, where each… ▽ More

    Submitted 13 June, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Support: 1) BBVA FOUNDATION - CIVIC, 2) Spanish Ministry of Science and Innovation - FightDIS (PID2020-117263GB-100) and XAI-Disinfodemics (PLEC2021-007681), 3) Comunidad Autonoma de Madrid - S2018/TCS-4566, 4) European Comission - IBERIFIER (2020-EU-IA-0252), 5) Digital Future Society (Mobile World Capital Barcelona) - DisTrack, 6) UPM - Programa de Excelencia para el Profesorado Universitario

  8. arXiv:2110.14532  [pdf, other

    cs.CL

    FacTeR-Check: Semi-automated fact-checking through Semantic Similarity and Natural Language Inference

    Authors: Alejandro Martín, Javier Huertas-Tato, Álvaro Huertas-García, Guillermo Villar-Rodríguez, David Camacho

    Abstract: Our society produces and shares overwhelming amounts of information through Online Social Networks (OSNs). Within this environment, misinformation and disinformation have proliferated, becoming a public safety concern in most countries. Allowing the public and professionals to efficiently find reliable evidences about the factual veracity of a claim is a crucial step to mitigate this harmful sprea… ▽ More

    Submitted 16 February, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

  9. arXiv:2103.09635  [pdf, other

    cs.CL cs.LG

    SILT: Efficient transformer training for inter-lingual inference

    Authors: Javier Huertas-Tato, Alejandro Martín, David Camacho

    Abstract: The ability of transformers to perform precision tasks such as question answering, Natural Language Inference (NLI) or summarising, have enabled them to be ranked as one of the best paradigm to address Natural Language Processing (NLP) tasks. NLI is one of the best scenarios to test these architectures, due to the knowledge required to understand complex sentences and established relationships bet… ▽ More

    Submitted 17 May, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: This research is funded by the project CIVIC: Intelligent characterisation of the veracity of the information related to COVID-19, granted by BBVA FOUNDATION GRANTS FOR SCIENTIFIC RESEARCH TEAMS SARS-CoV-2 and COVID-19

  10. arXiv:2012.11049  [pdf, other

    cs.CV cs.AI

    Fusing CNNs and statistical indicators to improve image classification

    Authors: Javier Huertas-Tato, Alejandro Martín, Julián Fierrez, David Camacho

    Abstract: Convolutional Networks have dominated the field of computer vision for the last ten years, exhibiting extremely powerful feature extraction capabilities and outstanding classification performance. The main strategy to prolong this trend relies on further upscaling networks in size. However, costs increase rapidly while performance improvements may be marginal. We hypothesise that adding heterogene… ▽ More

    Submitted 4 June, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

    Comments: 16 pages