Skip to main content

Showing 1–11 of 11 results for author: Basile, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.09743  [pdf, other

    cs.CL

    Capturing Perspectives of Crowdsourced Annotators in Subjective Learning Tasks

    Authors: Negar Mokhberian, Myrl G. Marmarelis, Frederic R. Hopp, Valerio Basile, Fred Morstatter, Kristina Lerman

    Abstract: Supervised classification heavily depends on datasets annotated by humans. However, in subjective tasks such as toxicity classification, these annotations often exhibit low agreement among raters. Annotations have commonly been aggregated by employing methods like majority voting to determine a single ground truth label. In subjective tasks, aggregating labels will result in biased labeling and, c… ▽ More

    Submitted 16 May, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  2. arXiv:2304.14803  [pdf

    cs.CL

    SemEval-2023 Task 11: Learning With Disagreements (LeWiDi)

    Authors: Elisa Leonardelli, Alexandra Uma, Gavin Abercrombie, Dina Almanea, Valerio Basile, Tommaso Fornaciari, Barbara Plank, Verena Rieser, Massimo Poesio

    Abstract: NLP datasets annotated with human judgments are rife with disagreements between the judges. This is especially true for tasks depending on subjective judgments such as sentiment analysis or offensive language detection. Particularly in these latter cases, the NLP community has come to realize that the approach of 'reconciling' these different subjective interpretations is inappropriate. Many NLP r… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

  3. arXiv:2207.10652  [pdf, other

    cs.CL

    O-Dang! The Ontology of Dangerous Speech Messages

    Authors: Marco A. Stranisci, Simona Frenda, Mirko Lai, Oscar Araque, Alessandra T. Cignarella, Valerio Basile, Viviana Patti, Cristina Bosco

    Abstract: Inside the NLP community there is a considerable amount of language resources created, annotated and released every day with the aim of studying specific linguistic phenomena. Despite a variety of attempts in order to organize such resources has been carried on, a lack of systematic methods and of possible interoperability between resources are still present. Furthermore, when storing linguistic i… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

  4. arXiv:2205.15627  [pdf, other

    cs.CL

    APPReddit: a Corpus of Reddit Posts Annotated for Appraisal

    Authors: Marco Antonio Stranisci, Simona Frenda, Eleonora Ceccaldi, Valerio Basile, Rossana Damiano, Viviana Patti

    Abstract: Despite the large number of computational resources for emotion recognition, there is a lack of data sets relying on appraisal models. According to Appraisal theories, emotions are the outcome of a multi-dimensional evaluation of events. In this paper, we present APPReddit, the first corpus of non-experimental data annotated according to this theory. After describing its development, we compare ou… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

  5. arXiv:2111.07224  [pdf, other

    cs.CV cs.AI

    Local Multi-Head Channel Self-Attention for Facial Expression Recognition

    Authors: Roberto Pecoraro, Valerio Basile, Viviana Bono, Sara Gallo

    Abstract: Since the Transformer architecture was introduced in 2017 there has been many attempts to bring the self-attention paradigm in the field of computer vision. In this paper we propose a novel self-attention module that can be easily integrated in virtually every convolutional neural network and that is specifically designed for computer vision, the LHC: Local (multi) Head Channel (self-attention). L… ▽ More

    Submitted 18 November, 2021; v1 submitted 13 November, 2021; originally announced November 2021.

    Comments: https://github.com/Bodhis4ttva/LHC_Net

  6. Toward a Perspectivist Turn in Ground Truthing for Predictive Computing

    Authors: Valerio Basile, Federico Cabitza, Andrea Campagner, Michael Fell

    Abstract: Most Artificial Intelligence applications are based on supervised machine learning (ML), which ultimately grounds on manually annotated data. The annotation process is often performed in terms of a majority vote and this has been proved to be often problematic, as highlighted by recent studies on the evaluation of ML models. In this article we describe and advocate for a different paradigm, which… ▽ More

    Submitted 29 June, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: If you wish to cite this work, consider citing the AAAI 2023 proceedings version (https://doi.org/10.1609/aaai.v37i6.25840) and citing it in this way: Cabitza, F., Campagner, A., & Basile, V. (2023). Toward a Perspectivist Turn in Ground Truthing for Predictive Computing. Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 6860-6868. https://doi.org/10.1609/aaai.v37i6.25840

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 37(6), 6860-6868 (2023)

  7. arXiv:2106.15896  [pdf, other

    cs.CL cs.AI

    Whose Opinions Matter? Perspective-aware Models to Identify Opinions of Hate Speech Victims in Abusive Language Detection

    Authors: Sohail Akhtar, Valerio Basile, Viviana Patti

    Abstract: Social media platforms provide users the freedom of expression and a medium to exchange information and express diverse opinions. Unfortunately, this has also resulted in the growth of abusive content with the purpose of discriminating people and targeting the most vulnerable communities such as immigrants, LGBT, Muslims, Jews and women. Because abusive language is subjective in nature, there migh… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

  8. arXiv:2011.05706  [pdf, ps, other

    cs.CL

    Multilingual Irony Detection with Dependency Syntax and Neural Models

    Authors: Alessandra Teresa Cignarella, Valerio Basile, Manuela Sanguinetti, Cristina Bosco, Paolo Rosso, Farah Benamara

    Abstract: This paper presents an in-depth investigation of the effectiveness of dependency-based syntactic features on the irony detection task in a multilingual perspective (English, Spanish, French and Italian). It focuses on the contribution from syntactic knowledge, exploiting linguistic resources where syntax is annotated according to the Universal Dependencies scheme. Three distinct experimental setti… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: long paper accepted at COLING 2020

  9. arXiv:2010.12472  [pdf, other

    cs.CL

    HateBERT: Retraining BERT for Abusive Language Detection in English

    Authors: Tommaso Caselli, Valerio Basile, Jelena Mitrović, Michael Granitzer

    Abstract: In this paper, we introduce HateBERT, a re-trained BERT model for abusive language detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit comments in English from communities banned for being offensive, abusive, or hateful that we have collected and made available to the public. We present the results of a detailed comparison between a general pre-trained language mo… ▽ More

    Submitted 4 February, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

  10. arXiv:1901.01911  [pdf, other

    cs.CL

    Stance Classification for Rumour Analysis in Twitter: Exploiting Affective Information and Conversation Structure

    Authors: Endang Wahyu Pamungkas, Valerio Basile, Viviana Patti

    Abstract: Analysing how people react to rumours associated with news in social media is an important task to prevent the spreading of misinformation, which is nowadays widely recognized as a dangerous tendency. In social media conversations, users show different stances and attitudes towards rumourous stories. Some users take a definite stance, supporting or denying the rumour at issue, while others just co… ▽ More

    Submitted 7 January, 2019; originally announced January 2019.

    Comments: To appear in Proceedings of the 2nd International Workshop on Rumours and Deception in Social Media (RDSM), co-located with CIKM 2018, Turin, Italy, October 2018

  11. arXiv:1803.09840  [pdf, other

    cs.AI cs.CL

    Empirical Analysis of Foundational Distinctions in Linked Open Data

    Authors: Luigi Asprino, Valerio Basile, Paolo Ciancarini, Valentina Presutti

    Abstract: The Web and its Semantic extension (i.e. Linked Open Data) contain open global-scale knowledge and make it available to potentially intelligent machines that want to benefit from it. Nevertheless, most of Linked Open Data lack ontological distinctions and have sparse axiomatisation. For example, distinctions such as whether an entity is inherently a class or an individual, or whether it is a physi… ▽ More

    Submitted 23 May, 2018; v1 submitted 26 March, 2018; originally announced March 2018.