Showing 1–4 of 4 results for author: Menini, S

  1. arXiv:2406.17647

    cs.CL

    Variationist: Exploring Multifaceted Variation and Bias in Written Language Data

    Authors: Alan Ramponi, Camilla Casula, Stefano Menini

    Abstract: Exploring and understanding language data is a fundamental stage in all areas dealing with human language. It allows NLP practitioners to uncover quality concerns and harmful biases in data before training, and helps linguists and social scientists to gain insight into language use and human behavior. Yet, there is currently a lack of a unified, customizable tool to seamlessly inspect and visualiz…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: ACL 2024 (System Demonstrations)

  2. Agreeing to Disagree: Annotating Offensive Language Datasets with Annotators' Disagreement

    Authors: Elisa Leonardelli, Stefano Menini, Alessio Palmero Aprosio, Marco Guerini, Sara Tonelli

    Abstract: Since state-of-the-art approaches to offensive language detection rely on supervised learning, it is crucial to quickly adapt them to the continuously evolving scenario of social media. While several approaches have been proposed to tackle the problem from an algorithmic perspective, so to reduce the need for annotated data, less attention has been paid to the quality of these data. Following a tr…

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: To appear at EMNLP 2021 (long paper)

  3. arXiv:2103.14916

    cs.CL

    Abuse is Contextual, What about NLP? The Role of Context in Abusive Language Annotation and Detection

    Authors: Stefano Menini, Alessio Palmero Aprosio, Sara Tonelli

    Abstract: The datasets most widely used for abusive language detection contain lists of messages, usually tweets, that have been manually judged as abusive or not by one or more annotators, with the annotation performed at message level. In this paper, we investigate what happens when the hateful content of a message is judged also based on the context, given that messages are often ambiguous and need to be…

    Submitted 27 March, 2021; originally announced March 2021.

  4. arXiv:2005.02235

    cs.CL

    Creating a Multimodal Dataset of Images and Text to Study Abusive Language

    Authors: Alessio Palmero Aprosio, Stefano Menini, Sara Tonelli

    Abstract: In order to study online hate speech, the availability of datasets containing the linguistic phenomena of interest are of crucial importance. However, when it comes to specific target groups, for example teenagers, collecting such data may be problematic due to issues with consent and privacy restrictions. Furthermore, while text-only datasets of this kind have been widely used, limitations set by…

    Submitted 5 May, 2020; originally announced May 2020.