Skip to main content

Showing 1–9 of 9 results for author: Plepi, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05740  [pdf, other

    cs.CL

    Do Multilingual Large Language Models Mitigate Stereotype Bias?

    Authors: Shangrui Nie, Michael Fromm, Charles Welch, Rebekka Görge, Akbar Karimi, Joan Plepi, Nazia Afsan Mowmita, Nicolas Flores-Herr, Mehdi Ali, Lucie Flek

    Abstract: While preliminary findings indicate that multilingual LLMs exhibit reduced bias compared to monolingual ones, a comprehensive understanding of the effect of multilingual training on bias mitigation, is lacking. This study addresses this gap by systematically training six LLMs of identical size (2.6B parameters) and architecture: five monolingual models (English, German, French, Italian, and Spanis… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 19 pages, 8 figures

  2. arXiv:2404.02340  [pdf, other

    cs.CL

    Corpus Considerations for Annotator Modeling and Scaling

    Authors: Olufunke O. Sarumi, Béla Neuendorf, Joan Plepi, Lucie Flek, Jörg Schlötterer, Charles Welch

    Abstract: Recent trends in natural language processing research and annotation tasks affirm a paradigm shift from the traditional reliance on a single ground truth to a focus on individual perspectives, particularly in subjective tasks. In scenarios where annotation tasks are meant to encompass diversity, models that solely rely on the majority class labels may inadvertently disregard valuable minority pers… ▽ More

    Submitted 17 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at NAACL 2024

    ACM Class: F.2.2; I.2.7

  3. arXiv:2210.14531  [pdf, other

    cs.CL

    Unifying Data Perspectivism and Personalization: An Application to Social Norms

    Authors: Joan Plepi, Béla Neuendorf, Lucie Flek, Charles Welch

    Abstract: Instead of using a single ground truth for language processing tasks, several recent studies have examined how to represent and predict the labels of the set of annotators. However, often little or no information about annotators is known, or the set of annotators is small. In this work, we examine a corpus of social media posts about conflict from a set of 13k annotators and 210k judgements of so… ▽ More

    Submitted 22 October, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

  4. arXiv:2208.08758  [pdf, other

    cs.CL

    Understanding Interpersonal Conflict Types and their Impact on Perception Classification

    Authors: Charles Welch, Joan Plepi, Béla Neuendorf, Lucie Flek

    Abstract: Studies on interpersonal conflict have a long history and contain many suggestions for conflict typology. We use this as the basis of a novel annotation scheme and release a new dataset of situations and conflict aspect annotations. We then build a classifier to predict whether someone will perceive the actions of one individual as right or wrong in a given situation. Our analyses include conflict… ▽ More

    Submitted 27 October, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

  5. arXiv:2205.06181  [pdf, other

    cs.SI

    FACTOID: A New Dataset for Identifying Misinformation Spreaders and Political Bias

    Authors: Flora Sakketou, Joan Plepi, Riccardo Cervero, Henri-Jacques Geiss, Paolo Rosso, Lucie Flek

    Abstract: Proactively identifying misinformation spreaders is an important step towards mitigating the impact of fake news on our society. In this paper, we introduce a new contemporary Reddit dataset for fake news spreader analysis, called FACTOID, monitoring political discussions on Reddit since the beginning of 2020. The dataset contains over 4K users with 3.4M Reddit posts, and includes, beyond the user… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: Accepted to LREC 2022

  6. arXiv:2204.13329  [pdf, other

    cs.AI

    Refining Diagnosis Paths for Medical Diagnosis based on an Augmented Knowledge Graph

    Authors: Niclas Heilig, Jan Kirchhoff, Florian Stumpe, Joan Plepi, Lucie Flek, Heiko Paulheim

    Abstract: Medical diagnosis is the process of making a prediction of the disease a patient is likely to have, given a set of symptoms and observations. This requires extensive expert knowledge, in particular when covering a large variety of diseases. Such knowledge can be coded in a knowledge graph -- encompassing diseases, symptoms, and diagnosis paths. Since both the knowledge itself and its encoding can… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted at the 5th Workshop on Semantic Web solutions for large-scale biomedical data analytics

  7. arXiv:2110.04001  [pdf, other

    cs.CL

    Perceived and Intended Sarcasm Detection with Graph Attention Networks

    Authors: Joan Plepi, Lucie Flek

    Abstract: Existing sarcasm detection systems focus on exploiting linguistic markers, context, or user-level priors. However, social studies suggest that the relationship between the author and the audience can be equally relevant for the sarcasm usage and interpretation. In this work, we propose a framework jointly leveraging (1) a user context from their historical tweets together with (2) the social infor… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  8. arXiv:2104.01569  [pdf, other

    cs.CL

    Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention Networks

    Authors: Endri Kacupaj, Joan Plepi, Kuldeep Singh, Harsh Thakkar, Jens Lehmann, Maria Maleshkova

    Abstract: This paper addresses the task of (complex) conversational question answering over a knowledge graph. For this task, we propose LASAGNE (muLti-task semAntic parSing with trAnsformer and Graph atteNtion nEtworks). It is the first approach, which employs a transformer architecture extended with Graph Attention Networks for multi-task neural semantic parsing. LASAGNE uses a transformer model for gener… ▽ More

    Submitted 24 June, 2021; v1 submitted 4 April, 2021; originally announced April 2021.

    Comments: 16th conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)

  9. arXiv:2103.07766  [pdf, other

    cs.CL

    Context Transformer with Stacked Pointer Networks for Conversational Question Answering over Knowledge Graphs

    Authors: Joan Plepi, Endri Kacupaj, Kuldeep Singh, Harsh Thakkar, Jens Lehmann

    Abstract: Neural semantic parsing approaches have been widely used for Question Answering (QA) systems over knowledge graphs. Such methods provide the flexibility to handle QA datasets with complex queries and a large number of entities. In this work, we propose a novel framework named CARTON, which performs multi-task semantic parsing for handling the problem of conversational question answering over a lar… ▽ More

    Submitted 24 June, 2021; v1 submitted 13 March, 2021; originally announced March 2021.

    Comments: 18th Extended Semantic Web Conference 2021 (ESWC'2021) - Research Track