Skip to main content

Showing 1–5 of 5 results for author: Pardo, T A S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2301.11850  [pdf, other

    cs.CL

    Predicting Sentence-Level Factuality of News and Bias of Media Outlets

    Authors: Francielle Vargas, Kokil Jaidka, Thiago A. S. Pardo, Fabrício Benevenuto

    Abstract: Automated news credibility and fact-checking at scale require accurately predicting news factuality and media bias. This paper introduces a large sentence-level dataset, titled "FactNews", composed of 6,191 sentences expertly annotated according to factuality and media bias definitions proposed by AllSides. We use FactNews to assess the overall reliability of news sources, by formulating two text… ▽ More

    Submitted 28 June, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  2. arXiv:2104.12265  [pdf, other

    cs.CL

    Contextual-Lexicon Approach for Abusive Language Detection

    Authors: Francielle Vargas, Fabiana Rodrigues de Góes, Isabelle Carvalho, Fabrício Benevenuto, Thiago Alexandre Salgueiro Pardo

    Abstract: Since a lexicon-based approach is more elegant scientifically, explaining the solution components and being easier to generalize to other applications, this paper provides a new approach for offensive language and hate speech detection on social media. Our approach embodies a lexicon of implicit and explicit offensive and swearing expressions annotated with contextual information. Due to the sever… ▽ More

    Submitted 20 December, 2022; v1 submitted 25 April, 2021; originally announced April 2021.

    Comments: Please cite: https://aclanthology.org/2021.ranlp-1.161/

  3. arXiv:2103.14972  [pdf, other

    cs.CL

    HateBR: A Large Expert Annotated Corpus of Brazilian Instagram Comments for Offensive Language and Hate Speech Detection

    Authors: Francielle Alves Vargas, Isabelle Carvalho, Fabiana Rodrigues de Góes, Fabrício Benevenuto, Thiago Alexandre Salgueiro Pardo

    Abstract: Due to the severity of the social media offensive and hateful comments in Brazil, and the lack of research in Portuguese, this paper provides the first large-scale expert annotated corpus of Brazilian Instagram comments for hate speech and offensive language detection. The HateBR corpus was collected from the comment section of Brazilian politicians' accounts on Instagram and manually annotated by… ▽ More

    Submitted 27 December, 2022; v1 submitted 27 March, 2021; originally announced March 2021.

    Comments: Published at LREC 2022 Proceedings

    Journal ref: https://aclanthology.org/2022.lrec-1.777/

  4. arXiv:2008.06079  [pdf, other

    cs.CL cs.AI cs.IR

    Studying Dishonest Intentions in Brazilian Portuguese Texts

    Authors: Francielle Alves Vargas, Thiago Alexandre Salgueiro Pardo

    Abstract: Previous work in the social sciences, psychology and linguistics has show that liars have some control over the content of their stories, however their underlying state of mind may "leak out" through the way that they tell them. To the best of our knowledge, no previous systematic effort exists in order to describe and model deception language for Brazilian Portuguese. To fill this important gap,… ▽ More

    Submitted 1 April, 2021; v1 submitted 13 August, 2020; originally announced August 2020.

  5. arXiv:1905.12069  [pdf, other

    cs.CL

    SEMA: an Extended Semantic Evaluation Metric for AMR

    Authors: Rafael T. Anchieta, Marco A. S. Cabezudo, Thiago A. S. Pardo

    Abstract: Abstract Meaning Representation (AMR) is a recently designed semantic representation language intended to capture the meaning of a sentence, which may be represented as a single-rooted directed acyclic graph with labeled nodes and edges. The automatic evaluation of this structure plays an important role in the development of better systems, as well as for semantic annotation. Despite there is one… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted by CICLing 2019