Skip to main content

Showing 1–4 of 4 results for author: Petrak, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.17163  [pdf, other

    cs.CL cs.AI

    Misogyny classification of German newspaper forum comments

    Authors: Johann Petrak, Brigitte Krenn

    Abstract: This paper presents work on detecting misogyny in the comments of a large Austrian German language newspaper forum. We describe the creation of a corpus of 6600 comments which were annotated with 5 levels of misogyny. The forum moderators were involved as experts in the creation of the annotation guidelines and the annotation of the comments. We also describe the results of training transformer-ba… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

  2. arXiv:2006.03354  [pdf, other

    cs.LG cs.CL cs.SI stat.ML

    Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus

    Authors: Xingyi Song, Johann Petrak, Ye Jiang, Iknoor Singh, Diana Maynard, Kalina Bontcheva

    Abstract: The explosion of disinformation accompanying the COVID-19 pandemic has overloaded fact-checkers and media worldwide, and brought a new major challenge to government responses worldwide. Not only is disinformation creating confusion about medical science amongst citizens, but it is also amplifying distrust in policy makers and governments. To help tackle this, we developed computational methods to… ▽ More

    Submitted 11 March, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: This is arXiv version of "Classification Aware Neural Topic Model for COVID-19 Disinformation Categorisation"

    Journal ref: PLOS ONE 2021

  3. arXiv:1809.00934  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    A Deep Neural Network Sentence Level Classification Method with Context Information

    Authors: Xingyi Song, Johann Petrak, Angus Roberts

    Abstract: In the sentence classification task, context formed from sentences adjacent to the sentence being classified can provide important information for classification. This context is, however, often ignored. Where methods do make use of context, only small amounts are considered, making it difficult to scale. We present a new method for sentence classification, Context-LSTM-CNN, that makes use of pote… ▽ More

    Submitted 31 August, 2018; originally announced September 2018.

    Comments: Accepted at EMNLP2018

  4. Analysis of Named Entity Recognition and Linking for Tweets

    Authors: Leon Derczynski, Diana Maynard, Giuseppe Rizzo, Marieke van Erp, Genevieve Gorrell, Raphaƫl Troncy, Johann Petrak, Kalina Bontcheva

    Abstract: Applying natural language processing for mining and intelligent information access to tweets (a form of microblog) is a challenging, emerging research area. Unlike carefully authored news text and other longer content, tweets pose a number of new challenges, due to their short, noisy, context-dependent, and dynamic nature. Information extraction from tweets is typically performed in a pipeline, co… ▽ More

    Submitted 27 October, 2014; originally announced October 2014.

    Comments: 35 pages, accepted to journal Information Processing and Management

    Journal ref: Information Processing & Management 51 (2), 32-49, 2014