Skip to main content

Showing 1–7 of 7 results for author: Nickel, R M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2106.15825  [pdf, other

    cs.CL

    O2D2: Out-Of-Distribution Detector to Capture Undecidable Trials in Authorship Verification

    Authors: Benedikt Boenninghoff, Robert M. Nickel, Dorothea Kolossa

    Abstract: The PAN 2021 authorship verification (AV) challenge is part of a three-year strategy, moving from a cross-topic/closed-set AV task to a cross-topic/open-set AV task over a collection of fanfiction texts. In this work, we present a novel hybrid neural-probabilistic framework that is designed to tackle the challenges of the 2021 task. Our system is based on our 2020 winning submission, with updates… ▽ More

    Submitted 30 July, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: PAN@CLEF 2021

  2. arXiv:2106.11196  [pdf, other

    cs.CL

    Self-Calibrating Neural-Probabilistic Model for Authorship Verification Under Covariate Shift

    Authors: Benedikt Boenninghoff, Dorothea Kolossa, Robert M. Nickel

    Abstract: We are addressing two fundamental problems in authorship verification (AV): Topic variability and miscalibration. Variations in the topic of two disputed texts are a major cause of error for most AV systems. In addition, it is observed that the underlying probability estimates produced by deep learning AV mechanisms oftentimes do not match the actual case counts in the respective training data. As… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: 12th International Conference of the CLEF Association, 2021

  3. arXiv:2103.01173  [pdf, ps, other

    cs.SD cs.LG eess.AS

    Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering

    Authors: Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

    Abstract: The detection of voiced speech, the estimation of the fundamental frequency, and the tracking of pitch values over time are crucial subtasks for a variety of speech processing techniques. Many different algorithms have been developed for each of the three subtasks. We present a new algorithm that integrates the three subtasks into a single procedure. The algorithm can be applied to pre-recorded sp… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Speech Communication; 12. ITG Symposium, 5-7 Oct. 2016

  4. arXiv:2008.10105  [pdf, other

    cs.CL cs.LG

    Deep Bayes Factor Scoring for Authorship Verification

    Authors: Benedikt Boenninghoff, Julian Rupp, Robert M. Nickel, Dorothea Kolossa

    Abstract: The PAN 2020 authorship verification (AV) challenge focuses on a cross-topic/closed-set AV task over a collection of fanfiction texts. Fanfiction is a fan-written extension of a storyline in which a so-called fandom topic describes the principal subject of the document. The data provided in the PAN 2020 AV task is quite challenging because authors of texts across multiple/different fandom topics a… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: CLEF 2020 Labs and Workshops, Notebook Papers, September 2020. CEUR-WS.org

  5. arXiv:2005.13930  [pdf, other

    cs.LG cs.CL stat.ML

    Variational Autoencoder with Embedded Student-$t$ Mixture Model for Authorship Attribution

    Authors: Benedikt Boenninghoff, Steffen Zeiler, Robert M. Nickel, Dorothea Kolossa

    Abstract: Traditional computational authorship attribution describes a classification task in a closed-set scenario. Given a finite set of candidate authors and corresponding labeled texts, the objective is to determine which of the authors has written another set of anonymous or disputed texts. In this work, we propose a probabilistic autoencoding framework to deal with this supervised classification task.… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: Preprint

  6. arXiv:1910.08144  [pdf, ps, other

    cs.CL

    Explainable Authorship Verification in Social Media via Attention-based Similarity Learning

    Authors: Benedikt Boenninghoff, Steffen Hessler, Dorothea Kolossa, Robert M. Nickel

    Abstract: Authorship verification is the task of analyzing the linguistic patterns of two or more texts to determine whether they were written by the same author or not. The analysis is traditionally performed by experts who consider linguistic features, which include spelling mistakes, grammatical inconsistencies, and stylistics for example. Machine learning algorithms, on the other hand, can be trained to… ▽ More

    Submitted 19 November, 2019; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted for 2019 IEEE International Conference on Big Data (IEEE Big Data 2019)

  7. arXiv:1908.07844  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Similarity Learning for Authorship Verification in Social Media

    Authors: Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

    Abstract: Authorship verification tries to answer the question if two documents with unknown authors were written by the same author or not. A range of successful technical approaches has been proposed for this task, many of which are based on traditional linguistic features such as n-grams. These algorithms achieve good results for certain types of written documents like books and novels. Forensic authorsh… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 5 pages, 3 figures, 1 table, presented on ICASSP 2019 in Brighton, UK