Skip to main content

Showing 1–2 of 2 results for author: Klikowski, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10255  [pdf, other

    cs.CL cs.SI

    WarCov -- Large multilabel and multimodal dataset from social platform

    Authors: Weronika Borek-Marciniec, Pawel Zyblewski, Jakub Klikowski, Pawel Ksieniewicz

    Abstract: In the classification tasks, from raw data acquisition to the curation of a dataset suitable for use in evaluating machine learning models, a series of steps - often associated with high costs - are necessary. In the case of Natural Language Processing, initial cleaning and conversion can be performed automatically, but obtaining labels still requires the rationalized input of human experts. As a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 13 pages, 6 figures

  2. arXiv:2102.00266  [pdf, ps, other

    cs.CV

    Hellinger Distance Weighted Ensemble for Imbalanced Data Stream Classification

    Authors: Joanna Grzyb, Jakub Klikowski, Michał Woźniak

    Abstract: The imbalanced data classification remains a vital problem. The key is to find such methods that classify both the minority and majority class correctly. The paper presents the classifier ensemble for classifying binary, non-stationary and imbalanced data streams where the Hellinger Distance is used to prune the ensemble. The paper includes an experimental evaluation of the method based on the con… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.