Skip to main content

Showing 1–3 of 3 results for author: Scheibel, W

.
  1. arXiv:2406.13552  [pdf, other

    cs.LG cs.HC

    Standardness Fogs Meaning: A Position Regarding the Informed Usage of Standard Datasets

    Authors: Tim Cech, Ole Wegen, Daniel Atzberger, Rico Richter, Willy Scheibel, Jürgen Döllner

    Abstract: Standard datasets are frequently used to train and evaluate Machine Learning models. However, the assumed standardness of these datasets leads to a lack of in-depth discussion on how their labels match the derived categories for the respective use case. In other words, the standardness of the datasets seems to fog coherency and applicability, thus impeding the trust in Machine Learning models. We… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. Large-Scale Evaluation of Topic Models and Dimensionality Reduction Methods for 2D Text Spatialization

    Authors: Daniel Atzberger, Tim Cech, Willy Scheibel, Matthias Trapp, Rico Richter, Jürgen Döllner, Tobias Schreck

    Abstract: Topic models are a class of unsupervised learning algorithms for detecting the semantic structure within a text corpus. Together with a subsequent dimensionality reduction algorithm, topic models can be used for deriving spatializations for text corpora as two-dimensional scatter plots, reflecting semantic similarity between the documents and supporting corpus analysis. Although the choice of the… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: To be published at IEEE VIS 2023 conference

  3. Tooling for Time- and Space-efficient git Repository Mining

    Authors: Fabian Heseding, Willy Scheibel, Jürgen Döllner

    Abstract: Software projects under version control grow with each commit, accumulating up to hundreds of thousands of commits per repository. Especially for such large projects, the traversal of a repository and data extraction for static source code analysis poses a trade-off between granularity and speed. We showcase the command-line tool pyrepositoryminer that combines a set of optimization approaches for… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.