Showing 1–1 of 1 results for author: Devanne, M

  1. arXiv:2305.11921  [pdf, other]

    stat.ME cs.AI cs.LG cs.PF

    An Approach to Multiple Comparison Benchmark Evaluations that is Stable Under Manipulation of the Comparate Set

    Authors: Ali Ismail-Fawaz, Angus Dempster, Chang Wei Tan, Matthieu Herrmann, Lynn Miller, Daniel F. Schmidt, Stefano Berretti, Jonathan Weber, Maxime Devanne, Germain Forestier, Geoffrey I. Webb

    Abstract: The measurement of progress using benchmark evaluations is ubiquitous in computer science and machine learning. However, common approaches to analyzing and presenting the results of benchmark comparisons of multiple algorithms over multiple datasets, such as the critical difference diagram introduced by Demšar (2006), have important shortcomings and, we show, are open to both inadvertent and inte…

    Submitted 19 May, 2023; originally announced May 2023.