Skip to main content

Showing 1–4 of 4 results for author: Bungert, T J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01032  [pdf, other

    cs.LG cs.CV stat.ME

    Overcoming Common Flaws in the Evaluation of Selective Classification Systems

    Authors: Jeremias Traub, Till J. Bungert, Carsten T. Lüth, Michael Baumgartner, Klaus H. Maier-Hein, Lena Maier-Hein, Paul F Jaeger

    Abstract: Selective Classification, wherein models can reject low-confidence predictions, promises reliable translation of machine-learning based classification systems to real-world scenarios such as clinical diagnostics. While current evaluation of these systems typically assumes fixed working points based on pre-defined rejection thresholds, methodological progress requires benchmarking the general perfo… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2307.14729  [pdf, other

    eess.IV cs.CV cs.LG

    Understanding Silent Failures in Medical Image Classification

    Authors: Till J. Bungert, Levin Kobelke, Paul F. Jaeger

    Abstract: To ensure the reliable use of classification systems in medical applications, it is crucial to prevent silent failures. This can be achieved by either designing classifiers that are robust enough to avoid failures in the first place, or by detecting remaining failures using confidence scoring functions (CSFs). A predominant source of failures in image classification is distribution shifts between… ▽ More

    Submitted 22 August, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted at MICCAI 23

  3. arXiv:2301.10625  [pdf, other

    cs.CV

    Navigating the Pitfalls of Active Learning Evaluation: A Systematic Framework for Meaningful Performance Assessment

    Authors: Carsten T. Lüth, Till J. Bungert, Lukas Klein, Paul F. Jaeger

    Abstract: Active Learning (AL) aims to reduce the labeling burden by interactively selecting the most informative samples from a pool of unlabeled data. While there has been extensive research on improving AL query methods in recent years, some studies have questioned the effectiveness of AL compared to emerging paradigms such as semi-supervised (Semi-SL) and self-supervised learning (Self-SL), or a simple… ▽ More

    Submitted 3 November, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted at NeurIPS 2023

  4. arXiv:2211.15259  [pdf, other

    cs.CV cs.LG

    A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification

    Authors: Paul F. Jaeger, Carsten T. Lüth, Lukas Klein, Till J. Bungert

    Abstract: Reliable application of machine learning-based decision systems in the wild is one of the major challenges currently investigated by the field. A large portion of established approaches aims to detect erroneous predictions by means of assigning confidence scores. This confidence may be obtained by either quantifying the model's predictive uncertainty, learning explicit scoring functions, or assess… ▽ More

    Submitted 5 April, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Journal ref: ICLR 2023 (oral)