Skip to main content

Showing 1–4 of 4 results for author: Gruber, S G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.08589  [pdf, other

    cs.LG stat.ML

    Consistent and Asymptotically Unbiased Estimation of Proper Calibration Errors

    Authors: Teodora Popordanoska, Sebastian G. Gruber, Aleksei Tiulpin, Florian Buettner, Matthew B. Blaschko

    Abstract: Proper scoring rules evaluate the quality of probabilistic predictions, playing an essential role in the pursuit of accurate and well-calibrated models. Every proper score decomposes into two fundamental components -- proper calibration error and refinement -- utilizing a Bregman divergence. While uncertainty calibration has gained significant attention, current literature lacks a general estimato… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: Preprint

  2. arXiv:2310.05833  [pdf, other

    cs.LG stat.ML

    A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models

    Authors: Sebastian G. Gruber, Florian Buettner

    Abstract: Generative models, like large language models, are becoming increasingly relevant in our daily lives, yet a theoretical framework to assess their generalization behavior and uncertainty does not exist. Particularly, the problem of uncertainty estimation is commonly solved in an ad-hoc manner and task dependent. For example, natural language approaches cannot be transferred to image generation. In… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: Preprint

  3. arXiv:2210.12256  [pdf, other

    cs.LG stat.ML

    Uncertainty Estimates of Predictions via a General Bias-Variance Decomposition

    Authors: Sebastian G. Gruber, Florian Buettner

    Abstract: Reliably estimating the uncertainty of a prediction throughout the model lifecycle is crucial in many safety-critical applications. The most common way to measure this uncertainty is via the predicted confidence. While this tends to work well for in-domain samples, these estimates are unreliable under domain drift and restricted to classification. Alternatively, proper scores can be used for most… ▽ More

    Submitted 20 April, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted at AISTATS 2023

  4. arXiv:2203.07835  [pdf, other

    cs.LG stat.ML

    Better Uncertainty Calibration via Proper Scores for Classification and Beyond

    Authors: Sebastian G. Gruber, Florian Buettner

    Abstract: With model trustworthiness being crucial for sensitive real-world applications, practitioners are putting more and more focus on improving the uncertainty calibration of deep neural networks. Calibration errors are designed to quantify the reliability of probabilistic predictions but their estimators are usually biased and inconsistent. In this work, we introduce the framework of proper calibratio… ▽ More

    Submitted 12 March, 2024; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: Published at NeurIPS 2022. Corrected conference version Theorem 3.1 and Proposition 3.2 since CWCE=0 does not imply TCE=0

    Journal ref: Advances in Neural Information Processing Systems 35 (NeurIPS 2022)