Skip to main content

Showing 1–9 of 9 results for author: Viering, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.04111  [pdf, other

    cs.LG

    The Unreasonable Effectiveness Of Early Discarding After One Epoch In Neural Network Hyperparameter Optimization

    Authors: Romain Egele, Felix Mohr, Tom Viering, Prasanna Balaprakash

    Abstract: To reach high performance with deep learning, hyperparameter optimization (HPO) is essential. This process is usually time-consuming due to costly evaluations of neural networks. Early discarding techniques limit the resources granted to unpromising candidates by observing the empirical learning curves and canceling neural network training as soon as the lack of competitiveness of a candidate beco… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  2. arXiv:2211.14061  [pdf, other

    cs.LG stat.ME stat.ML

    A Survey of Learning Curves with Bad Behavior: or How More Data Need Not Lead to Better Performance

    Authors: Marco Loog, Tom Viering

    Abstract: Plotting a learner's generalization performance against the training set size results in a so-called learning curve. This tool, providing insight in the behavior of the learner, is also practically valuable for model selection, predicting the effect of more training data, and reducing the computational complexity of training. We set out to make the (ideal) learning curve concept precise and briefl… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2103.10948

    ACM Class: A.1; I.5.0; I.2.6

  3. arXiv:2103.10948  [pdf, other

    cs.LG

    The Shape of Learning Curves: a Review

    Authors: Tom Viering, Marco Loog

    Abstract: Learning curves provide insight into the dependence of a learner's generalization performance on the training set size. This important tool can be used for model selection, to predict the effect of more training data, and to reduce the computational complexity of model training and hyperparameter tuning. This review recounts the origins of the term, provides a formal definition of the learning cur… ▽ More

    Submitted 5 November, 2022; v1 submitted 19 March, 2021; originally announced March 2021.

    Comments: 20 pages. Revised version of original manuscript. Expresses, in particular, more scepticism towards the idea that it has been proven that power laws often provide accurate learning curve models

  4. A Brief Prehistory of Double Descent

    Authors: Marco Loog, Tom Viering, Alexander Mey, Jesse H. Krijthe, David M. J. Tax

    Abstract: In their thought-provoking paper [1], Belkin et al. illustrate and discuss the shape of risk curves in the context of modern high-complexity learners. Given a fixed training sample size $n$, such curves show the risk of a learner as a function of some (approximate) measure of its complexity $N$. With $N$ the number of features, these curves are also referred to as feature curves. A salient observa… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

  5. arXiv:1911.11030  [pdf, other

    cs.LG stat.ML

    Making Learners (More) Monotone

    Authors: Tom J. Viering, Alexander Mey, Marco Loog

    Abstract: Learning performance can show non-monotonic behavior. That is, more data does not necessarily lead to better models, even on average. We propose three algorithms that take a supervised learning model and make it perform more monotone. We prove consistency and monotonicity with high probability, and evaluate the algorithms on scenarios where non-monotone behaviour occurs. Our proposed algorithm… ▽ More

    Submitted 25 November, 2019; originally announced November 2019.

  6. arXiv:1907.10901  [pdf, other

    cs.CV cs.CR cs.LG

    How to Manipulate CNNs to Make Them Lie: the GradCAM Case

    Authors: Tom Viering, Ziqi Wang, Marco Loog, Elmar Eisemann

    Abstract: Recently many methods have been introduced to explain CNN decisions. However, it has been shown that some methods can be sensitive to manipulation of the input. We continue this line of work and investigate the explanation method GradCAM. Instead of manipulating the input, we consider an adversary that manipulates the model itself to attack the explanation. By changing weights and architecture, we… ▽ More

    Submitted 16 August, 2019; v1 submitted 25 July, 2019; originally announced July 2019.

    Comments: Presented at BMVC 2019: Workshop on Interpretable and Explainable Machine Vision, Cardiff, UK. Updated to BMVC template

  7. arXiv:1907.05476  [pdf, other

    cs.LG stat.ML

    Minimizers of the Empirical Risk and Risk Monotonicity

    Authors: Marco Loog, Tom Viering, Alexander Mey

    Abstract: Plotting a learner's average performance against the number of training samples results in a learning curve. Studying such curves on one or more data sets is a way to get to a better understanding of the generalization properties of this learner. The behavior of learning curves is, however, not very well understood and can display (for most researchers) quite unexpected behavior. Our work introduc… ▽ More

    Submitted 13 March, 2020; v1 submitted 11 July, 2019; originally announced July 2019.

    Comments: New version fixes some minor issues especially in the proof of Theorem 1

  8. A Distribution Dependent and Independent Complexity Analysis of Manifold Regularization

    Authors: Alexander Mey, Tom Viering, Marco Loog

    Abstract: Manifold regularization is a commonly used technique in semi-supervised learning. It enforces the classification rule to be smooth with respect to the data-manifold. Here, we derive sample complexity bounds based on pseudo-dimension for models that add a convex data dependent regularization term to a supervised learning process, as is in particular done in Manifold regularization. We then compare… ▽ More

    Submitted 27 November, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  9. arXiv:1706.02645  [pdf, other

    cs.LG stat.ML

    Nuclear Discrepancy for Active Learning

    Authors: Tom J. Viering, Jesse H. Krijthe, Marco Loog

    Abstract: Active learning algorithms propose which unlabeled objects should be queried for their labels to improve a predictive model the most. We study active learners that minimize generalization bounds and uncover relationships between these bounds that lead to an improved approach to active learning. In particular we show the relation between the bound of the state-of-the-art Maximum Mean Discrepancy (M… ▽ More

    Submitted 8 June, 2017; originally announced June 2017.

    Comments: 32 pages, 5 figures, 4 tables