Skip to main content

Showing 1–22 of 22 results for author: Waegeman, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08853  [pdf, other

    stat.ML cs.LG q-bio.QM

    Assessment of Uncertainty Quantification in Universal Differential Equations

    Authors: Nina Schmid, David Fernandes del Pozo, Willem Waegeman, Jan Hasenauer

    Abstract: Scientific Machine Learning is a new class of approaches that integrate physical knowledge and mechanistic models with data-driven techniques for uncovering governing equations of complex processes. Among the available approaches, Universal Differential Equations (UDEs) are used to combine prior knowledge in the form of mechanistic formulations with universal function approximators, like neural ne… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Shared last authorship between W.W. and J.H

  2. arXiv:2402.09056  [pdf, other

    cs.AI cs.LG

    Is Epistemic Uncertainty Faithfully Represented by Evidential Deep Learning Methods?

    Authors: Mira Jürgens, Nis Meinert, Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: Trustworthy ML systems should not only return accurate predictions, but also a reliable representation of their uncertainty. Bayesian methods are commonly used to quantify both aleatoric and epistemic uncertainty, but alternative approaches, such as evidential deep learning methods, have become popular in recent years. The latter group of methods in essence extends empirical risk minimization (ERM… ▽ More

    Submitted 20 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  3. arXiv:2309.08313  [pdf, other

    stat.ML cs.LG

    Conditional validity of heteroskedastic conformal regression

    Authors: Nicolas Dewolf, Bernard De Baets, Willem Waegeman

    Abstract: Conformal prediction, and split conformal prediction as a specific implementation, offer a distribution-free approach to estimating prediction intervals with statistical guarantees. Recent work has shown that split conformal prediction can produce state-of-the-art prediction intervals when focusing on marginal coverage, i.e. on a calibration dataset the method produces on average prediction interv… ▽ More

    Submitted 30 April, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 36 pages

  4. arXiv:2301.12736  [pdf, ps, other

    cs.LG stat.ML

    On Second-Order Scoring Rules for Epistemic Uncertainty Quantification

    Authors: Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: It is well known that accurate probabilistic predictors can be trained through empirical risk minimisation with proper scoring rules as loss functions. While such learners capture so-called aleatoric uncertainty of predictions, various machine learning methods have recently been developed with the goal to let the learner also represent its epistemic uncertainty, i.e., the uncertainty caused by a l… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    MSC Class: 68T37 (Primary) 68T30 (Secondary)

  5. arXiv:2211.04362  [pdf, other

    cs.LG

    Hyperparameter optimization in deep multi-target prediction

    Authors: Dimitrios Iliadis, Marcel Wever, Bernard De Baets, Willem Waegeman

    Abstract: As a result of the ever increasing complexity of configuring and fine-tuning machine learning models, the field of automated machine learning (AutoML) has emerged over the past decade. However, software implementations like Auto-WEKA and Auto-sklearn typically focus on classical machine learning (ML) tasks such as classification and regression. Our work can be seen as the first attempt at offering… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: 17 pages, 4 figures, 1 table

  6. arXiv:2205.10082  [pdf, other

    stat.ML cs.LG

    On the Calibration of Probabilistic Classifier Sets

    Authors: Thomas Mortier, Viktor Bengs, Eyke Hüllermeier, Stijn Luca, Willem Waegeman

    Abstract: Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble learning methods, are able to model aleatoric and epistemic uncertainty. Aleatoric uncertainty is then typically quantified via the Bayes error, and epistemic uncertainty via the size of the set. In this paper, we extend the notion of calibration, which is commonly used to evaluate the validity of t… ▽ More

    Submitted 19 April, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  7. arXiv:2203.06676  [pdf, other

    cs.LG cs.AI stat.ML

    Set-valued prediction in hierarchical classification with constrained representation complexity

    Authors: Thomas Mortier, Eyke Hüllermeier, Krzysztof Dembczyński, Willem Waegeman

    Abstract: Set-valued prediction is a well-known concept in multi-class classification. When a classifier is uncertain about the class label for a test instance, it can predict a set of classes instead of a single class. In this paper, we focus on hierarchical multi-class classification problems, where valid sets (typically) correspond to internal nodes of the hierarchy. We argue that this is a very strong r… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

  8. arXiv:2203.06102  [pdf, other

    cs.LG stat.ML

    Pitfalls of Epistemic Uncertainty Quantification through Loss Minimisation

    Authors: Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: Uncertainty quantification has received increasing attention in machine learning in the recent past. In particular, a distinction between aleatoric and epistemic uncertainty has been found useful in this regard. The latter refers to the learner's (lack of) knowledge and appears to be especially difficult to measure and quantify. In this paper, we analyse a recent proposal based on the idea of a se… ▽ More

    Submitted 13 October, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    MSC Class: 68T37 (Primary) 68T30 (Secondary)

  9. Valid prediction intervals for regression problems

    Authors: Nicolas Dewolf, Bernard De Baets, Willem Waegeman

    Abstract: Over the last few decades, various methods have been proposed for estimating prediction intervals in regression settings, including Bayesian methods, ensemble methods, direct interval estimation methods and conformal prediction methods. An important issue is the calibration of these methods: the generated prediction intervals should have a predefined coverage level, without being overly conservati… ▽ More

    Submitted 1 April, 2024; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: Minor correction (bibliography and typo in Fig. 3). Thanks to Dr. María Moreno de Castro for spotting this typo

  10. arXiv:2104.09967  [pdf, other

    cs.LG

    Multi-target prediction for dummies using two-branch neural networks

    Authors: Dimitrios Iliadis, Bernard De Baets, Willem Waegeman

    Abstract: Multi-target prediction (MTP) serves as an umbrella term for machine learning tasks that concern the simultaneous prediction of multiple target variables. Classical instantiations are multi-label classification, multivariate regression, multi-task learning, dyadic prediction, zero-shot learning, network inference, and matrix completion. Despite the significant similarities, all these domains have… ▽ More

    Submitted 25 October, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

  11. Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods

    Authors: Eyke Hüllermeier, Willem Waegeman

    Abstract: The notion of uncertainty is of major importance in machine learning and constitutes a key element of machine learning methodology. In line with the statistical tradition, uncertainty has long been perceived as almost synonymous with standard probability and probabilistic predictions. Yet, due to the steadily increasing relevance of machine learning for practical applications and related issues su… ▽ More

    Submitted 16 September, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: 59 pages

  12. arXiv:1906.08129  [pdf, other

    cs.LG stat.ML

    Efficient Set-Valued Prediction in Multi-Class Classification

    Authors: Thomas Mortier, Marek Wydmuch, Krzysztof Dembczyński, Eyke Hüllermeier, Willem Waegeman

    Abstract: In cases of uncertainty, a multi-class classifier preferably returns a set of candidate classes instead of predicting a single class label with little guarantee. More precisely, the classifier should strive for an optimal balance between the correctness (the true class is among the candidates) and the precision (the candidates are not too many) of its prediction. We formalize this problem within a… ▽ More

    Submitted 27 May, 2020; v1 submitted 19 June, 2019; originally announced June 2019.

  13. arXiv:1809.02352  [pdf, other

    stat.ML cs.LG

    Multi-Target Prediction: A Unifying View on Problems and Methods

    Authors: Willem Waegeman, Krzysztof Dembczynski, Eyke Huellermeier

    Abstract: Multi-target prediction (MTP) is concerned with the simultaneous prediction of multiple target variables of diverse type. Due to its enormous application potential, it has developed into an active and rapidly expanding research field that combines several subfields of machine learning, including multivariate regression, multi-label classification, multi-task learning, dyadic prediction, zero-shot… ▽ More

    Submitted 7 September, 2018; originally announced September 2018.

  14. arXiv:1803.01575  [pdf, other

    stat.ML cs.LG

    A Comparative Study of Pairwise Learning Methods based on Kernel Ridge Regression

    Authors: Michiel Stock, Tapio Pahikkala, Antti Airola, Bernard De Baets, Willem Waegeman

    Abstract: Many machine learning problems can be formulated as predicting labels for a pair of objects. Problems of that kind are often referred to as pairwise learning, dyadic prediction or network inference problems. During the last decade kernel methods have played a dominant role in pairwise learning. They still obtain a state-of-the-art predictive performance, but a theoretical analysis of their behavio… ▽ More

    Submitted 5 March, 2018; originally announced March 2018.

    Comments: arXiv admin note: text overlap with arXiv:1606.04275

  15. Exact and efficient top-K inference for multi-target prediction by querying separable linear relational models

    Authors: Michiel Stock, Krzysztof Dembczynski, Bernard De Baets, Willem Waegeman

    Abstract: Many complex multi-target prediction problems that concern large target spaces are characterised by a need for efficient prediction strategies that avoid the computation of predictions for all targets explicitly. Examples of such problems emerge in several subfields of machine learning, such as collaborative filtering, multi-label classification, dyadic prediction and biological network inference.… ▽ More

    Submitted 14 June, 2016; originally announced June 2016.

    Journal ref: Data Min Knowl Disc (2016) 30:1370-1394

  16. arXiv:1606.04275  [pdf, other

    cs.LG

    Efficient Pairwise Learning Using Kernel Ridge Regression: an Exact Two-Step Method

    Authors: Michiel Stock, Tapio Pahikkala, Antti Airola, Bernard De Baets, Willem Waegeman

    Abstract: Pairwise learning or dyadic prediction concerns the prediction of properties for pairs of objects. It can be seen as an umbrella covering various machine learning problems such as matrix completion, collaborative filtering, multi-task learning, transfer learning, network prediction and zero-shot learning. In this work we analyze kernel-based methods for pairwise learning, with a particular focus o… ▽ More

    Submitted 14 June, 2016; originally announced June 2016.

  17. arXiv:1506.05950  [pdf, ps, other

    cs.LG stat.ML

    Spectral Analysis of Symmetric and Anti-Symmetric Pairwise Kernels

    Authors: Tapio Pahikkala, Markus Viljanen, Antti Airola, Willem Waegeman

    Abstract: We consider the problem of learning regression functions from pairwise data when there exists prior knowledge that the relation to be learned is symmetric or anti-symmetric. Such prior knowledge is commonly enforced by symmetrizing or anti-symmetrizing pairwise kernel functions. Through spectral analysis, we show that these transformations reduce the kernel's effective dimension. Further, we provi… ▽ More

    Submitted 19 June, 2015; originally announced June 2015.

  18. arXiv:1405.4423  [pdf, other

    cs.LG

    A two-step learning approach for solving full and almost full cold start problems in dyadic prediction

    Authors: Tapio Pahikkala, Michiel Stock, Antti Airola, Tero Aittokallio, Bernard De Baets, Willem Waegeman

    Abstract: Dyadic prediction methods operate on pairs of objects (dyads), aiming to infer labels for out-of-sample dyads. We consider the full and almost full cold start problem in dyadic prediction, a setting that occurs when both objects in an out-of-sample dyad have not been observed during training, or if one of them has been observed, but very few times. A popular approach for addressing this problem is… ▽ More

    Submitted 17 May, 2014; originally announced May 2014.

  19. arXiv:1405.4394  [pdf, other

    cs.LG cs.CE q-bio.QM stat.ML

    Identification of functionally related enzymes by learning-to-rank methods

    Authors: Michiel Stock, Thomas Fober, Eyke Hüllermeier, Serghei Glinca, Gerhard Klebe, Tapio Pahikkala, Antti Airola, Bernard De Baets, Willem Waegeman

    Abstract: Enzyme sequences and structures are routinely used in the biological sciences as queries to search for functionally related enzymes in online databases. To this end, one usually departs from some notion of similarity, comparing two enzymes by looking for correspondences in their sequences, structures or surfaces. For a given query, the search operation results in a ranking of the enzymes in the da… ▽ More

    Submitted 17 May, 2014; originally announced May 2014.

  20. arXiv:1310.4849  [pdf, other

    stat.ML cs.LG

    On the Bayes-optimality of F-measure maximizers

    Authors: Willem Waegeman, Krzysztof Dembczynski, Arkadiusz Jachnik, Weiwei Cheng, Eyke Hullermeier

    Abstract: The F-measure, which has originally been introduced in information retrieval, is nowadays routinely used as a performance metric for problems such as binary classification, multi-label classification, and structured output prediction. Optimizing this measure is a statistically and computationally challenging problem, since no closed-form solution exists. Adopting a decision-theoretic perspective,… ▽ More

    Submitted 6 March, 2015; v1 submitted 17 October, 2013; originally announced October 2013.

    Journal ref: JMLR 15 (2014) 3333-3388

  21. arXiv:1209.4825  [pdf, ps, other

    cs.LG stat.ML

    Efficient Regularized Least-Squares Algorithms for Conditional Ranking on Relational Data

    Authors: Tapio Pahikkala, Antti Airola, Michiel Stock, Bernard De Baets, Willem Waegeman

    Abstract: In domains like bioinformatics, information retrieval and social network analysis, one can find learning tasks where the goal consists of inferring a ranking of objects, conditioned on a particular target object. We present a general kernel framework for learning conditional rankings from various types of relational data, where rankings can be conditioned on unseen data objects. We propose efficie… ▽ More

    Submitted 8 June, 2013; v1 submitted 21 September, 2012; originally announced September 2012.

  22. A kernel-based framework for learning graded relations from data

    Authors: Willem Waegeman, Tapio Pahikkala, Antti Airola, Tapio Salakoski, Michiel Stock, Bernard De Baets

    Abstract: Driven by a large number of potential applications in areas like bioinformatics, information retrieval and social network analysis, the problem setting of inferring relations between pairs of data objects has recently been investigated quite intensively in the machine learning community. To this end, current approaches typically consider datasets containing crisp relations, so that standard classi… ▽ More

    Submitted 28 November, 2011; originally announced November 2011.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

    Journal ref: IEEE Transactions on Fuzzy Systems, Volume: 20, Issue: 6, Dec. 2012, pages 1090 - 1101