Skip to main content

Showing 1–50 of 108 results for author: Hüllermeier, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.20031  [pdf, other

    cs.LG cs.AI

    Pairwise Difference Learning for Classification

    Authors: Mohamed Karim Belaid, Maximilian Rabus, Eyke Hüllermeier

    Abstract: Pairwise difference learning (PDL) has recently been introduced as a new meta-learning technique for regression. Instead of learning a map** from instances to outcomes in the standard way, the key idea is to learn a function that takes two instances as input and predicts the difference between the respective outcomes. Given a function of this kind, predictions for a query instance are derived fr… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.17322  [pdf, other

    cs.LG cs.AI

    ALPBench: A Benchmark for Active Learning Pipelines on Tabular Data

    Authors: Valentin Margraf, Marcel Wever, Sandra Gilhuber, Gabriel Marques Tavares, Thomas Seidl, Eyke Hüllermeier

    Abstract: In settings where only a budgeted amount of labeled data can be afforded, active learning seeks to devise query strategies for selecting the most informative data points to be labeled, aiming to enhance learning algorithms' efficiency and performance. Numerous such query strategies have been proposed and compared in the active learning literature. However, the community still lacks standardized be… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.16748  [pdf, other

    cs.LG cs.CL

    OCALM: Object-Centric Assessment with Language Models

    Authors: Timo Kaufmann, Jannis Blüml, Antonia Wüst, Quentin Delfosse, Kristian Kersting, Eyke Hüllermeier

    Abstract: Properly defining a reward signal to efficiently train a reinforcement learning (RL) agent is a challenging task. Designing balanced objective functions from which a desired behavior can emerge requires expert knowledge, especially for complex environments. Learning rewards from human feedback or using large language models (LLMs) to directly provide rewards are promising alternatives, allowing no… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted at the RLBRew Workshop at RLC 2024

  4. arXiv:2406.06560  [pdf, other

    cs.CL cs.AI

    Inverse Constitutional AI: Compressing Preferences into Principles

    Authors: Arduin Findeis, Timo Kaufmann, Eyke Hüllermeier, Samuel Albanie, Robert Mullins

    Abstract: Feedback data plays an important role in fine-tuning and evaluating state-of-the-art AI models. Often pairwise text preferences are used: given two texts, human (or AI) annotators select the "better" one. Such feedback data is widely used to align models to human preferences (e.g., reinforcement learning from human feedback), or to rank models according to human preferences (e.g., Chatbot Arena).… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  5. arXiv:2406.04041  [pdf, other

    cs.LG

    Linear Opinion Pooling for Uncertainty Quantification on Graphs

    Authors: Clemens Damke, Eyke Hüllermeier

    Abstract: We address the problem of uncertainty quantification for graph-structured data, or, more specifically, the problem to quantify the predictive uncertainty in (semi-supervised) node classification. Key questions in this regard concern the distinction between two different types of uncertainty, aleatoric and epistemic, and how to support uncertainty quantification by leveraging the structural informa… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted for the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024). Implementation available at https://github.com/Cortys/gpn-extensions

  6. arXiv:2406.02354  [pdf, other

    cs.LG stat.ML

    Label-wise Aleatoric and Epistemic Uncertainty Quantification

    Authors: Yusuf Sale, Paul Hofman, Timo Löhr, Lisa Wimmer, Thomas Nagler, Eyke Hüllermeier

    Abstract: We present a novel approach to uncertainty quantification in classification tasks based on label-wise decomposition of uncertainty measures. This label-wise perspective allows uncertainty to be quantified at the individual class level, thereby improving cost-sensitive decision-making and hel** understand the sources of uncertainty. Furthermore, it allows to define total, aleatoric, and epistemic… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Uncertainty in Artificial Intelligence. arXiv admin note: substantial text overlap with arXiv:2401.00276

  7. arXiv:2405.10852  [pdf, other

    cs.LG cs.AI

    KernelSHAP-IQ: Weighted Least-Square Optimization for Shapley Interactions

    Authors: Fabian Fumagalli, Maximilian Muschalik, Patrick Kolpaczki, Eyke Hüllermeier, Barbara Hammer

    Abstract: The Shapley value (SV) is a prevalent approach of allocating credit to machine learning (ML) entities to understand black box ML models. Enriching such interpretations with higher-order interactions is inevitable for complex systems, where the Shapley Interaction Index (SII) is a direct axiomatic extension of the SV. While it is well-known that the SV yields an optimal approximation of any game vi… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted Paper at ICML 2024. This version is not the Camera Ready Version

  8. arXiv:2405.02200  [pdf, other

    cs.LG stat.ML

    Position: Why We Must Rethink Empirical Research in Machine Learning

    Authors: Moritz Herrmann, F. Julian D. Lange, Katharina Eggensperger, Giuseppe Casalicchio, Marcel Wever, Matthias Feurer, David Rügamer, Eyke Hüllermeier, Anne-Laure Boulesteix, Bernd Bischl

    Abstract: We warn against a common but incomplete understanding of empirical research in machine learning that leads to non-replicable results, makes findings unreliable, and threatens to undermine progress in the field. To overcome this alarming situation, we call for more awareness of the plurality of ways of gaining knowledge experimentally but also of some epistemic limitations. In particular, we argue… ▽ More

    Submitted 25 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 20 pages, accepted for publication at ICML 2024, camera-ready version

  9. arXiv:2404.12215  [pdf, other

    cs.LG stat.ML

    Quantifying Aleatoric and Epistemic Uncertainty with Proper Scoring Rules

    Authors: Paul Hofman, Yusuf Sale, Eyke Hüllermeier

    Abstract: Uncertainty representation and quantification are paramount in machine learning and constitute an important prerequisite for safety-critical applications. In this paper, we propose novel measures for the quantification of aleatoric and epistemic uncertainty based on proper scoring rules, which are loss functions with the meaningful property that they incentivize the learner to predict ground-truth… ▽ More

    Submitted 19 April, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  10. arXiv:2403.04629  [pdf, other

    cs.LG cs.AI cs.HC cs.RO stat.ML

    Explaining Bayesian Optimization by Shapley Values Facilitates Human-AI Collaboration

    Authors: Julian Rodemann, Federico Croppi, Philipp Arens, Yusuf Sale, Julia Herbinger, Bernd Bischl, Eyke Hüllermeier, Thomas Augustin, Conor J. Walsh, Giuseppe Casalicchio

    Abstract: Bayesian optimization (BO) with Gaussian processes (GP) has become an indispensable algorithm for black box optimization problems. Not without a dash of irony, BO is often considered a black box itself, lacking ways to provide reasons as to why certain parameters are proposed to be evaluated. This is particularly relevant in human-in-the-loop applications of BO, such as in robotics. We address thi… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Preprint. Copyright by the authors. 19 pages, 24 figures

    ACM Class: I.2.6; I.2.9; F.2.2; J.6

  11. arXiv:2402.10723  [pdf, other

    stat.ML cs.LG

    Conformalized Credal Set Predictors

    Authors: Alireza Javanmardi, David Stutz, Eyke Hüllermeier

    Abstract: Credal sets are sets of probability distributions that are considered as candidates for an imprecisely known ground-truth distribution. In machine learning, they have recently attracted attention as an appealing formalism for uncertainty representation, in particular due to their ability to represent both the aleatoric and epistemic uncertainty in a prediction. However, the design of methods for l… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  12. arXiv:2402.09056  [pdf, other

    cs.AI cs.LG

    Is Epistemic Uncertainty Faithfully Represented by Evidential Deep Learning Methods?

    Authors: Mira Jürgens, Nis Meinert, Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: Trustworthy ML systems should not only return accurate predictions, but also a reliable representation of their uncertainty. Bayesian methods are commonly used to quantify both aleatoric and epistemic uncertainty, but alternative approaches, such as evidential deep learning methods, have become popular in recent years. The latter group of methods in essence extends empirical risk minimization (ERM… ▽ More

    Submitted 20 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  13. arXiv:2401.14283  [pdf, other

    stat.ML cs.LG

    Information Leakage Detection through Approximate Bayes-optimal Prediction

    Authors: Pritha Gupta, Marcel Wever, Eyke Hüllermeier

    Abstract: In today's data-driven world, the proliferation of publicly available information intensifies the challenge of information leakage (IL), raising security concerns. IL involves unintentionally exposing secret (sensitive) information to unauthorized parties via systems' observable information. Conventional statistical approaches, which estimate mutual information (MI) between observable and secret i… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Under submission in JMLR

    MSC Class: 94A15; 62H30; 94A60 ACM Class: I.5.1; G.3; E.3

  14. arXiv:2401.13371  [pdf, other

    cs.GT

    SVARM-IQ: Efficient Approximation of Any-order Shapley Interactions through Stratification

    Authors: Patrick Kolpaczki, Maximilian Muschalik, Fabian Fumagalli, Barbara Hammer, Eyke Hüllermeier

    Abstract: Addressing the limitations of individual attribution scores via the Shapley value (SV), the field of explainable AI (XAI) has recently explored intricate interactions of features or data points. In particular, extensions of the SV, such as the Shapley Interaction Index (SII), have been proposed as a measure to still benefit from the axiomatic basis of the SV. However, similar to the SV, their exac… ▽ More

    Submitted 1 March, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  15. Beyond TreeSHAP: Efficient Computation of Any-Order Shapley Interactions for Tree Ensembles

    Authors: Maximilian Muschalik, Fabian Fumagalli, Barbara Hammer, Eyke Hüllermeier

    Abstract: While shallow decision trees may be interpretable, larger ensemble models like gradient-boosted trees, which often set the state of the art in machine learning problems involving tabular data, still remain black box models. As a remedy, the Shapley value (SV) is a well-known concept in explainable artificial intelligence (XAI) research for quantifying additive feature attributions of predictions.… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  16. arXiv:2401.00276  [pdf, other

    cs.LG stat.ML

    Second-Order Uncertainty Quantification: Variance-Based Measures

    Authors: Yusuf Sale, Paul Hofman, Lisa Wimmer, Eyke Hüllermeier, Thomas Nagler

    Abstract: Uncertainty quantification is a critical aspect of machine learning models, providing important insights into the reliability of predictions and aiding the decision-making process in real-world applications. This paper proposes a novel way to use variance-based measures to quantify uncertainty on the basis of second-order distributions in classification problems. A distinctive feature of the measu… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 22 pages, 10 figures

  17. arXiv:2312.14925  [pdf, ps, other

    cs.LG

    A Survey of Reinforcement Learning from Human Feedback

    Authors: Timo Kaufmann, Paul Weng, Viktor Bengs, Eyke Hüllermeier

    Abstract: Reinforcement learning from human feedback (RLHF) is a variant of reinforcement learning (RL) that learns from human feedback instead of relying on an engineered reward function. Building on prior work on the related setting of preference-based reinforcement learning (PbRL), it stands at the intersection of artificial intelligence and human-computer interaction. This positioning offers a promising… ▽ More

    Submitted 30 April, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    ACM Class: I.2.6

  18. arXiv:2312.00995  [pdf, other

    cs.LG stat.ML

    Second-Order Uncertainty Quantification: A Distance-Based Approach

    Authors: Yusuf Sale, Viktor Bengs, Michele Caprio, Eyke Hüllermeier

    Abstract: In the past couple of years, various approaches to representing and quantifying different types of predictive uncertainty in machine learning, notably in the setting of classification, have been proposed on the basis of second-order probability distributions, i.e., predictions in the form of distributions on probability distributions. A completely conclusive solution has not yet been found, howeve… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 16 pages, 2 figures

  19. arXiv:2311.11908  [pdf, other

    cs.LG cs.AI cs.CV

    Continual Learning: Applications and the Road Forward

    Authors: Eli Verwimp, Rahaf Aljundi, Shai Ben-David, Matthias Bethge, Andrea Cossu, Alexander Gepperth, Tyler L. Hayes, Eyke Hüllermeier, Christopher Kanan, Dhireesha Kudithipudi, Christoph H. Lampert, Martin Mundt, Razvan Pascanu, Adrian Popescu, Andreas S. Tolias, Joost van de Weijer, Bing Liu, Vincenzo Lomonaco, Tinne Tuytelaars, Gido M. van de Ven

    Abstract: Continual learning is a subfield of machine learning, which aims to allow machine learning models to continuously learn on new data, by accumulating knowledge without forgetting what was learned in the past. In this work, we take a step back, and ask: "Why should one care about continual learning in the first place?". We set the stage by examining recent continual learning papers published at four… ▽ More

    Submitted 28 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Journal ref: Transactions on Machine Learning Research (TMLR), 2024

  20. arXiv:2310.00750  [pdf, ps, other

    cs.LG stat.ML

    Identifying Copeland Winners in Dueling Bandits with Indifferences

    Authors: Viktor Bengs, Björn Haddenhorst, Eyke Hüllermeier

    Abstract: We consider the task of identifying the Copeland winner(s) in a dueling bandits problem with ternary feedback. This is an underexplored but practically relevant variant of the conventional dueling bandits problem, in which, in addition to strict preference between two arms, one may observe feedback in the form of an indifference. We provide a lower bound on the sample complexity for any learning a… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    MSC Class: 68W27 (Primary) 68T05 (Secondary)

  21. arXiv:2309.02048  [pdf, other

    cs.LG stat.ML

    Probabilistic Self-supervised Learning via Scoring Rules Minimization

    Authors: Amirhossein Vahidi, Simon Schoßer, Lisa Wimmer, Yawei Li, Bernd Bischl, Eyke Hüllermeier, Mina Rezaei

    Abstract: In this paper, we propose a novel probabilistic self-supervised learning via Scoring Rule Minimization (ProSMIN), which leverages the power of probabilistic models to enhance representation quality and mitigate collapsing representations. Our proposed approach involves two neural networks; the online network and the target network, which collaborate and learn the diverse distribution of representa… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  22. arXiv:2308.14705  [pdf, other

    stat.ML cs.LG

    Diversified Ensemble of Independent Sub-Networks for Robust Self-Supervised Representation Learning

    Authors: Amirhossein Vahidi, Lisa Wimmer, Hüseyin Anil Gündüz, Bernd Bischl, Eyke Hüllermeier, Mina Rezaei

    Abstract: Ensembling a neural network is a widely recognized approach to enhance model performance, estimate uncertainty, and improve robustness in deep supervised learning. However, deep ensembles often come with high computational costs and memory demands. In addition, the efficiency of a deep ensemble is related to diversity among the ensemble members which is challenging for large, over-parameterized de… ▽ More

    Submitted 1 September, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  23. arXiv:2308.10622  [pdf, other

    stat.ME cs.AI cs.LG

    Weighting by Tying: A New Approach to Weighted Rank Correlation

    Authors: Sascha Henzgen, Eyke Hüllermeier

    Abstract: Measures of rank correlation are commonly used in statistics to capture the degree of concordance between two orderings of the same set of items. Standard measures like Kendall's tau and Spearman's rho coefficient put equal emphasis on each position of a ranking. Yet, motivated by applications in which some of the positions (typically those on the top) are more important than others, a few weighte… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 15 pages

  24. arXiv:2307.06831  [pdf, ps, other

    stat.ML cs.LG

    A Novel Bayes' Theorem for Upper Probabilities

    Authors: Michele Caprio, Yusuf Sale, Eyke Hüllermeier, Insup Lee

    Abstract: In their seminal 1990 paper, Wasserman and Kadane establish an upper bound for the Bayes' posterior probability of a measurable set $A$, when the prior lies in a class of probability measures $\mathcal{P}$ and the likelihood is precise. They also give a sufficient condition for such upper bound to hold with equality. In this paper, we introduce a generalization of their result by additionally addr… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: text overlap with arXiv:2302.09656

    MSC Class: 68T37

  25. arXiv:2306.09586  [pdf, other

    cs.LG cs.AI stat.ML

    Is the Volume of a Credal Set a Good Measure for Epistemic Uncertainty?

    Authors: Yusuf Sale, Michele Caprio, Eyke Hüllermeier

    Abstract: Adequate uncertainty representation and quantification have become imperative in various scientific disciplines, especially in machine learning and artificial intelligence. As an alternative to representing uncertainty via one single probability measure, we consider credal sets (convex sets of probability measures). The geometric representation of credal sets as $d$-dimensional polytopes implies a… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  26. iPDP: On Partial Dependence Plots in Dynamic Modeling Scenarios

    Authors: Maximilian Muschalik, Fabian Fumagalli, Rohit Jagtani, Barbara Hammer, Eyke Hüllermeier

    Abstract: Post-hoc explanation techniques such as the well-established partial dependence plot (PDP), which investigates feature dependencies, are used in explainable artificial intelligence (XAI) to understand black-box machine learning models. While many real-world applications require dynamic models that constantly adapt over time and react to changes in the underlying distribution, XAI, so far, has prim… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections

  27. arXiv:2306.01191  [pdf, ps, other

    cs.LG stat.ML

    Conformal Prediction with Partially Labeled Data

    Authors: Alireza Javanmardi, Yusuf Sale, Paul Hofman, Eyke Hüllermeier

    Abstract: While the predictions produced by conformal prediction are set-valued, the data used for training and calibration is supposed to be precise. In the setting of superset learning or learning from partial labels, a variant of weakly supervised learning, it is exactly the other way around: training data is possibly imprecise (set-valued), but the model induced from this data yields precise predictions… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  28. UNGOML: Automated Classification of unsafe Usages in Go

    Authors: Anna-Katharina Wickert, Clemens Damke, Lars Baumgärtner, Eyke Hüllermeier, Mira Mezini

    Abstract: The Go programming language offers strong protection from memory corruption. As an escape hatch of these protections, it provides the unsafe package. Previous studies identified that this unsafe package is frequently used in real-world code for several purposes, e.g., serialization or casting types. Due to the variety of these reasons, it may be possible to refactor specific usages to avoid potent… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 13 pages, accepted at the 2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR 2023)

  29. arXiv:2305.16215  [pdf, other

    cs.LG eess.SY math.DS stat.ML

    Koopman Kernel Regression

    Authors: Petar Bevanda, Max Beier, Armin Lederer, Stefan Sosnowski, Eyke Hüllermeier, Sandra Hirche

    Abstract: Many machine learning approaches for decision making, such as reinforcement learning, rely on simulators or predictive models to forecast the time-evolution of quantities of interest, e.g., the state of an agent or the reward of a policy. Forecasts of such complex phenomena are commonly described by highly nonlinear dynamical systems, making their use in optimization-based decision-making challeng… ▽ More

    Submitted 16 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to the thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  30. arXiv:2305.13764  [pdf, other

    cs.LG

    Mitigating Label Noise through Data Ambiguation

    Authors: Julian Lienen, Eyke Hüllermeier

    Abstract: Label noise poses an important challenge in machine learning, especially in deep learning, in which large models with high expressive power dominate the field. Models of that kind are prone to memorizing incorrect labels, thereby harming generalization performance. Many methods have been proposed to address this problem, including robust loss functions and more complex label correction approaches.… ▽ More

    Submitted 25 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Paper incl. appendix accepted at AAAI-2024 (cf. copyright remark on title page), 20 pages, 9 figures

  31. arXiv:2305.00983  [pdf, other

    cs.CV

    Detecting Novelties with Empty Classes

    Authors: Svenja Uhlemeyer, Julian Lienen, Eyke Hüllermeier, Hanno Gottschalk

    Abstract: For open world applications, deep neural networks (DNNs) need to be aware of previously unseen data and adaptable to evolving environments. Furthermore, it is desirable to detect and learn novel classes which are not included in the DNNs underlying set of semantic classes in an unsupervised fashion. The method proposed in this article builds upon anomaly detection to retrieve out-of-distribution (… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: 13 pages, 13 figures, 4 tables

  32. arXiv:2304.01224  [pdf, other

    cs.LG cs.AI cs.GT

    Optimizing Data Shapley Interaction Calculation from O(2^n) to O(t n^2) for KNN models

    Authors: Mohamed Karim Belaid, Dorra El Mekki, Maximilian Rabus, Eyke Hüllermeier

    Abstract: With the rapid growth of data availability and usage, quantifying the added value of each training data point has become a crucial process in the field of artificial intelligence. The Shapley values have been recognized as an effective method for data valuation, enabling efficient training set summarization, acquisition, and outlier removal. In this paper, we introduce "STI-KNN", an innovative alg… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

  33. iSAGE: An Incremental Version of SAGE for Online Explanation on Data Streams

    Authors: Maximilian Muschalik, Fabian Fumagalli, Barbara Hammer, Eyke Hüllermeier

    Abstract: Existing methods for explainable artificial intelligence (XAI), including popular feature importance measures such as SAGE, are mostly restricted to the batch learning scenario. However, machine learning is often applied in dynamic environments, where data arrives continuously and learning must be done in an online manner. Therefore, we propose iSAGE, a time- and memory-efficient incrementalizatio… ▽ More

    Submitted 14 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  34. arXiv:2303.01179  [pdf, other

    cs.LG cs.AI

    SHAP-IQ: Unified Approximation of any-order Shapley Interactions

    Authors: Fabian Fumagalli, Maximilian Muschalik, Patrick Kolpaczki, Eyke Hüllermeier, Barbara Hammer

    Abstract: Predominately in explainable artificial intelligence (XAI) research, the Shapley value (SV) is applied to determine feature attributions for any black box model. Shapley interaction indices extend the SV to define any-order feature interactions. Defining a unique Shapley interaction index is an open research question and, so far, three definitions have been proposed, which differ by their choice o… ▽ More

    Submitted 30 October, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  35. arXiv:2302.00736  [pdf, other

    cs.LG cs.GT

    Approximating the Shapley Value without Marginal Contributions

    Authors: Patrick Kolpaczki, Viktor Bengs, Maximilian Muschalik, Eyke Hüllermeier

    Abstract: The Shapley value, which is arguably the most popular approach for assigning a meaningful contribution value to players in a cooperative game, has recently been used intensively in explainable artificial intelligence. Its meaningfulness is due to axiomatic properties that only the Shapley value satisfies, which, however, comes at the expense of an exact computation growing exponentially with the n… ▽ More

    Submitted 30 January, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

  36. arXiv:2302.00511  [pdf, other

    cs.LG cs.AI

    Iterative Deepening Hyperband

    Authors: Jasmin Brandt, Marcel Wever, Dimitrios Iliadis, Viktor Bengs, Eyke Hüllermeier

    Abstract: Hyperparameter optimization (HPO) is concerned with the automated search for the most appropriate hyperparameter configuration (HPC) of a parameterized machine learning algorithm. A state-of-the-art HPO method is Hyperband, which, however, has its own parameters that influence its performance. One of these parameters, the maximal budget, is especially problematic: If chosen too small, the budget n… ▽ More

    Submitted 6 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

  37. arXiv:2301.12736  [pdf, ps, other

    cs.LG stat.ML

    On Second-Order Scoring Rules for Epistemic Uncertainty Quantification

    Authors: Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: It is well known that accurate probabilistic predictors can be trained through empirical risk minimisation with proper scoring rules as loss functions. While such learners capture so-called aleatoric uncertainty of predictions, various machine learning methods have recently been developed with the goal to let the learner also represent its epistemic uncertainty, i.e., the uncertainty caused by a l… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

    MSC Class: 68T37 (Primary) 68T30 (Secondary)

  38. arXiv:2212.14612  [pdf, other

    cs.LG stat.AP

    Conformal Prediction Intervals for Remaining Useful Lifetime Estimation

    Authors: Alireza Javanmardi, Eyke Hüllermeier

    Abstract: The main objective of Prognostics and Health Management is to estimate the Remaining Useful Lifetime (RUL), namely, the time that a system or a piece of equipment is still in working order before starting to function incorrectly. In recent years, numerous machine learning algorithms have been proposed for RUL estimation, mainly focusing on providing more accurate RUL predictions. However, there ar… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

  39. arXiv:2212.00333  [pdf, other

    cs.LG cs.DS

    AC-Band: A Combinatorial Bandit-Based Approach to Algorithm Configuration

    Authors: Jasmin Brandt, Elias Schede, Viktor Bengs, Björn Haddenhorst, Eyke Hüllermeier, Kevin Tierney

    Abstract: We study the algorithm configuration (AC) problem, in which one seeks to find an optimal parameter configuration of a given target algorithm in an automated way. Recently, there has been significant progress in designing AC approaches that satisfy strong theoretical guarantees. However, a significant gap still remains between the practical performance of these approaches and state-of-the-art heuri… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  40. arXiv:2209.03302  [pdf, other

    cs.LG

    Quantifying Aleatoric and Epistemic Uncertainty in Machine Learning: Are Conditional Entropy and Mutual Information Appropriate Measures?

    Authors: Lisa Wimmer, Yusuf Sale, Paul Hofman, Bern Bischl, Eyke Hüllermeier

    Abstract: The quantification of aleatoric and epistemic uncertainty in terms of conditional entropy and mutual information, respectively, has recently become quite common in machine learning. While the properties of these measures, which are rooted in information theory, seem appealing at first glance, we identify various incoherencies that call their appropriateness into question. In addition to the measur… ▽ More

    Submitted 25 June, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: To appear in: Proc. UAI, 39th Conference on Uncertainty in Artificial Intelligence, Pittsburgh, PA, USA, 2023

  41. Incremental Permutation Feature Importance (iPFI): Towards Online Explanations on Data Streams

    Authors: Fabian Fumagalli, Maximilian Muschalik, Eyke Hüllermeier, Barbara Hammer

    Abstract: Explainable Artificial Intelligence (XAI) has mainly focused on static learning scenarios so far. We are interested in dynamic scenarios where data is sampled progressively, and learning is done in an incremental rather than a batch mode. We seek efficient incremental algorithms for computing feature importance (FI) measures, specifically, an incremental FI measure based on feature marginalization… ▽ More

    Submitted 7 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

  42. arXiv:2207.14160  [pdf, other

    cs.SE cs.AI

    Do We Need Another Explainable AI Method? Toward Unifying Post-hoc XAI Evaluation Methods into an Interactive and Multi-dimensional Benchmark

    Authors: Mohamed Karim Belaid, Eyke Hüllermeier, Maximilian Rabus, Ralf Krestel

    Abstract: In recent years, Explainable AI (xAI) attracted a lot of attention as various countries turned explanations into a legal right. xAI allows for improving models beyond the accuracy metric by, e.g., debugging the learned pattern and demystifying the AI's behavior. The widespread use of xAI brought new challenges. On the one hand, the number of published xAI algorithms underwent a boom, and it became… ▽ More

    Submitted 4 October, 2022; v1 submitted 8 June, 2022; originally announced July 2022.

  43. arXiv:2206.05530  [pdf, other

    cs.LG

    Memorization-Dilation: Modeling Neural Collapse Under Label Noise

    Authors: Duc Anh Nguyen, Ron Levie, Julian Lienen, Gitta Kutyniok, Eyke Hüllermeier

    Abstract: The notion of neural collapse refers to several emergent phenomena that have been empirically observed across various canonical classification problems. During the terminal phase of training a deep neural network, the feature embedding of all examples of the same class tend to collapse to a single representation, and the features of different classes tend to separate as much as possible. Neural co… ▽ More

    Submitted 4 April, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: to be published at ICLR 2023

  44. arXiv:2205.15239  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Conformal Credal Self-Supervised Learning

    Authors: Julian Lienen, Caglar Demir, Eyke Hüllermeier

    Abstract: In semi-supervised learning, the paradigm of self-training refers to the idea of learning from pseudo-labels suggested by the learner itself. Across various domains, corresponding methods have proven effective and achieve state-of-the-art performance. However, pseudo-labels typically stem from ad-hoc heuristics, relying on the quality of the predictions though without guaranteeing their validity.… ▽ More

    Submitted 9 June, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: 26 pages, 5 figures, 10 tables, to be published at the 12th Symposium on Conformal and Probabilistic Prediction with Applications (COPA 2023)

  45. arXiv:2205.10082  [pdf, other

    stat.ML cs.LG

    On the Calibration of Probabilistic Classifier Sets

    Authors: Thomas Mortier, Viktor Bengs, Eyke Hüllermeier, Stijn Luca, Willem Waegeman

    Abstract: Multi-class classification methods that produce sets of probabilistic classifiers, such as ensemble learning methods, are able to model aleatoric and epistemic uncertainty. Aleatoric uncertainty is then typically quantified via the Bayes error, and epistemic uncertainty via the size of the set. In this paper, we extend the notion of calibration, which is commonly used to evaluate the validity of t… ▽ More

    Submitted 19 April, 2023; v1 submitted 20 May, 2022; originally announced May 2022.

  46. arXiv:2203.06676  [pdf, other

    cs.LG cs.AI stat.ML

    Set-valued prediction in hierarchical classification with constrained representation complexity

    Authors: Thomas Mortier, Eyke Hüllermeier, Krzysztof Dembczyński, Willem Waegeman

    Abstract: Set-valued prediction is a well-known concept in multi-class classification. When a classifier is uncertain about the class label for a test instance, it can predict a set of classes instead of a single class. In this paper, we focus on hierarchical multi-class classification problems, where valid sets (typically) correspond to internal nodes of the hierarchy. We argue that this is a very strong r… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

  47. arXiv:2203.06102  [pdf, other

    cs.LG stat.ML

    Pitfalls of Epistemic Uncertainty Quantification through Loss Minimisation

    Authors: Viktor Bengs, Eyke Hüllermeier, Willem Waegeman

    Abstract: Uncertainty quantification has received increasing attention in machine learning in the recent past. In particular, a distinction between aleatoric and epistemic uncertainty has been found useful in this regard. The latter refers to the learner's (lack of) knowledge and appears to be especially difficult to measure and quantify. In this paper, we analyse a recent proposal based on the idea of a se… ▽ More

    Submitted 13 October, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    MSC Class: 68T37 (Primary) 68T30 (Secondary)

  48. arXiv:2202.04593  [pdf, other

    cs.LG stat.ML

    Stochastic Contextual Dueling Bandits under Linear Stochastic Transitivity Models

    Authors: Viktor Bengs, Aadirupa Saha, Eyke Hüllermeier

    Abstract: We consider the regret minimization task in a dueling bandits problem with context information. In every round of the sequential decision problem, the learner makes a context-dependent selection of two choice alternatives (arms) to be compared with each other and receives feedback in the form of noisy preference information. We assume that the feedback process is determined by a linear stochastic… ▽ More

    Submitted 13 October, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    MSC Class: 68W27 (Primary) 68T05 (Secondary)

    Journal ref: Proceedings of the 39th International Conference on Machine Learning (ICML), PMLR 162:1764-1786, 2022

  49. arXiv:2202.04487  [pdf, other

    cs.LG stat.ML

    Finding Optimal Arms in Non-stochastic Combinatorial Bandits with Semi-bandit Feedback and Finite Budget

    Authors: Jasmin Brandt, Viktor Bengs, Björn Haddenhorst, Eyke Hüllermeier

    Abstract: We consider the combinatorial bandits problem with semi-bandit feedback under finite sampling budget constraints, in which the learner can carry out its action only for a limited number of times specified by an overall budget. The action is to choose a set of arms, whereupon feedback for each arm in the chosen set is received. Unlike existing works, we study this problem in a non-stochastic settin… ▽ More

    Submitted 14 October, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    MSC Class: 68Q32 (Primary) 68T05; 68W27 (Secondary)

  50. A Survey of Methods for Automated Algorithm Configuration

    Authors: Elias Schede, Jasmin Brandt, Alexander Tornede, Marcel Wever, Viktor Bengs, Eyke Hüllermeier, Kevin Tierney

    Abstract: Algorithm configuration (AC) is concerned with the automated search of the most suitable parameter configuration of a parametrized algorithm. There is currently a wide variety of AC problem variants and methods proposed in the literature. Existing reviews do not take into account all derivatives of the AC problem, nor do they offer a complete classification scheme. To this end, we introduce taxono… ▽ More

    Submitted 13 October, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    ACM Class: I.2.6

    Journal ref: Journal of Artificial Intelligence Research (JAIR) 75 (2022) 425-487