Search | arXiv e-print repository

Adaptive Conformal Prediction by Reweighting Nonconformity Score

Authors: Salim I. Amoukou, Nicolas J. B Brunel

Abstract: Despite attractive theoretical guarantees and practical successes, Predictive Interval (PI) given by Conformal Prediction (CP) may not reflect the uncertainty of a given model. This limitation arises from CP methods using a constant correction for all test points, disregarding their individual uncertainties, to ensure coverage properties. To address this issue, we propose using a Quantile Regression Forest (QRF) to learn the distribution of nonconformity scores and utilizing the QRF's weights to assign more importance to samples with residuals similar to the test point. This approach results in PI lengths that are more aligned with the model's uncertainty. In addition, the weights learnt by the QRF provide a partition of the features space, allowing for more efficient computations and improved adaptiveness of the PI through groupwise conformalization. Our approach enjoys an assumption-free finite sample marginal and training-conditional coverage, and under suitable assumptions, it also ensures conditional coverage. Our methods work for any nonconformity score and are available as a Python package. We conduct experiments on simulated and real-world data that demonstrate significant improvements compared to existing methods.

Submitted 31 May, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

arXiv:2209.14568 [pdf, other]

Local and Regional Counterfactual Rules: Summarized and Robust Recourses

Authors: Salim I. Amoukou, Nicolas J. B Brunel

Abstract: Counterfactual Explanations (CE) face several unresolved challenges, such as ensuring stability, synthesizing multiple CEs, and providing plausibility and sparsity guarantees. From a more practical point of view, recent studies [Pawelczyk et al., 2022] show that the prescribed counterfactual recourses are often not implemented exactly by individuals and demonstrate that most state-of-the-art CE algorithms are very likely to fail in this noisy environment. To address these issues, we propose a probabilistic framework that gives a sparse local counterfactual rule for each observation, providing rules that give a range of values capable of changing decisions with high probability. These rules serve as a summary of diverse counterfactual explanations and yield robust recourses. We further aggregate these local rules into a regional counterfactual rule, identifying shared recourses for subgroups of the data. Our local and regional rules are derived from the Random Forest algorithm, which offers statistical guarantees and fidelity to data distribution by selecting recourses in high-density regions. Moreover, our rules are sparse as we first select the smallest set of variables having a high probability of changing the decision. We have conducted experiments to validate the effectiveness of our counterfactual rules in comparison to standard CE and recent similar attempts. Our methods are available as a Python package.

Submitted 18 March, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

Comments: ICML (International Conference on Machine Learning) 2023 Workshop on Counterfactuals in Minds and Machines

Journal ref: ICML (International Conference on Machine Learning) 2023 Workshop on Counterfactuals in Minds and Machines

arXiv:2111.04658 [pdf, other]

Consistent Sufficient Explanations and Minimal Local Rules for explaining regression and classification models

Authors: Salim I. Amoukou, Nicolas J. B Brunel

Abstract: To explain the decision of any model, we extend the notion of probabilistic Sufficient Explanations (P-SE). For each instance, this approach selects the minimal subset of features that is sufficient to yield the same prediction with high probability, while removing other features. The crux of P-SE is to compute the conditional probability of maintaining the same prediction. Therefore, we introduce an accurate and fast estimator of this probability via Random Forests for any data $(\boldsymbol{X}, Y)$ and show its efficiency through a theoretical analysis of its consistency. As a consequence, we extend the P-SE to regression problems. In addition, we deal with non-discrete features, without learning the distribution of $\boldsymbol{X}$ nor having the model for making predictions. Finally, we introduce local rule-based explanations for regression/classification based on the P-SE and compare our approaches w.r.t. other explainable AI methods. These methods are available as a Python package at www.github.com/salimamoukou/acv00.

Submitted 14 October, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

Comments: Accepted at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

arXiv:2106.03820 [pdf, other]

Accurate Shapley Values for explaining tree-based models

Authors: Salim I. Amoukou, Nicolas J-B. Brunel, Tangi Salaün

Abstract: Shapley Values (SV) are widely used in explainable AI, but their estimation and interpretation can be challenging, leading to inaccurate inferences and explanations. As a starting point, we recall an invariance principle for SV and derive the correct approach for computing the SV of categorical variables that are particularly sensitive to the encoding used. In the case of tree-based models, we introduce two estimators of Shapley Values that exploit the tree structure efficiently and are more accurate than state-of-the-art methods. Simulations and comparisons are performed with state-of-the-art algorithms and show the practical gain of our approach. Finally, we discuss the limitations of Shapley Values as a local explanation. These methods are available as a Python package.

Submitted 31 May, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

Comments: Accepted at the 25th International Conference on Artificial Intelligence and Statistics (AISTATS), 2022. V2: The section on Active Shapley Values has been removed in this updated version

Journal ref: AISTATS 2022

arXiv:2103.13342

The Shapley Value of coalition of variables provides better explanations

Authors: Salim I. Amoukou, Nicolas J-B. Brunel, Tangi Salaün

Abstract: While Shapley Values (SV) are one of the gold standards for interpreting machine learning models, we show that they are still poorly understood, in particular in the presence of categorical variables or of variables of low importance. For instance, we show that the popular practice that consists in summing the SV of dummy variables is false as it provides wrong estimates of all the SV in the model and implies spurious interpretations. Based on the identification of null and active coalitions, and a coalitional version of the SV, we provide a correct computation and inference of important variables. Moreover, a Python library that reliably computes conditional expectations and SV for tree-based models is implemented and compared with state-of-the-art algorithms on toy models and real data sets. All the experiments and simulations can be reproduced with the publicly available library Active Coalition of Variables, https://www.github.com/salimamoukou/acv00.

Submitted 6 April, 2022; v1 submitted 24 March, 2021; originally announced March 2021.

Comments: This paper has been withdrawn by the authors, because it has now been merged with (and superseded by) a parallel work arXiv:2106.03820

arXiv:1910.12049 [pdf, other]

Mean curvature and mean shape for multivariate functional data under Frenet-Serret framework

Authors: Juhyun Park, Nicolas J-B. Brunel

Abstract: The analysis of curves has been routinely dealt with using tools from functional data analysis. However, its extension to multi-dimensional curves poses a new challenge due to its inherent geometric features that are difficult to capture with the classical approaches that rely on linear approximations. We propose a new framework for functional data as multidimensional curves that allows us to extract geometrical features from noisy data. We define a mean through measuring shape variation of the curves. The notion of shape has been used in functional data analysis somewhat intuitively to find a common pattern in one-dimensional curves. As a generalization, we directly utilize a geometric representation of the curves through the Frenet-Serret ordinary differential equations and introduce a new definition of mean curvature and mean shape through the mean ordinary differential equation. We formulate the estimation problem in a penalized regression and develop an efficient algorithm. We demonstrate our approach with both simulated data and a real data example.

Submitted 26 October, 2019; originally announced October 2019.

arXiv:1410.7566 [pdf, other]

doi 10.1080/01621459.2013.841583

Parametric Estimation of Ordinary Differential Equations with Orthogonality Conditions

Authors: Nicolas J-B Brunel, Quentin Clairon, Florence d'Alche-Buc

Abstract: Differential equations are commonly used to model dynamical deterministic systems in applications. When statistical parameter estimation is required to calibrate theoretical models to data, classical statistical estimators are often confronted with complex and potentially ill-posed optimization problems. As a consequence, alternative estimators to classical parametric estimators are needed for obtaining reliable estimates. We propose a gradient matching approach for the estimation of parametric Ordinary Differential Equations observed with noise. Starting from a nonparametric proxy of a true solution of the ODE, we build a parametric estimator based on a variational characterization of the solution. As a Generalized Moment Estimator, our estimator must satisfy a set of orthogonal conditions that are solved in the least squares sense. Despite the use of a nonparametric estimator, we prove the root-$n$ consistency and asymptotic normality of the Orthogonal Conditions estimator. We can derive confidence sets thanks to a closed-form expression for the asymptotic variance. Finally, the OC estimator is compared to classical estimators in several (simulated and real) experiments and ODE models in order to show its versatility and relevance with respect to classical Gradient Matching and Nonlinear Least Squares estimators. In particular, we show on a real dataset of influenza infection that the approach gives reliable estimates. Moreover, we show that our approach can deal directly with more elaborate models such as Delay Differential Equations (DDE).

Submitted 28 October, 2014; originally announced October 2014.

Comments: 35 pages, 5 figures

Journal ref: Brunel, N. J-B., Clairon, Q., d'Alche-Buc, F. (2014). Parametric Estimation of Ordinary Differential Equations With Orthogonality Conditions. Journal of the American Statistical Association, 109(505), 173-185

Showing 1–7 of 7 results for author: Brunel, N J