Skip to main content

Showing 1–18 of 18 results for author: Hooker, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.18702  [pdf, other

    cs.LG cs.CR stat.AP stat.ML

    Why You Should Not Trust Interpretations in Machine Learning: Adversarial Attacks on Partial Dependence Plots

    Authors: Xi Xin, Giles Hooker, Fei Huang

    Abstract: The adoption of artificial intelligence (AI) across industries has led to the widespread use of complex black-box models and interpretation tools for decision making. This paper proposes an adversarial framework to uncover the vulnerability of permutation-based interpretation methods for machine learning tasks, with a particular focus on partial dependence (PD) plots. This adversarial framework mo… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2403.00105  [pdf, other

    cs.LG cs.CY

    Longitudinal Counterfactuals: Constraints and Opportunities

    Authors: Alexander Asemota, Giles Hooker

    Abstract: Counterfactual explanations are a common approach to providing recourse to data subjects. However, current methodology can produce counterfactuals that cannot be achieved by the subject, making the use of counterfactuals for recourse difficult to justify in practice. Though there is agreement that plausibility is an important quality when using counterfactuals for algorithmic recourse, ground trut… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  3. arXiv:2401.15800  [pdf, other

    stat.ML cs.LG

    Provably Stable Feature Rankings with SHAP and LIME

    Authors: Jeremy Goldwasser, Giles Hooker

    Abstract: Feature attributions are ubiquitous tools for understanding the predictions of machine learning models. However, the calculation of popular methods for scoring input variables such as SHAP and LIME suffers from high instability due to random sampling. Leveraging ideas from multiple hypothesis testing, we devise attribution methods that ensure the most important features are ranked correctly with h… ▽ More

    Submitted 2 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  4. arXiv:2310.07672  [pdf, other

    stat.ML cs.LG

    Stabilizing Estimates of Shapley Values with Control Variates

    Authors: Jeremy Goldwasser, Giles Hooker

    Abstract: Shapley values are among the most popular tools for explaining predictions of blackbox machine learning models. However, their high computational cost motivates the use of sampling approximations, inducing a considerable degree of uncertainty. To stabilize these model explanations, we propose ControlSHAP, an approach based on the Monte Carlo technique of control variates. Our methodology is applic… ▽ More

    Submitted 9 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  5. arXiv:2211.12631  [pdf, other

    stat.ML cs.LG

    A Generic Approach for Reproducible Model Distillation

    Authors: Yunzhe Zhou, Peiru Xu, Giles Hooker

    Abstract: Model distillation has been a popular method for producing interpretable machine learning. It uses an interpretable "student" model to mimic the predictions made by the black box "teacher" model. However, when the student model is sensitive to the variability of the data sets used for training even when kee** the teacher fixed, the corresponded interpretation is not reliable. Existing strategies… ▽ More

    Submitted 27 April, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: 31 pages, 8 figures

  6. arXiv:2209.00147  [pdf, other

    stat.ML cs.LG stat.ME

    The Infinitesimal Jackknife and Combinations of Models

    Authors: Indrayudh Ghosal, Yunzhe Zhou, Giles Hooker

    Abstract: The Infinitesimal Jackknife is a general method for estimating variances of parametric models, and more recently also for some ensemble methods. In this paper we extend the Infinitesimal Jackknife to estimate the covariance between any two models. This can be used to quantify uncertainty for combinations of models, or to construct test statistics for comparing different models or ensembles of mode… ▽ More

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: 47 pages, 11 figures

  7. S-LIME: Stabilized-LIME for Model Explanation

    Authors: Zhengze Zhou, Giles Hooker, Fei Wang

    Abstract: An increasing number of machine learning models have been deployed in domains with high stakes such as finance and healthcare. Despite their superior performances, many models are black boxes in nature which are hard to explain. There are growing efforts for researchers to develop methods to interpret these black-box models. Post hoc explanations based on perturbations, such as LIME, are widely us… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '21), August 14--18, 2021, Virtual Event, Singapore

  8. arXiv:2102.12328  [pdf, ps, other

    stat.OT cs.LG

    Bridging Breiman's Brook: From Algorithmic Modeling to Statistical Learning

    Authors: Lucas Mentch, Giles Hooker

    Abstract: In 2001, Leo Breiman wrote of a divide between "data modeling" and "algorithmic modeling" cultures. Twenty years later this division feels far more ephemeral, both in terms of assigning individuals to camps, and in terms of intellectual boundaries. We argue that this is largely due to the "data modelers" incorporating algorithmic methods into their toolbox, particularly driven by recent developmen… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: In response to the Journal of Observational Studies reprinting Leo Breiman's paper "Statistical Modeling: The Two Cultures" on its 20th anniversary

  9. arXiv:1912.01089  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    $V$-statistics and Variance Estimation

    Authors: Zhengze Zhou, Lucas Mentch, Giles Hooker

    Abstract: This paper develops a general framework for analyzing asymptotics of $V$-statistics. Previous literature on limiting distribution mainly focuses on the cases when $n \to \infty$ with fixed kernel size $k$. Under some regularity conditions, we demonstrate asymptotic normality when $k$ grows with $n$ by utilizing existing results for $U$-statistics. The key in our approach lies in a mathematical red… ▽ More

    Submitted 6 May, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: This version supersedes the previous technical report titled "Asymptotic Normality and Variance Estimation For Supervised Ensembles". Extensive simulations are added and we also provide a more detailed discussion on the bias phenomenon in variance estimation

  10. arXiv:1911.04974  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Purifying Interaction Effects with the Functional ANOVA: An Efficient Algorithm for Recovering Identifiable Additive Models

    Authors: Benjamin Lengerich, Sarah Tan, Chun-Hao Chang, Giles Hooker, Rich Caruana

    Abstract: Models which estimate main effects of individual variables alongside interaction effects have an identifiability challenge: effects can be freely moved between main effects and interaction effects without changing the model prediction. This is a critical problem for interpretability because it permits "contradictory" models to represent the same function. To solve this problem, we propose pure int… ▽ More

    Submitted 1 May, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

    Comments: AISTATS 2020

  11. arXiv:1905.03151  [pdf, other

    stat.ME cs.LG stat.ML

    Unrestricted Permutation forces Extrapolation: Variable Importance Requires at least One More Model, or There Is No Free Variable Importance

    Authors: Giles Hooker, Lucas Mentch, Siyu Zhou

    Abstract: This paper reviews and advocates against the use of permute-and-predict (PaP) methods for interpreting black box functions. Methods such as the variable importance measures proposed for random forests, partial dependence plots, and individual conditional expectation plots remain popular because they are both model-agnostic and depend only on the pre-trained model output, making them computationall… ▽ More

    Submitted 7 October, 2021; v1 submitted 1 May, 2019; originally announced May 2019.

    MSC Class: 62G08 ACM Class: I.5.1

  12. arXiv:1903.05179  [pdf, other

    stat.ML cs.LG

    Unbiased Measurement of Feature Importance in Tree-Based Methods

    Authors: Zhengze Zhou, Giles Hooker

    Abstract: We propose a modification that corrects for split-improvement variable importance measures in Random Forests and other tree-based methods. These methods have been shown to be biased towards increasing the importance of features with more potential splits. We show that by appropriately incorporating split-improvement as measured on out of sample data, this bias can be corrected yielding better summ… ▽ More

    Submitted 23 March, 2020; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: add Section 3.4 to compare with other methods for dealing with similar bias; add more simulation results in Section 5; add link to Github repository for code access

  13. arXiv:1808.07573  [pdf, other

    stat.ML cs.LG

    Approximation Trees: Statistical Stability in Model Distillation

    Authors: Yichen Zhou, Zhengze Zhou, Giles Hooker

    Abstract: This paper examines the stability of learned explanations for black-box predictions via model distillation with decision trees. One approach to intelligibility in machine learning is to use an understandable `student' model to mimic the output of an accurate `teacher'. Here, we consider the use of regression trees as a student model, in which nodes of the tree can be used as `explanations' for par… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Comments: This paper supercedes arXiv:1610.09036

  14. arXiv:1803.08000  [pdf, other

    stat.ML cs.LG stat.ME

    Boosting Random Forests to Reduce Bias; One-Step Boosted Forest and its Variance Estimate

    Authors: Indrayudh Ghosal, Giles Hooker

    Abstract: In this paper we propose using the principle of boosting to reduce the bias of a random forest prediction in the regression setting. From the original random forest fit we extract the residuals and then fit another random forest to these residuals. We call the sum of these two random forests a \textit{one-step boosted forest}. We show with simulated and real data that the one-step boosted forest h… ▽ More

    Submitted 22 April, 2020; v1 submitted 21 March, 2018; originally announced March 2018.

    Comments: 39 pages, 7 tables, 3 figures

  15. Considerations When Learning Additive Explanations for Black-Box Models

    Authors: Sarah Tan, Giles Hooker, Paul Koch, Albert Gordo, Rich Caruana

    Abstract: Many methods to explain black-box models, whether local or global, are additive. In this paper, we study global additive explanations for non-additive models, focusing on four explanation methods: partial dependence, Shapley explanations adapted to a global setting, distilled additive explanations, and gradient-based explanations. We show that different explanation methods characterize non-additiv… ▽ More

    Submitted 31 July, 2023; v1 submitted 25 January, 2018; originally announced January 2018.

    Comments: Published at Machine Learning (2023). Previously titled "Learning Global Additive Explanations for Neural Nets Using Model Distillation". A short version was presented at NeurIPS 2018 Machine Learning for Health Workshop

  16. arXiv:1710.06169  [pdf, other

    stat.ML cs.AI cs.LG

    Distill-and-Compare: Auditing Black-Box Models Using Transparent Model Distillation

    Authors: Sarah Tan, Rich Caruana, Giles Hooker, Yin Lou

    Abstract: Black-box risk scoring models permeate our lives, yet are typically proprietary or opaque. We propose Distill-and-Compare, a model distillation and comparison approach to audit such models. To gain insight into black-box models, we treat them as teachers, training transparent student models to mimic the risk scores assigned by black-box models. We compare the student model trained with distillatio… ▽ More

    Submitted 11 October, 2018; v1 submitted 17 October, 2017; originally announced October 2017.

    Comments: Camera-ready version for AAAI/ACM AIES 2018. Data and pseudocode at https://github.com/shftan/auditblackbox. Previously titled "Detecting Bias in Black-Box Models Using Transparent Model Distillation". A short version was presented at NIPS 2017 Symposium on Interpretable Machine Learning

  17. arXiv:1704.04688  [pdf

    stat.ML cs.LG

    Machine Learning and the Future of Realism

    Authors: Giles Hooker, Cliff Hooker

    Abstract: The preceding three decades have seen the emergence, rise, and proliferation of machine learning (ML). From half-recognised beginnings in perceptrons, neural nets, and decision trees, algorithms that extract correlations (that is, patterns) from a set of data points have broken free from their origin in computational cognition to embrace all forms of problem solving, from voice recognition to medi… ▽ More

    Submitted 15 April, 2017; originally announced April 2017.

  18. arXiv:1611.07115  [pdf, other

    stat.ML cs.LG

    Tree Space Prototypes: Another Look at Making Tree Ensembles Interpretable

    Authors: Sarah Tan, Matvey Soloviev, Giles Hooker, Martin T. Wells

    Abstract: Ensembles of decision trees perform well on many problems, but are not interpretable. In contrast to existing approaches in interpretability that focus on explaining relationships between features and predictions, we propose an alternative approach to interpret tree ensemble classifiers by surfacing representative points for each class -- prototypes. We introduce a new distance for Gradient Booste… ▽ More

    Submitted 25 August, 2020; v1 submitted 21 November, 2016; originally announced November 2016.

    Comments: Camera-ready version for ACM-IMS FODS 2020. A short version was presented at NIPS 2016 Workshop on Interpretable Machine Learning for Complex Systems