Showing 1–7 of 7 results for author: Murdoch, W J

  1. arXiv:1909.13584  [pdf, other]

    cs.LG cs.CV stat.ML

    Interpretations are useful: penalizing explanations to align neural networks with prior knowledge

    Authors: Laura Rieger, Chandan Singh, W. James Murdoch, Bin Yu

    Abstract: For an explanation of a deep learning model to be effective, it must provide both insight into a model and suggest a corresponding action in order to achieve some objective. Too often, the litany of proposed explainable deep learning methods stop at the first step, providing practitioners with insight into a model, but no way to act on it. In this paper, we propose contextual decomposition explana…

    Submitted 8 October, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: 18 pages; published in ICML 2020; Erratum: numbers in Table 1 were too high (now corrected), with the trend remaining the same

  2. arXiv:1905.07631  [pdf, other]

    stat.ML cs.LG stat.ME

    Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees

    Authors: Summer Devlin, Chandan Singh, W. James Murdoch, Bin Yu

    Abstract: Tree ensembles, such as random forests and AdaBoost, are ubiquitous machine learning models known for achieving strong predictive performance across a wide variety of domains. However, this strong performance comes at the cost of interpretability (i.e. users are unable to understand the relationships a trained random forest has learned and why it is making its predictions). In particular, it is ch…

    Submitted 18 May, 2019; originally announced May 2019.

    Comments: Under review

  3. arXiv:1901.04592  [pdf, other]

    stat.ML cs.AI cs.LG stat.AP

    Interpretable machine learning: definitions, methods, and applications

    Authors: W. James Murdoch, Chandan Singh, Karl Kumbier, Reza Abbasi-Asl, Bin Yu

    Abstract: Machine-learning models have demonstrated great success in learning complex patterns that enable them to make predictions about unobserved data. In addition to using models for prediction, the ability to interpret what a model has learned is receiving an increasing amount of attention. However, this increased focus has led to considerable confusion about the notion of interpretability. In particul…

    Submitted 14 January, 2019; originally announced January 2019.

    Comments: 11 pages

    Journal ref: Published in PNAS 2019

  4. arXiv:1806.05337  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Hierarchical interpretations for neural network predictions

    Authors: Chandan Singh, W. James Murdoch, Bin Yu

    Abstract: Deep neural networks (DNNs) have achieved impressive predictive performance due to their ability to learn complex, non-linear relationships between variables. However, the inability to effectively visualize these relationships has led to DNNs being characterized as black boxes and consequently limited their applications. To ameliorate this problem, we introduce the use of hierarchical interpretati…

    Submitted 16 January, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: Published in ICLR 2019

    Journal ref: ICLR 2019

  5. arXiv:1801.05453  [pdf, other]

    cs.CL cs.LG stat.ML

    Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs

    Authors: W. James Murdoch, Peter J. Liu, Bin Yu

    Abstract: The driving force behind the recent success of LSTMs has been their ability to learn complex and non-linear relationships. Consequently, our inability to describe these relationships has led to LSTMs being characterized as black boxes. To this end, we introduce contextual decomposition (CD), an interpretation algorithm for analysing individual predictions made by standard LSTMs, without any change…

    Submitted 27 April, 2018; v1 submitted 16 January, 2018; originally announced January 2018.

    Comments: Oral presentation at ICLR 2018

  6. arXiv:1702.02540  [pdf, ps, other]

    cs.CL cs.AI cs.NE stat.ML

    Automatic Rule Extraction from Long Short Term Memory Networks

    Authors: W. James Murdoch, Arthur Szlam

    Abstract: Although deep learning models have proven effective at solving problems in natural language processing, the mechanism by which they come to their conclusions is often unclear. As a result, these models are generally treated as black boxes, yielding no insight of the underlying learned patterns. In this paper we consider Long Short Term Memory networks (LSTMs) and demonstrate a new approach for tra…

    Submitted 24 February, 2017; v1 submitted 8 February, 2017; originally announced February 2017.

    Comments: ICLR 2017 accepted paper

  7. arXiv:1412.4128  [pdf, other]

    stat.CO stat.ML

    Expanded Alternating Optimization of Nonconvex Functions with Applications to Matrix Factorization and Penalized Regression

    Authors: W. James Murdoch, Mu Zhu

    Abstract: We propose a general technique for improving alternating optimization (AO) of nonconvex functions. Starting from the solution given by AO, we conduct another sequence of searches over subspaces that are both meaningful to the optimization problem at hand and different from those used by AO. To demonstrate the utility of our approach, we apply it to the matrix factorization (MF) algorithm for recom…

    Submitted 12 December, 2014; originally announced December 2014.