Skip to main content

Showing 1–11 of 11 results for author: Plumb, G

Searching in archive cs. Search in all archives.
.
  1. Where Does My Model Underperform? A Human Evaluation of Slice Discovery Algorithms

    Authors: Nari Johnson, Ángel Alexander Cabrera, Gregory Plumb, Ameet Talwalkar

    Abstract: Machine learning (ML) models that achieve high average accuracy can still underperform on semantically coherent subsets ("slices") of data. This behavior can have significant societal consequences for the safety or bias of the model in deployment, but identifying these underperforming slices can be difficult in practice, especially in domains where practitioners lack access to group annotations to… ▽ More

    Submitted 9 February, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 11(1), 65-76. Best Paper Award

  2. arXiv:2207.04104  [pdf, other

    cs.LG cs.CV

    Towards a More Rigorous Science of Blindspot Discovery in Image Classification Models

    Authors: Gregory Plumb, Nari Johnson, Ángel Alexander Cabrera, Ameet Talwalkar

    Abstract: A growing body of work studies Blindspot Discovery Methods ("BDM"s): methods that use an image embedding to find semantically meaningful (i.e., united by a human-understandable concept) subsets of the data where an image classifier performs significantly worse. Motivated by observed gaps in prior work, we introduce a new framework for evaluating BDMs, SpotCheck, that uses synthetic image datasets… ▽ More

    Submitted 11 July, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: reviewed on OpenReview: https://openreview.net/forum?id=MaDvbLaBiF

    Journal ref: TMLR 2023

  3. arXiv:2206.02256  [pdf, other

    cs.HC cs.AI cs.LG

    Use-Case-Grounded Simulations for Explanation Evaluation

    Authors: Valerie Chen, Nari Johnson, Nicholay Topin, Gregory Plumb, Ameet Talwalkar

    Abstract: A growing body of research runs human subject evaluations to study whether providing users with explanations of machine learning models can help them with practical real-world use cases. However, running user studies is challenging and costly, and consequently each study typically only evaluates a limited number of different settings, e.g., studies often only evaluate a few arbitrarily selected ex… ▽ More

    Submitted 20 August, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  4. arXiv:2106.02112  [pdf, other

    cs.LG

    Finding and Fixing Spurious Patterns with Explanations

    Authors: Gregory Plumb, Marco Tulio Ribeiro, Ameet Talwalkar

    Abstract: Image classifiers often use spurious patterns, such as "relying on the presence of a person to detect a tennis racket, which do not generalize. In this work, we present an end-to-end pipeline for identifying and mitigating spurious patterns for such models, under the assumption that we have access to pixel-wise object-annotations. We start by identifying patterns such as "the model's prediction fo… ▽ More

    Submitted 17 August, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

  5. arXiv:2105.06506  [pdf, other

    cs.LG

    Sanity Simulations for Saliency Methods

    Authors: Joon Sik Kim, Gregory Plumb, Ameet Talwalkar

    Abstract: Saliency methods are a popular class of feature attribution explanation methods that aim to capture a model's predictive reasoning by identifying "important" pixels in an input image. However, the development and adoption of these methods are hindered by the lack of access to ground-truth model reasoning, which prevents accurate evaluation. In this work, we design a synthetic benchmarking framewor… ▽ More

    Submitted 16 June, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

    Comments: Accepted to International Conference on Machine Learning (ICML 2022)

  6. arXiv:2103.06254  [pdf, other

    cs.LG

    Interpretable Machine Learning: Moving From Mythos to Diagnostics

    Authors: Valerie Chen, Jeffrey Li, Joon Sik Kim, Gregory Plumb, Ameet Talwalkar

    Abstract: Despite increasing interest in the field of Interpretable Machine Learning (IML), a significant gap persists between the technical objectives targeted by researchers' methods and the high-level goals of consumers' use cases. In this work, we synthesize foundational work on IML methods and evaluation into an actionable taxonomy. This taxonomy serves as a tool to conceptualize the gap between resear… ▽ More

    Submitted 28 July, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

    Comments: Presented at ICML HILL Workshop 2021

  7. arXiv:2011.01205  [pdf, other

    cs.LG stat.ML

    A Learning Theoretic Perspective on Local Explainability

    Authors: Jeffrey Li, Vaishnavh Nagarajan, Gregory Plumb, Ameet Talwalkar

    Abstract: In this paper, we explore connections between interpretable machine learning and learning theory through the lens of local approximation explanations. First, we tackle the traditional problem of performance generalization and bound the test-time accuracy of a model using a notion of how locally explainable it is. Second, we explore the novel problem of explanation generalization which is an import… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  8. arXiv:2003.01640  [pdf, other

    cs.LG stat.ML

    Explaining Groups of Points in Low-Dimensional Representations

    Authors: Gregory Plumb, Jonathan Terhorst, Sriram Sankararaman, Ameet Talwalkar

    Abstract: A common workflow in data exploration is to learn a low-dimensional representation of the data, identify groups of points in that representation, and examine the differences between the groups to determine what they represent. We treat this workflow as an interpretable machine learning problem by leveraging the model that learned the low-dimensional representation to help identify the key differen… ▽ More

    Submitted 14 August, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

  9. arXiv:1906.01431  [pdf, other

    cs.LG stat.ML

    Regularizing Black-box Models for Improved Interpretability (HILL 2019 Version)

    Authors: Gregory Plumb, Maruan Al-Shedivat, Eric Xing, Ameet Talwalkar

    Abstract: Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, which lack guarantees about their explanation quality. We propose an alternative to these approaches by directly regularizing a black-box model for interpretability at training time. Our approac… ▽ More

    Submitted 31 May, 2019; originally announced June 2019.

    Comments: presented at 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1902.06787

  10. arXiv:1902.06787  [pdf, other

    cs.LG stat.ML

    Regularizing Black-box Models for Improved Interpretability

    Authors: Gregory Plumb, Maruan Al-Shedivat, Angel Alexander Cabrera, Adam Perer, Eric Xing, Ameet Talwalkar

    Abstract: Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, whose explanation quality can be unpredictable. Our method, ExpO, is a hybridization of these approaches that regularizes a model for explanation quality at training time. Importantly, these reg… ▽ More

    Submitted 8 November, 2020; v1 submitted 18 February, 2019; originally announced February 2019.

  11. arXiv:1807.02910  [pdf, other

    cs.LG stat.ML

    Model Agnostic Supervised Local Explanations

    Authors: Gregory Plumb, Denali Molitor, Ameet Talwalkar

    Abstract: Model interpretability is an increasingly important component of practical machine learning. Some of the most common forms of interpretability systems are example-based, local, and global explanations. One of the main challenges in interpretability is designing explanation systems that can capture aspects of each of these explanation types, in order to develop a more thorough understanding of the… ▽ More

    Submitted 5 January, 2019; v1 submitted 8 July, 2018; originally announced July 2018.