Showing 1–24 of 24 results for author: Lundberg, S

  1. arXiv:2303.12712

    cs.CL cs.AI

    Sparks of Artificial General Intelligence: Early experiments with GPT-4

    Authors: Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

    Abstract: Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an earl…

    Submitted 13 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  2. arXiv:2303.09014

    cs.CL

    ART: Automatic multi-step reasoning and tool-use for large language models

    Authors: Bhargavi Paranjape, Scott Lundberg, Sameer Singh, Hannaneh Hajishirzi, Luke Zettlemoyer, Marco Tulio Ribeiro

    Abstract: Large language models (LLMs) can perform complex reasoning in few- and zero-shot settings by generating intermediate chain of thought (CoT) reasoning steps. Further, each reasoning step can rely on external tools to support computation beyond the core LLM capabilities (e.g. search/running code). Prior work on CoT prompting and tool use typically requires hand-crafting task-specific demonstrations…

    Submitted 15 March, 2023; originally announced March 2023.

  3. arXiv:2212.02774

    cs.CV

    Adaptive Testing of Computer Vision Models

    Authors: Irena Gao, Gabriel Ilharco, Scott Lundberg, Marco Tulio Ribeiro

    Abstract: Vision models often fail systematically on groups of data that share common semantic characteristics (e.g., rare objects or unusual scenes), but identifying these failure modes is a challenge. We introduce AdaVision, an interactive process for testing vision models which helps users identify and fix coherent failure modes. Given a natural language description of a coherent group, AdaVision retriev…

    Submitted 16 August, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: ICCV camera-ready

  4. arXiv:2211.03318

    cs.CL

    Fixing Model Bugs with Natural Language Patches

    Authors: Shikhar Murty, Christopher D. Manning, Scott Lundberg, Marco Tulio Ribeiro

    Abstract: Current approaches for fixing systematic problems in NLP models (e.g. regex patches, finetuning on more data) are either brittle, or labor-intensive and liable to shortcuts. In contrast, humans often provide corrections to each other through natural language. Taking inspiration from this, we explore natural language patches -- declarative statements that allow developers to provide corrective feed…

    Submitted 20 November, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022 [Fixed fig-1]

  5. arXiv:2207.07605

    cs.LG cs.GT

    Algorithms to estimate Shapley value feature attributions

    Authors: Hugh Chen, Ian C. Covert, Scott M. Lundberg, Su-In Lee

    Abstract: Feature attributions based on the Shapley value are popular for explaining machine learning models; however, their estimation is complex from both a theoretical and computational standpoint. We disentangle this complexity into two factors: (1) the approach to removing feature information, and (2) the tractable estimation strategy. These two factors provide a natural lens through which we can bette…

    Submitted 15 July, 2022; originally announced July 2022.

  6. Explaining a Series of Models by Propagating Shapley Values

    Authors: Hugh Chen, Scott M. Lundberg, Su-In Lee

    Abstract: Local feature attribution methods are increasingly used to explain complex machine learning models. However, current methods are limited because they are extremely expensive to compute or are not capable of explaining a distributed series of models where each model is owned by a separate institution. The latter is particularly important because it often arises in finance where explanations are man…

    Submitted 13 October, 2021; v1 submitted 30 April, 2021; originally announced May 2021.

  7. arXiv:2103.00370

    cs.LG cs.CV cs.HC cs.IR

    Axiomatic Explanations for Visual Search, Retrieval, and Similarity Learning

    Authors: Mark Hamilton, Scott Lundberg, Lei Zhang, Stephanie Fu, William T. Freeman

    Abstract: Visual search, recommendation, and contrastive similarity learning power technologies that impact billions of users worldwide. Modern model architectures can be complex and difficult to interpret, and there are several competing techniques one can use to explain a search engine's behavior. We show that the theory of fair credit assignment provides a $\textit{unique}$ axiomatic solution that genera…

    Submitted 16 March, 2022; v1 submitted 27 February, 2021; originally announced March 2021.

  8. arXiv:2011.14878

    cs.LG stat.ML

    Explaining by Removing: A Unified Framework for Model Explanation

    Authors: Ian Covert, Scott Lundberg, Su-In Lee

    Abstract: Researchers have proposed a wide variety of model explanation approaches, but it remains unclear how most methods are related or when one method is preferable to another. We describe a new unified class of methods, removal-based explanations, that are based on the principle of simulating feature removal to quantify each feature's influence. These methods vary in several respects, so we develop a f…

    Submitted 12 May, 2022; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: Updated with JMLR revisions, arXiv admin note: text overlap with arXiv:2011.03623

  9. arXiv:2011.03623

    cs.LG stat.ML

    Feature Removal Is a Unifying Principle for Model Explanation Methods

    Authors: Ian Covert, Scott Lundberg, Su-In Lee

    Abstract: Researchers have proposed a wide variety of model explanation approaches, but it remains unclear how most methods are related or when one method is preferable to another. We examine the literature and find that many methods are based on a shared principle of explaining by removing - essentially, measuring the impact of removing sets of features from a model. These methods vary in several respects,…

    Submitted 22 August, 2022; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Updated for consistency with arxiv:2011.14878

  10. arXiv:2010.14592

    cs.LG stat.ML

    Shapley Flow: A Graph-based Approach to Interpreting Model Predictions

    Authors: Jiaxuan Wang, Jenna Wiens, Scott Lundberg

    Abstract: Many existing approaches for estimating feature importance are problematic because they ignore or hide dependencies among features. A causal graph, which encodes the relationships among input variables, can aid in assigning feature importance. However, current approaches that assign credit to nodes in the causal graph fail to explain the entire graph. In light of these limitations, we propose Shap…

    Submitted 26 February, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: camera ready version for AISTATS 2021

  11. arXiv:2006.16234

    cs.LG stat.ML

    True to the Model or True to the Data?

    Authors: Hugh Chen, Joseph D. Janizek, Scott Lundberg, Su-In Lee

    Abstract: A variety of recent papers discuss the application of Shapley values, a concept for explaining coalitional games, for feature attribution in machine learning. However, the correct way to connect a machine learning model to a coalitional game has been a source of controversy. The two main approaches that have been proposed differ in the way that they condition on known features, using either (1) an…

    Submitted 29 June, 2020; originally announced June 2020.

  12. arXiv:2004.00668

    cs.LG stat.ML

    Understanding Global Feature Contributions With Additive Importance Measures

    Authors: Ian Covert, Scott Lundberg, Su-In Lee

    Abstract: Understanding the inner workings of complex machine learning models is a long-standing problem and most recent research has focused on local interpretability. To assess the role of individual input features in a global sense, we explore the perspective of defining feature importance through the predictive power associated with each feature. We introduce two notions of predictive power (model-based…

    Submitted 27 October, 2020; v1 submitted 1 April, 2020; originally announced April 2020.

    Comments: NeurIPS 2020 Camera Ready

  13. arXiv:2002.04770

    cs.LG eess.SP stat.ML

    Forecasting adverse surgical events using self-supervised transfer learning for physiological signals

    Authors: Hugh Chen, Scott Lundberg, Gabe Erion, Jerry H. Kim, Su-In Lee

    Abstract: Hundreds of millions of surgical procedures take place annually across the world, which generate a prevalent type of electronic health record (EHR) data comprising time series physiological signals. Here, we present a transferable embedding method (i.e., a method to transform time series signals into input features for predictive machine learning models) named PHASE (PHysiologicAl Signal Embedding…

    Submitted 21 January, 2021; v1 submitted 11 February, 2020; originally announced February 2020.

  14. arXiv:1911.11888

    cs.LG stat.ML

    Explaining Models by Propagating Shapley Values of Local Components

    Authors: Hugh Chen, Scott Lundberg, Su-In Lee

    Abstract: In healthcare, making the best possible predictions with complex models (e.g., neural networks, ensembles/stacks of different models) can impact patient welfare. In order to make these complex models explainable, we present DeepSHAP for mixed model types, a framework for layer wise propagation of Shapley values that builds upon DeepLIFT (an existing approach for explaining neural networks). We sho…

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: 4 pages and references

  15. arXiv:1906.10670

    cs.LG stat.ML

    Improving performance of deep learning models with axiomatic attribution priors and expected gradients

    Authors: Gabriel Erion, Joseph D. Janizek, Pascal Sturmfels, Scott Lundberg, Su-In Lee

    Abstract: Recent research has demonstrated that feature attribution methods for deep networks can themselves be incorporated into training; these attribution priors optimize for a model whose attributions have certain desirable properties -- most frequently, that particular features are important or unimportant. These attribution priors are often based on attribution methods that are not guaranteed to satis…

    Submitted 11 November, 2020; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: Updated after submission to Nature Machine Intelligence

  16. arXiv:1905.04610

    cs.LG cs.AI stat.ML

    Explainable AI for Trees: From Local Explanations to Global Understanding

    Authors: Scott M. Lundberg, Gabriel Erion, Hugh Chen, Alex DeGrave, Jordan M. Prutkin, Bala Nair, Ronit Katz, Jonathan Himmelfarb, Nisha Bansal, Su-In Lee

    Abstract: Tree-based machine learning models such as random forests, decision trees, and gradient boosted trees are the most popular non-linear predictive models used in practice today, yet comparatively little attention has been paid to explaining their predictions. Here we significantly improve the interpretability of tree-based models through three main contributions: 1) The first polynomial time algorit…

    Submitted 11 May, 2019; originally announced May 2019.

  17. arXiv:1802.03888

    cs.LG stat.ML

    Consistent Individualized Feature Attribution for Tree Ensembles

    Authors: Scott M. Lundberg, Gabriel G. Erion, Su-In Lee

    Abstract: Interpreting predictions from tree ensemble methods such as gradient boosting machines and random forests is important, yet feature attribution for trees is often heuristic and not individualized for each prediction. Here we show that popular feature attribution methods are inconsistent, meaning they can lower a feature's assigned importance when the true impact of that feature actually increases.…

    Submitted 6 March, 2019; v1 submitted 11 February, 2018; originally announced February 2018.

    Comments: Follow-up to 2017 ICML Workshop arXiv:1706.06060

  18. arXiv:1801.07384

    cs.LG cs.AI stat.ML

    Hybrid Gradient Boosting Trees and Neural Networks for Forecasting Operating Room Data

    Authors: Hugh Chen, Scott Lundberg, Su-In Lee

    Abstract: Time series data constitutes a distinct and growing problem in machine learning. As the corpus of time series data grows larger, deep models that simultaneously learn features and classify with these features can be intractable or suboptimal. In this paper, we present feature learning via long short term memory (LSTM) networks and prediction via gradient boosting trees (XGB). Focusing on the conse…

    Submitted 23 January, 2018; v1 submitted 22 January, 2018; originally announced January 2018.

    Comments: Presented at Machine Learning for Health Workshop: 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA

  19. arXiv:1712.00563

    cs.LG stat.AP stat.ML

    Anesthesiologist-level forecasting of hypoxemia with only SpO2 data using deep learning

    Authors: Gabriel Erion, Hugh Chen, Scott M. Lundberg, Su-In Lee

    Abstract: We use a deep learning model trained only on a patient's blood oxygenation data (measurable with an inexpensive fingertip sensor) to predict impending hypoxemia (low blood oxygen) more accurately than trained anesthesiologists with access to all the data recorded in a modern operating room. We also provide a simple way to visualize the reason why a patient's risk is low or high by assigning weight…

    Submitted 2 December, 2017; originally announced December 2017.

    Comments: To be presented at Machine Learning for Health Workshop: 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA

  20. arXiv:1710.03282

    cs.LG

    Checkpoint Ensembles: Ensemble Methods from a Single Training Process

    Authors: Hugh Chen, Scott Lundberg, Su-In Lee

    Abstract: We present the checkpoint ensembles method that can learn ensemble models on a single training process. Although checkpoint ensembles can be applied to any parametric iterative learning technique, here we focus on neural networks. Neural networks' composable and simple neurons make it possible to capture many individual and interaction effects among features. However, small sample sizes and sampli…

    Submitted 9 October, 2017; originally announced October 2017.

    Comments: 7 pages, 4 figures, under review AAAI

  21. arXiv:1706.06060

    cs.AI cs.LG stat.ML

    Consistent feature attribution for tree ensembles

    Authors: Scott M. Lundberg, Su-In Lee

    Abstract: Note that a newer expanded version of this paper is now available at: arXiv:1802.03888. It is critical in many applications to understand what features are important for a model, and why individual predictions were made. For tree ensemble methods these questions are usually answered by attributing importance values to input features, either globally or for a single prediction. Here we show that c…

    Submitted 16 February, 2018; v1 submitted 19 June, 2017; originally announced June 2017.

    Comments: presented at 2017 ICML Workshop on Human Interpretability in Machine Learning (WHI 2017), Sydney, NSW, Australia

  22. arXiv:1705.07874

    cs.AI cs.LG stat.ML

    A Unified Approach to Interpreting Model Predictions

    Authors: Scott Lundberg, Su-In Lee

    Abstract: Understanding why a model makes a certain prediction can be as crucial as the prediction's accuracy in many applications. However, the highest accuracy for large modern datasets is often achieved by complex models that even experts struggle to interpret, such as ensemble or deep learning models, creating a tension between accuracy and interpretability. In response, various methods have recently be…

    Submitted 24 November, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: To appear in NIPS 2017

  23. arXiv:1611.07478

    cs.AI

    An unexpected unity among methods for interpreting model predictions

    Authors: Scott Lundberg, Su-In Lee

    Abstract: Understanding why a model made a certain prediction is crucial in many data science fields. Interpretable predictions engender appropriate trust and provide insight into how the model may be improved. However, with large modern datasets the best accuracy is often achieved by complex models even experts struggle to interpret, which creates a tension between accuracy and interpretability. Recently,…

    Submitted 8 December, 2016; v1 submitted 22 November, 2016; originally announced November 2016.

    Comments: Presented at NIPS 2016 Workshop on Interpretable Machine Learning in Complex Systems

  24. arXiv:1412.0207

    q-bio.QM

    Learning microbial interaction networks from metagenomic count data

    Authors: Surojit Biswas, Meredith McDonald, Derek S. Lundberg, Jeffery L. Dangl, Vladimir Jojic

    Abstract: Many microbes associate with higher eukaryotes and impact their vitality. In order to engineer microbiomes for host benefit, we must understand the rules of community assembly and maintenance, which in large part, demands an understanding of the direct interactions between community members. Toward this end, we've developed a Poisson-multivariate normal hierarchical model to learn direct interacti…

    Submitted 30 November, 2014; originally announced December 2014.

    Comments: Submitted to RECOMB 2015