Skip to main content

Showing 1–27 of 27 results for author: Parbhoo, S

.
  1. arXiv:2406.08636  [pdf, other

    cs.LG

    Towards Integrating Personal Knowledge into Test-Time Predictions

    Authors: Isaac Lage, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric outcomes may know nothing about a patient's social support system, and social support may look different for different patients. In this work, we introduce the problem… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2402.12737  [pdf, other

    cs.LG

    Guarantee Regions for Local Explanations

    Authors: Marton Havasi, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Interpretability methods that utilise local surrogate models (e.g. LIME) are very good at describing the behaviour of the predictive model at a point of interest, but they are not guaranteed to extrapolate to the local region surrounding the point. However, overfitting to the local curvature of the predictive model and malicious tampering can significantly limit extrapolation. We propose an anchor… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  3. arXiv:2310.13224  [pdf, other

    cs.CR

    Adaptive Experimental Design for Intrusion Data Collection

    Authors: Kate Highnam, Zach Hanif, Ellie Van Vogt, Sonali Parbhoo, Sergio Maffeis, Nicholas R. Jennings

    Abstract: Intrusion research frequently collects data on attack techniques currently employed and their potential symptoms. This includes deploying honeypots, logging events from existing devices, employing a red team for a sample attack campaign, or simulating system activity. However, these observational studies do not clearly discern the cause-and-effect relationships between the design of the environmen… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: CAMLIS'23 Pre-publication - TO BE UPDATED!!

  4. arXiv:2308.05075  [pdf, other

    cs.LG

    Bayesian Inverse Transition Learning for Offline Settings

    Authors: Leo Benac, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Offline Reinforcement learning is commonly used for sequential decision-making in domains such as healthcare and education, where the rewards are known and the transition dynamics $T$ must be estimated on the basis of batch data. A key challenge for all tasks is how to learn a reliable estimate of the transition dynamics $T$ that produce near-optimal policies that are safe enough so that they neve… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 8 pages, 1 plots, 2 tables

  5. arXiv:2307.07014  [pdf, other

    cs.LG cs.AI stat.ML

    Leveraging Factored Action Spaces for Off-Policy Evaluation

    Authors: Aaman Rebello, Shengpu Tang, Jenna Wiens, Sonali Parbhoo

    Abstract: Off-policy evaluation (OPE) aims to estimate the benefit of following a counterfactual sequence of actions, given data collected from executed sequences. However, existing OPE estimators often exhibit high bias and high variance in problems involving large, combinatorial action spaces. We investigate how to mitigate this issue using factored action spaces i.e. expressing each action as a combinati… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Main paper: 8 pages, 7 figures. Appendix: 30 pages, 17 figures. Accepted at ICML 2023 Workshop on Counterfactuals in Minds and Machines, Honolulu, Hawaii, USA. Camera ready version

    MSC Class: 62D20 (Primary) 62M05; 60J10; 62D05; 62P10 (Secondary) ACM Class: I.2.6; I.2.8; G.3; J.3

  6. arXiv:2306.11208  [pdf, other

    cs.LG cs.AI stat.ML

    The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

    Authors: Sarah Rathnam, Sonali Parbhoo, Weiwei Pan, Susan A. Murphy, Finale Doshi-Velez

    Abstract: Discount regularization, using a shorter planning horizon when calculating the optimal policy, is a popular choice to restrict planning to a less complex set of policies when estimating an MDP from sparse or noisy data (Jiang et al., 2015). It is commonly understood that discount regularization functions by de-emphasizing or ignoring delayed effects. In this paper, we reveal an alternate view of d… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  7. arXiv:2304.03365  [pdf, other

    cs.LG cs.AI

    Decision-Focused Model-based Reinforcement Learning for Reward Transfer

    Authors: Abhishek Sharma, Sonali Parbhoo, Omer Gottesman, Finale Doshi-Velez

    Abstract: Decision-focused (DF) model-based reinforcement learning has recently been introduced as a powerful algorithm that can focus on learning the MDP dynamics that are most relevant for obtaining high returns. While this approach increases the agent's performance by directly optimizing the reward, it does so by learning less accurate dynamics from a maximum likelihood perspective. We demonstrate that w… ▽ More

    Submitted 1 January, 2024; v1 submitted 6 April, 2023; originally announced April 2023.

  8. arXiv:2301.05664  [pdf, other

    cs.LG stat.ML

    Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning

    Authors: Taylor W. Killian, Sonali Parbhoo, Marzyeh Ghassemi

    Abstract: In safety-critical decision-making scenarios being able to identify worst-case outcomes, or dead-ends is crucial in order to develop safe and reliable policies in practice. These situations are typically rife with uncertainty due to unknown or stochastic characteristics of the environment as well as limited offline training data. As a result, the value of a decision at any time point should be bas… ▽ More

    Submitted 30 January, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: To appear in TMLR (01/2023). The submission and reviews can be viewed at: https://openreview.net/forum?id=oKlEOT83gI

  9. arXiv:2207.06269  [pdf, other

    cs.LG

    Policy Optimization with Sparse Global Contrastive Explanations

    Authors: Jiayu Yao, Sonali Parbhoo, Weiwei Pan, Finale Doshi-Velez

    Abstract: We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of kee** that gl… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: Accepted at IMLH Workshop, ICML 2022

  10. arXiv:2201.08262  [pdf, other

    cs.LG stat.ML

    Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making

    Authors: Sonali Parbhoo, Shalmali Joshi, Finale Doshi-Velez

    Abstract: Assessing the effects of a policy based on observational data from a different policy is a common problem across several high-stake decision-making domains, and several off-policy evaluation (OPE) techniques have been proposed. However, these methods largely formulate OPE as a problem disassociated from the process used to generate the data (i.e. structural assumptions in the form of a causal grap… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  11. Learning Conditional Invariance through Cycle Consistency

    Authors: Maxim Samarin, Vitali Nesterov, Mario Wieser, Aleksander Wieczorek, Sonali Parbhoo, Volker Roth

    Abstract: Identifying meaningful and independent factors of variation in a dataset is a challenging learning task frequently addressed by means of deep latent variable models. This task can be viewed as learning symmetry transformations preserving the value of a chosen property along latent dimensions. However, existing approaches exhibit severe drawbacks in enforcing the invariance property in the latent s… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

    Comments: 16 pages, 3 figures, published at the DAGM German Conference on Pattern Recognition, Sep. 28 - Oct. 1, 2021

  12. arXiv:2110.13221  [pdf, other

    cs.LG cs.AI stat.ML

    On Learning Prediction-Focused Mixtures

    Authors: Abhishek Sharma, Catherine Zeng, Sanjana Narayanan, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Probabilistic models help us encode latent structures that both model the data and are ideally also useful for specific downstream tasks. Among these, mixture models and their time-series counterparts, hidden Markov models, identify discrete components in the data. In this work, we focus on a constrained capacity setting, where we want to learn a model with relatively few components (e.g. for inte… ▽ More

    Submitted 27 October, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

  13. arXiv:2109.11043  [pdf, other

    cs.LG

    Learning Predictive and Interpretable Timeseries Summaries from ICU Data

    Authors: Nari Johnson, Sonali Parbhoo, Andrew Slavin Ross, Finale Doshi-Velez

    Abstract: Machine learning models that utilize patient data across time (rather than just the most recent measurements) have increased performance for many risk stratification tasks in the intensive care unit. However, many of these models and their learned representations are complex and therefore difficult for clinicians to interpret, creating challenges for validation. Our work proposes a new procedure t… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: 10 pages, 3 figures, AMIA 2021 Annual Symposium

  14. arXiv:2109.06312  [pdf, other

    cs.LG stat.ML

    Learning-to-defer for sequential medical decision-making under uncertainty

    Authors: Shalmali Joshi, Sonali Parbhoo, Finale Doshi-Velez

    Abstract: Learning-to-defer is a framework to automatically defer decision-making to a human expert when ML-based decisions are deemed unreliable. Existing learning-to-defer frameworks are not designed for sequential settings. That is, they defer at every instance independently, based on immediate predictions, while ignoring the potential long-term impact of these interventions. As a result, existing framew… ▽ More

    Submitted 5 December, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

  15. arXiv:2103.11175  [pdf, other

    cs.LG stat.ME stat.ML

    NCoRE: Neural Counterfactual Representation Learning for Combinations of Treatments

    Authors: Sonali Parbhoo, Stefan Bauer, Patrick Schwab

    Abstract: Estimating an individual's potential response to interventions from observational data is of high practical relevance for many domains, such as healthcare, public policy or economics. In this setting, it is often the case that combinations of interventions may be applied simultaneously, for example, multiple prescriptions in healthcare or different fiscal and monetary measures in economics. Howeve… ▽ More

    Submitted 20 March, 2021; originally announced March 2021.

  16. arXiv:2101.05360  [pdf, other

    cs.LG cs.AI

    Preferential Mixture-of-Experts: Interpretable Models that Rely on Human Expertise as much as Possible

    Authors: Melanie F. Pradier, Javier Zazo, Sonali Parbhoo, Roy H. Perlis, Maurizio Zazzi, Finale Doshi-Velez

    Abstract: We propose Preferential MoE, a novel human-ML mixture-of-experts model that augments human expertise in decision making with a data-based classifier only when necessary for predictive performance. Our model exhibits an interpretable gating function that provides information on when human rules should be followed or avoided. The gating function is maximized for using human-based rules, and classifi… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: 10 pages, 5 figures, 4 tables, AMIA 2021 Virtual Informatics Summit

  17. arXiv:2008.13412  [pdf, other

    stat.AP cs.LG q-bio.QM

    Real-time Prediction of COVID-19 related Mortality using Electronic Health Records

    Authors: Patrick Schwab, Arash Mehrjou, Sonali Parbhoo, Leo Anthony Celi, Jürgen Hetzel, Markus Hofer, Bernhard Schölkopf, Stefan Bauer

    Abstract: Coronavirus Disease 2019 (COVID-19) is an emerging respiratory disease caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) with rapid human-to-human transmission and a high case fatality rate particularly in older patients. Due to the exponential growth of infections, many healthcare systems across the world are under pressure to care for increasing amounts of at-risk patien… ▽ More

    Submitted 31 August, 2020; originally announced August 2020.

  18. arXiv:2002.03478  [pdf, other

    cs.LG stat.ML

    Interpretable Off-Policy Evaluation in Reinforcement Learning by Highlighting Influential Transitions

    Authors: Omer Gottesman, Joseph Futoma, Yao Liu, Sonali Parbhoo, Leo Anthony Celi, Emma Brunskill, Finale Doshi-Velez

    Abstract: Off-policy evaluation in reinforcement learning offers the chance of using observational data to improve future outcomes in domains such as healthcare and education, but safe deployment in high stakes settings requires ways of assessing its validity. Traditional measures such as confidence intervals may be insufficient due to noise, limited data and confounding. In this paper we develop a method t… ▽ More

    Submitted 11 August, 2020; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: ICML final version

  19. arXiv:2002.02782  [pdf, other

    cs.LG stat.ML

    Inverse Learning of Symmetries

    Authors: Mario Wieser, Sonali Parbhoo, Aleksander Wieczorek, Volker Roth

    Abstract: Symmetry transformations induce invariances which are frequently described with deep latent variable models. In many complex domains, such as the chemical space, invariances can be observed, yet the corresponding symmetry transformation cannot be formulated analytically. We propose to learn the symmetry transformation with a model consisting of two latent subspaces, where the first subspace captur… ▽ More

    Submitted 22 October, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Comments: Accepted for publication at NeurIPS 2020

  20. arXiv:1908.05254  [pdf, other

    cs.LG stat.ML

    Optimizing for Interpretability in Deep Neural Networks with Tree Regularization

    Authors: Mike Wu, Sonali Parbhoo, Michael C. Hughes, Volker Roth, Finale Doshi-Velez

    Abstract: Deep models have advanced prediction in many domains, but their lack of interpretability remains a key barrier to the adoption in many real world applications. There exists a large body of work aiming to help humans understand these black box functions to varying levels of granularity -- for example, through distillation, gradients, or adversarial examples. These methods however, all tackle interp… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1908.04494, arXiv:1711.06178

  21. arXiv:1908.04494  [pdf, other

    cs.LG stat.ML

    Regional Tree Regularization for Interpretability in Black Box Models

    Authors: Mike Wu, Sonali Parbhoo, Michael Hughes, Ryan Kindle, Leo Celi, Maurizio Zazzi, Volker Roth, Finale Doshi-Velez

    Abstract: The lack of interpretability remains a barrier to the adoption of deep neural networks. Recently, tree regularization has been proposed to encourage deep neural networks to resemble compact, axis-aligned decision trees without significant compromises in accuracy. However, it may be unreasonable to expect that a single tree can predict well across all possible inputs. In this work, we propose regio… ▽ More

    Submitted 16 March, 2020; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: AAAI 2020 (Oral)

  22. arXiv:1811.10347  [pdf, other

    stat.ME cs.AI

    Estimating Causal Effects With Partial Covariates For Clinical Interpretability

    Authors: Sonali Parbhoo, Mario Wieser, Volker Roth

    Abstract: Estimating the causal effects of an intervention in the presence of confounding is a frequently occurring problem in applications such as medicine. The task is challenging since there may be multiple confounding factors, some of which may be missing, and inferences must be made from high-dimensional, noisy measurements. In this paper, we propose a decision-theoretic approach to estimate the causal… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

  23. arXiv:1811.07969  [pdf, other

    cs.CV

    Informed MCMC with Bayesian Neural Networks for Facial Image Analysis

    Authors: Adam Kortylewski, Mario Wieser, Andreas Morel-Forster, Aleksander Wieczorek, Sonali Parbhoo, Volker Roth, Thomas Vetter

    Abstract: Computer vision tasks are difficult because of the large variability in the data that is induced by changes in light, background, partial occlusion as well as the varying pose, texture, and shape of objects. Generative approaches to computer vision allow us to overcome this difficulty by explicitly modeling the physical image formation process. Using generative object models, the analysis of an ob… ▽ More

    Submitted 29 November, 2018; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: Accepted to the Bayesian Deep Learning Workshop at NeurIPS 2018

  24. arXiv:1807.02326  [pdf, other

    stat.ML cs.LG

    Cause-Effect Deep Information Bottleneck For Systematically Missing Covariates

    Authors: Sonali Parbhoo, Mario Wieser, Aleksander Wieczorek, Volker Roth

    Abstract: Estimating the causal effects of an intervention from high-dimensional observational data is difficult due to the presence of confounding. The task is often complicated by the fact that we may have a systematic missingness in our data at test time. Our approach uses the information bottleneck to perform a low-dimensional compression of covariates by explicitly considering the relevance of informat… ▽ More

    Submitted 28 February, 2020; v1 submitted 6 July, 2018; originally announced July 2018.

  25. arXiv:1711.06178  [pdf, other

    stat.ML cs.LG

    Beyond Sparsity: Tree Regularization of Deep Models for Interpretability

    Authors: Mike Wu, Michael C. Hughes, Sonali Parbhoo, Maurizio Zazzi, Volker Roth, Finale Doshi-Velez

    Abstract: The lack of interpretability remains a key barrier to the adoption of deep models in many applications. In this work, we explicitly regularize deep models so human users might step through the process behind their predictions in little time. Specifically, we train deep time-series models so their class-probability predictions have high accuracy while being closely modeled by decision trees with fe… ▽ More

    Submitted 16 November, 2017; originally announced November 2017.

    Comments: To appear in AAAI 2018. Contains 9-page main paper and appendix with supplementary material

  26. arXiv:1701.06171  [pdf, other

    cs.CV

    Greedy Structure Learning of Hierarchical Compositional Models

    Authors: Adam Kortylewski, Aleksander Wieczorek, Mario Wieser, Clemens Blumer, Sonali Parbhoo, Andreas Morel-Forster, Volker Roth, Thomas Vetter

    Abstract: In this work, we consider the problem of learning a hierarchical generative model of an object from a set of images which show examples of the object in the presence of variable background clutter. Existing approaches to this problem are limited by making strong a-priori assumptions about the object's geometric structure and require segmented training data for learning. In this paper, we propose a… ▽ More

    Submitted 14 April, 2019; v1 submitted 22 January, 2017; originally announced January 2017.

    Comments: CVPR 2019

  27. arXiv:1510.01485  [pdf, other

    stat.ML cs.LG

    Bayesian Markov Blanket Estimation

    Authors: Dinu Kaufmann, Sonali Parbhoo, Aleksander Wieczorek, Sebastian Keller, David Adametz, Volker Roth

    Abstract: This paper considers a Bayesian view for estimating a sub-network in a Markov random field. The sub-network corresponds to the Markov blanket of a set of query variables, where the set of potential neighbours here is big. We factorize the posterior such that the Markov blanket is conditionally independent of the network of the potential neighbours. By exploiting this blockwise decoupling, we deriv… ▽ More

    Submitted 6 October, 2015; originally announced October 2015.

    Comments: 16 pages, 5 figures