Skip to main content

Showing 51–100 of 288 results for author: van der Schaar, M

.
  1. arXiv:2305.09235  [pdf, other

    cs.LG

    Synthetic data, real errors: how (not) to publish and use synthetic data

    Authors: Boris van Breugel, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Generating synthetic data through generative models is gaining interest in the ML community and beyond, promising a future where datasets can be tailored to individual needs. Unfortunately, synthetic data is usually not perfect, resulting in potential errors in downstream tasks. In this work we explore how the generative process affects the downstream ML task. We show that the naive synthetic data… ▽ More

    Submitted 8 July, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 40th International Conference on Machine Learning (ICML 2023)

  2. arXiv:2304.06715  [pdf, other

    cs.LG cs.AI cs.CG

    Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance

    Authors: Jonathan Crabbé, Mihaela van der Schaar

    Abstract: Interpretability methods are valuable only if their explanations faithfully describe the explained model. In this work, we consider neural networks whose predictions are invariant under a specific symmetry group. This includes popular architectures, ranging from convolutional to graph neural networks. Any explanation that faithfully explains this type of model needs to be in agreement with this in… ▽ More

    Submitted 5 October, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Presented at NeurIPS 2023

  3. arXiv:2304.03722  [pdf, other

    cs.LG

    Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic Data

    Authors: Boris van Breugel, Mihaela van der Schaar

    Abstract: Generating synthetic data through generative models is gaining interest in the ML community and beyond. In the past, synthetic data was often regarded as a means to private data release, but a surge of recent papers explore how its potential reaches much further than this -- from creating more fair data to data augmentation, and from simulation to text generated by ChatGPT. In this perspective we… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  4. arXiv:2304.03674  [pdf, other

    cs.LG cs.AI cs.SE

    Machine Learning with Requirements: a Manifesto

    Authors: Eleonora Giunchiglia, Fergus Imrie, Mihaela van der Schaar, Thomas Lukasiewicz

    Abstract: In the recent years, machine learning has made great advancements that have been at the root of many breakthroughs in different application domains. However, it is still an open issue how make them applicable to high-stakes or safety-critical application domains, as they can often be brittle and unreliable. In this paper, we argue that requirements definition and satisfaction can go a long way to… ▽ More

    Submitted 2 February, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

  5. arXiv:2303.05506  [pdf, other

    cs.LG cs.AI stat.ML

    TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization

    Authors: Alan Jeffares, Tennison Liu, Jonathan Crabbé, Fergus Imrie, Mihaela van der Schaar

    Abstract: Despite their success with unstructured data, deep neural networks are not yet a panacea for structured tabular data. In the tabular domain, their efficiency crucially relies on various forms of regularization to prevent overfitting and provide strong generalization performance. Existing regularization techniques include broad modelling decisions such as choice of architecture, loss functions, and… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Published at International Conference on Learning Representations (ICLR) 2023

  6. arXiv:2303.02186  [pdf, ps, other

    cs.LG cs.AI

    Causal Deep Learning

    Authors: Jeroen Berrevoets, Krzysztof Kacprzyk, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Causality has the potential to truly transform the way we solve a large number of real-world problems. Yet, so far, its potential largely remains to be unlocked as causality often requires crucial assumptions which cannot be tested in practice. To address this challenge, we propose a new way of thinking about causality -- we call this causal deep learning. Our causal deep learning framework spans… ▽ More

    Submitted 14 February, 2024; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2212.00911

  7. arXiv:2303.01513  [pdf

    cs.LG cs.AI

    Safe AI for health and beyond -- Monitoring to transform a health service

    Authors: Mahed Abroshan, Michael Burkhart, Oscar Giles, Sam Greenbury, Zoe Kourtzi, Jack Roberts, Mihaela van der Schaar, Jannetta S Steyn, Alan Wilson, May Yong

    Abstract: Machine learning techniques are effective for building predictive models because they identify patterns in large datasets. Development of a model for complex real-life problems often stop at the point of publication, proof of concept or when made accessible through some mode of deployment. However, a model in the medical domain risks becoming obsolete as patient demographics, systems and clinical… ▽ More

    Submitted 6 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 12 pages, 3 figures

    ACM Class: I.2.1

  8. arXiv:2302.12749  [pdf, other

    cs.LG

    SurvivalGAN: Generating Time-to-Event Data for Survival Analysis

    Authors: Alexander Norcliffe, Bogdan Cebere, Fergus Imrie, Pietro Lio, Mihaela van der Schaar

    Abstract: Synthetic data is becoming an increasingly promising technology, and successful applications can improve privacy, fairness, and data democratization. While there are many methods for generating synthetic tabular data, the task remains non-trivial and unexplored for specific scenarios. One such scenario is survival data. Here, the key difficulty is censoring: for some instances, we are not aware of… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  9. arXiv:2302.12718  [pdf, other

    stat.ME cs.LG stat.ML

    Understanding the Impact of Competing Events on Heterogeneous Treatment Effect Estimation from Time-to-Event Data

    Authors: Alicia Curth, Mihaela van der Schaar

    Abstract: We study the problem of inferring heterogeneous treatment effects (HTEs) from time-to-event data in the presence of competing events. Albeit its great practical relevance, this problem has received little attention compared to its counterparts studying HTE estimation without time-to-event data or competing events. We take an outcome modeling approach to estimating HTEs, and consider how and when e… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: To appear in the Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023, Valencia, Spain. PMLR: Volume 206

  10. arXiv:2302.12619  [pdf, other

    cs.LG q-bio.QM

    T-Phenotype: Discovering Phenotypes of Predictive Temporal Patterns in Disease Progression

    Authors: Yuchao Qin, Mihaela van der Schaar, Changhee Lee

    Abstract: Clustering time-series data in healthcare is crucial for clinical phenoty** to understand patients' disease progression patterns and to design treatment guidelines tailored to homogeneous patient subgroups. While rich temporal dynamics enable the discovery of potential clusters beyond static correlations, two major challenges remain outstanding: i) discovery of predictive patterns from many pote… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  11. arXiv:2302.12604  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Neural Laplace Control for Continuous-time Delayed Systems

    Authors: Samuel Holt, Alihan Hüyük, Zhaozhi Qian, Hao Sun, Mihaela van der Schaar

    Abstract: Many real-world offline reinforcement learning (RL) problems involve continuous-time environments with delays. Such environments are characterized by two distinctive features: firstly, the state x(t) is observed at irregular time intervals, and secondly, the current action a(t) only affects the future state x(t + g) with an unknown delay g > 0. A prime example of such an environment is satellite c… ▽ More

    Submitted 10 April, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023, Valencia, Spain. PMLR: Volume 206. Copyright 2023 by the author(s)

    ACM Class: I.2.6; I.2.5; E.1

  12. arXiv:2302.12580  [pdf, other

    cs.LG cs.CR

    Membership Inference Attacks against Synthetic Data through Overfitting Detection

    Authors: Boris van Breugel, Hao Sun, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Data is the foundation of most science. Unfortunately, sharing data can be obstructed by the risk of violating data privacy, impeding research in fields like healthcare. Synthetic data is a potential solution. It aims to generate data that has the same distribution as the original data, but that does not disclose information about individuals. Membership Inference Attacks (MIAs) are a common priva… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  13. arXiv:2302.12238  [pdf, other

    cs.LG stat.ML

    Improving Adaptive Conformal Prediction Using Self-Supervised Learning

    Authors: Nabeel Seedat, Alan Jeffares, Fergus Imrie, Mihaela van der Schaar

    Abstract: Conformal prediction is a powerful distribution-free tool for uncertainty quantification, establishing valid prediction intervals with finite-sample guarantees. To produce valid intervals which are also adaptive to the difficulty of each instance, a common approach is to compute normalized nonconformity scores on a separate calibration set. Self-supervised learning has been effectively utilized in… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted to the International Conference on Artificial Intelligence and Statistics (AISTATS 2023). *Seedat & Jeffares contributed equally

  14. arXiv:2302.02923  [pdf, other

    stat.ML cs.LG econ.EM

    In Search of Insights, Not Magic Bullets: Towards Demystification of the Model Selection Dilemma in Heterogeneous Treatment Effect Estimation

    Authors: Alicia Curth, Mihaela van der Schaar

    Abstract: Personalized treatment effect estimates are often of interest in high-stakes applications -- thus, before deploying a model estimating such effects in practice, one needs to be sure that the best candidate from the ever-growing machine learning toolbox for this task was chosen. Unfortunately, due to the absence of counterfactual information in practice, it is usually not possible to rely on standa… ▽ More

    Submitted 6 June, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: To appear in the Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023

  15. arXiv:2301.12260  [pdf, other

    cs.LG cs.AI

    TemporAI: Facilitating Machine Learning Innovation in Time Domain Tasks for Medicine

    Authors: Evgeny S. Saveliev, Mihaela van der Schaar

    Abstract: TemporAI is an open source Python software library for machine learning (ML) tasks involving data with a time component, focused on medicine and healthcare use cases. It supports data in time series, static, and eventmodalities and provides an interface for prediction, causal inference, and time-to-event analysis, as well as common preprocessing utilities and model interpretability methods. The li… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

    ACM Class: I.2.0

  16. arXiv:2301.11323  [pdf, other

    cs.LG

    Joint Training of Deep Ensembles Fails Due to Learner Collusion

    Authors: Alan Jeffares, Tennison Liu, Jonathan Crabbé, Mihaela van der Schaar

    Abstract: Ensembles of machine learning models have been well established as a powerful method of improving performance over a single model. Traditionally, ensembling algorithms train their base learners independently or sequentially with the goal of optimizing their joint performance. In the case of deep ensembles of neural networks, we are provided with the opportunity to directly optimize the true object… ▽ More

    Submitted 31 October, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: To appear in the Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  17. arXiv:2301.07573  [pdf, other

    cs.LG cs.AI

    Synthcity: facilitating innovative use cases of synthetic data in different data modalities

    Authors: Zhaozhi Qian, Bogdan-Constantin Cebere, Mihaela van der Schaar

    Abstract: Synthcity is an open-source software package for innovative use cases of synthetic data in ML fairness, privacy and augmentation across diverse tabular data modalities, including static data, regular and irregular time series, data with censoring, multi-source data, composite data, and more. Synthcity provides the practitioners with a single access point to cutting edge research and tools in synth… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

  18. arXiv:2212.00911  [pdf, other

    cs.LG

    Navigating causal deep learning

    Authors: Jeroen Berrevoets, Krzysztof Kacprzyk, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Causal deep learning (CDL) is a new and important research area in the larger field of machine learning. With CDL, researchers aim to structure and encode causal knowledge in the extremely flexible representation space of deep learning models. Doing so will lead to more informed, robust, and general predictions and inference -- which is important! However, CDL is still in its infancy. For example,… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  19. arXiv:2211.06138  [pdf, other

    cs.LG cs.CY stat.ML

    Practical Approaches for Fair Learning with Multitype and Multivariate Sensitive Attributes

    Authors: Tennison Liu, Alex J. Chan, Boris van Breugel, Mihaela van der Schaar

    Abstract: It is important to guarantee that machine learning algorithms deployed in the real world do not result in unfairness or unintended social consequences. Fair ML has largely focused on the protection of single attributes in the simpler setting where both attributes and target outcomes are binary. However, the practical application in many a real-world problem entails the simultaneous protection of m… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

  20. arXiv:2211.05764  [pdf, other

    cs.LG cs.AI cs.CY cs.SE stat.ML

    DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems

    Authors: Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar

    Abstract: While there have been a number of remarkable breakthroughs in machine learning (ML), much of the focus has been placed on model development. However, to truly realize the potential of machine learning in real-world settings, additional aspects must be considered across the ML pipeline. Data-centric AI is emerging as a unifying paradigm that could enable such reliable end-to-end pipelines. However,… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Main paper: 11 pages, supplementary & case studies follow

    Journal ref: IEEE Transactions on Artificial Intelligence, 2023

  21. arXiv:2211.00631  [pdf, other

    cs.LG

    Composite Feature Selection using Deep Ensembles

    Authors: Fergus Imrie, Alexander Norcliffe, Pietro Lio, Mihaela van der Schaar

    Abstract: In many real world problems, features do not act alone but in combination with each other. For example, in genomics, diseases might not be caused by any single mutation but require the presence of multiple mutations. Prior work on feature selection either seeks to identify individual features or can only determine relevant groups from a predefined set. We investigate the problem of discovering gro… ▽ More

    Submitted 11 January, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2022

  22. arXiv:2210.13043  [pdf, other

    cs.LG cs.AI

    Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data

    Authors: Nabeel Seedat, Jonathan Crabbé, Ioana Bica, Mihaela van der Schaar

    Abstract: High model performance, on average, can hide that models may systematically underperform on subgroups of the data. We consider the tabular setting, which surfaces the unique issue of outcome heterogeneity - this is prevalent in areas such as healthcare, where patients with similar features can have different outcomes, thus making reliable predictions challenging. To tackle this, we propose Data-IQ… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Presented at NeurIPS 2022

  23. AutoPrognosis 2.0: Democratizing Diagnostic and Prognostic Modeling in Healthcare with Automated Machine Learning

    Authors: Fergus Imrie, Bogdan Cebere, Eoin F. McKinney, Mihaela van der Schaar

    Abstract: Diagnostic and prognostic models are increasingly important in medicine and inform many clinical decisions. Recently, machine learning approaches have shown improvement over conventional modeling techniques by better capturing complex interactions between patient covariates in a data-driven manner. However, the use of machine learning introduces a number of technical and practical challenges that… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Journal ref: PLOS Digital Health, 2023, 2(6): e0000276

  24. arXiv:2210.06183  [pdf, other

    cs.LG stat.ME

    Transfer Learning on Heterogeneous Feature Spaces for Treatment Effects Estimation

    Authors: Ioana Bica, Mihaela van der Schaar

    Abstract: Consider the problem of improving the estimation of conditional average treatment effects (CATE) for a target domain of interest by leveraging related information from a source domain with a different feature space. This heterogeneous transfer learning problem for CATE estimation is ubiquitous in areas such as healthcare where we may wish to evaluate the effectiveness of a treatment for a new pati… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

  25. arXiv:2210.05320  [pdf, other

    cs.LG

    Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning

    Authors: Alex J. Chan, Mihaela van der Schaar

    Abstract: Consider making a prediction over new test data without any opportunity to learn from a training set of labelled data - instead given access to a set of expert models and their predictions alongside some limited information about the dataset used to train them. In scenarios from finance to the medical sciences, and even consumer practice, stakeholders have developed models on private data they eit… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  26. arXiv:2209.11222  [pdf, other

    cs.LG cs.AI

    Concept Activation Regions: A Generalized Framework For Concept-Based Explanations

    Authors: Jonathan Crabbé, Mihaela van der Schaar

    Abstract: Concept-based explanations permit to understand the predictions of a deep neural network (DNN) through the lens of concepts specified by users. Existing methods assume that the examples illustrating a concept are mapped in a fixed direction of the DNN's latent space. When this holds true, the concept can be represented by a concept activation vector (CAV) pointing in that direction. In this work,… ▽ More

    Submitted 29 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: Presented at NeurIPS 2022

  27. arXiv:2208.05844  [pdf, other

    stat.ML cs.LG

    Adaptive Identification of Populations with Treatment Benefit in Clinical Trials: Machine Learning Challenges and Solutions

    Authors: Alicia Curth, Alihan Hüyük, Mihaela van der Schaar

    Abstract: We study the problem of adaptively identifying patient subpopulations that benefit from a given treatment during a confirmatory clinical trial. This type of adaptive clinical trial has been thoroughly studied in biostatistics, but has been allowed only limited adaptivity so far. Here, we aim to relax classical restrictions on such designs and investigate how to incorporate ideas from the recent ma… ▽ More

    Submitted 5 June, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: To appear in the Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023

  28. arXiv:2208.01373  [pdf, other

    cs.LG cs.AI stat.ML

    DAPDAG: Domain Adaptation via Perturbed DAG Reconstruction

    Authors: Yanke Li, Hatt Tobias, Ioana Bica, Mihaela van der Schaar

    Abstract: Leveraging labelled data from multiple domains to enable prediction in another domain without labels is a significant, yet challenging problem. To address this problem, we introduce the framework DAPDAG (\textbf{D}omain \textbf{A}daptation via \textbf{P}erturbed \textbf{DAG} Reconstruction) and propose to learn an auto-encoder that undertakes inference on population statistics given features and r… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

  29. arXiv:2207.05161  [pdf, other

    cs.LG cs.AI

    What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization

    Authors: Hao Sun, Boris van Breugel, Jonathan Crabbe, Nabeel Seedat, Mihaela van der Schaar

    Abstract: Uncertainty Quantification (UQ) is essential for creating trustworthy machine learning models. Recent years have seen a steep rise in UQ methods that can flag suspicious examples, however, it is often unclear what exactly these methods identify. In this work, we propose a framework for categorizing uncertain examples flagged by UQ methods in classification tasks. We introduce the confusion density… ▽ More

    Submitted 27 October, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

  30. arXiv:2206.10586  [pdf, other

    cs.LG

    D-CIPHER: Discovery of Closed-form Partial Differential Equations

    Authors: Krzysztof Kacprzyk, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Closed-form differential equations, including partial differential equations and higher-order ordinary differential equations, are one of the most important tools used by scientists to model and better understand natural phenomena. Discovering these equations directly from data is challenging because it requires modeling relationships between various derivatives that are not observed in the data (… ▽ More

    Submitted 29 November, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: To appear in the Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  31. arXiv:2206.08363  [pdf, other

    cs.LG cs.AI stat.ME

    Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability

    Authors: Jonathan Crabbé, Alicia Curth, Ioana Bica, Mihaela van der Schaar

    Abstract: Estimating personalized effects of treatments is a complex, yet pervasive problem. To tackle it, recent developments in the machine learning (ML) literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools: due to their flexibility, modularity and ability to learn constrained representations, neural networks in particular have become central to this l… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  32. arXiv:2206.08311  [pdf, other

    cs.LG stat.ML

    Continuous-Time Modeling of Counterfactual Outcomes Using Neural Controlled Differential Equations

    Authors: Nabeel Seedat, Fergus Imrie, Alexis Bellot, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Estimating counterfactual outcomes over time has the potential to unlock personalized healthcare by assisting decision-makers to answer ''what-iF'' questions. Existing causal inference approaches typically consider regular, discrete-time intervals between observations and treatment decisions and hence are unable to naturally model irregularly sampled data, which is the common setting in practice.… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Presented at the International Conference on Machine Learning (ICML) 2022

  33. arXiv:2206.07769  [pdf, other

    stat.ML cs.LG

    HyperImpute: Generalized Iterative Imputation with Automatic Model Selection

    Authors: Daniel Jarrett, Bogdan Cebere, Tennison Liu, Alicia Curth, Mihaela van der Schaar

    Abstract: Consider the problem of imputing missing values in a dataset. One the one hand, conventional approaches using iterative imputation benefit from the simplicity and customizability of learning conditional distributions directly, but suffer from the practical requirement for appropriate model specification of each and every variable. On the other hand, recent methods using deep generative modeling be… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

    Journal ref: In Proc. 39th International Conference on Machine Learning (ICML 2022)

  34. arXiv:2206.06354  [pdf, other

    cs.LG stat.ML

    Differentiable and Transportable Structure Learning

    Authors: Jeroen Berrevoets, Nabeel Seedat, Fergus Imrie, Mihaela van der Schaar

    Abstract: Directed acyclic graphs (DAGs) encode a lot of information about a particular distribution in their structure. However, compute required to infer these structures is typically super-exponential in the number of variables, as inference requires a sweep of a combinatorially large space of potential structures. That is, until recent advances made it possible to search this space using a differentiabl… ▽ More

    Submitted 12 June, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: Accepted at the International Conference on Machine Learning (ICML) 2023

  35. arXiv:2206.04843  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Laplace: Learning diverse classes of differential equations in the Laplace domain

    Authors: Samuel Holt, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Neural Ordinary Differential Equations model dynamical systems with ODEs learned by neural networks. However, ODEs are fundamentally inadequate to model systems with long-range dependencies or discontinuities, which are common in engineering and biological systems. Broader classes of differential equations (DE) have been proposed as remedies, including delay differential equations and integro-diff… ▽ More

    Submitted 14 June, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Proceedings of the 39th International Conference on Machine Learning, Baltimore, Maryland, USA, PMLR 162, 2022. Copyright 2022 by the author(s)

    ACM Class: I.2.6; I.2.5; E.1

  36. arXiv:2203.08057  [pdf, other

    cs.LG

    POETREE: Interpretable Policy Learning with Adaptive Decision Trees

    Authors: Alizée Pace, Alex J. Chan, Mihaela van der Schaar

    Abstract: Building models of human decision-making from observed behaviour is critical to better understand, diagnose and support real-world policies such as clinical care. As established policy learning approaches remain focused on imitation performance, they fall short of explaining the demonstrated decision-making process. Policy Extraction through decision Trees (POETREE) is a novel framework for interp… ▽ More

    Submitted 30 September, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  37. arXiv:2203.07338  [pdf, other

    cs.LG

    Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies

    Authors: Alex J. Chan, Alicia Curth, Mihaela van der Schaar

    Abstract: Human decision making is well known to be imperfect and the ability to analyse such processes individually is crucial when attempting to aid or improve a decision-maker's ability to perform a task, e.g. to alert them to potential biases or oversights on their part. To do so, it is necessary to develop interpretable representations of how agents make decisions and how this process changes over time… ▽ More

    Submitted 30 September, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

  38. arXiv:2203.01928  [pdf, other

    cs.LG cs.AI

    Label-Free Explainability for Unsupervised Models

    Authors: Jonathan Crabbé, Mihaela van der Schaar

    Abstract: Unsupervised black-box models are challenging to interpret. Indeed, most existing explainability methods require labels to select which component(s) of the black-box's output to interpret. In the absence of labels, black-box outputs often are representation vectors whose components do not correspond to any meaningful quantity. Hence, choosing which component(s) to interpret in a label-free unsuper… ▽ More

    Submitted 9 June, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Presented at ICML 2022

  39. arXiv:2202.12891  [pdf, other

    stat.ML cs.LG

    Combining Observational and Randomized Data for Estimating Heterogeneous Treatment Effects

    Authors: Tobias Hatt, Jeroen Berrevoets, Alicia Curth, Stefan Feuerriegel, Mihaela van der Schaar

    Abstract: Estimating heterogeneous treatment effects is an important problem across many domains. In order to accurately estimate such treatment effects, one typically relies on data from observational studies or randomized experiments. Currently, most existing works rely exclusively on observational data, which is often confounded and, hence, yields biased estimates. While observational data is confounded,… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  40. arXiv:2202.10153  [pdf, other

    cs.LG cs.AI stat.ML

    Inferring Lexicographically-Ordered Rewards from Preferences

    Authors: Alihan Hüyük, William R. Zame, Mihaela van der Schaar

    Abstract: Modeling the preferences of agents over a set of alternatives is a principal concern in many areas. The dominant approach has been to find a single reward/utility function with the property that alternatives yielding higher rewards are preferred over alternatives yielding lower rewards. However, in many settings, preferences are based on multiple, often competing, objectives; a single reward funct… ▽ More

    Submitted 7 June, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: In Proceedings of the 36th AAAI Conference on Artificial Intelligence

  41. arXiv:2202.08836  [pdf, other

    cs.LG cs.AI

    Data-SUITE: Data-centric identification of in-distribution incongruous examples

    Authors: Nabeel Seedat, Jonathan Crabbé, Mihaela van der Schaar

    Abstract: Systematic quantification of data quality is critical for consistent model performance. Prior works have focused on out-of-distribution data. Instead, we tackle an understudied yet equally important problem of characterizing incongruous regions of in-distribution (ID) data, which may arise from feature space heterogeneity. To this end, we propose a paradigm shift with Data-SUITE: a data-centric AI… ▽ More

    Submitted 13 June, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: Presented at the International Conference on Machine Learning (ICML) 2022

  42. arXiv:2202.02096  [pdf, ps, other

    stat.ML cs.LG

    To Impute or not to Impute? Missing Data in Treatment Effect Estimation

    Authors: Jeroen Berrevoets, Fergus Imrie, Trent Kyono, James Jordon, Mihaela van der Schaar

    Abstract: Missing data is a systemic problem in practical scenarios that causes noise and bias when estimating treatment effects. This makes treatment effect estimation from data with missingness a particularly tricky endeavour. A key reason for this is that standard assumptions on missingness are rendered insufficient due to the presence of an additional variable, treatment, besides the input (e.g. an indi… ▽ More

    Submitted 24 February, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

  43. arXiv:2112.03811  [pdf, other

    cs.LG

    Disentangled Counterfactual Recurrent Networks for Treatment Effect Inference over Time

    Authors: Jeroen Berrevoets, Alicia Curth, Ioana Bica, Eoin McKinney, Mihaela van der Schaar

    Abstract: Choosing the best treatment-plan for each individual patient requires accurate forecasts of their outcome trajectories as a function of the treatment, over time. While large observational data sets constitute rich sources of information to learn from, they also contain biases as treatments are rarely assigned randomly in practice. To provide accurate and unbiased forecasts, we introduce the Disent… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

  44. arXiv:2111.03187  [pdf, other

    cs.LG

    MIRACLE: Causally-Aware Imputation via Learning Missing Data Mechanisms

    Authors: Trent Kyono, Yao Zhang, Alexis Bellot, Mihaela van der Schaar

    Abstract: Missing data is an important problem in machine learning practice. Starting from the premise that imputation methods should preserve the causal structure of the data, we develop a regularization scheme that encourages any baseline imputation method to be causally consistent with the underlying data generating mechanism. Our proposal is a causally-aware imputation algorithm (MIRACLE). MIRACLE itera… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

  45. arXiv:2110.15355  [pdf, other

    cs.LG cs.AI cs.HC

    Explaining Latent Representations with a Corpus of Examples

    Authors: Jonathan Crabbé, Zhaozhi Qian, Fergus Imrie, Mihaela van der Schaar

    Abstract: Modern machine learning models are complicated. Most of them rely on convoluted latent representations of their input to issue a prediction. To achieve greater transparency than a black-box that connects inputs to predictions, it is necessary to gain a deeper understanding of these latent representations. To that aim, we propose SimplEx: a user-centred method that provides example-based explanatio… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Presented at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021)

  46. arXiv:2110.14001  [pdf, other

    cs.LG stat.ML

    SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

    Authors: Alicia Curth, Changhee Lee, Mihaela van der Schaar

    Abstract: We study the problem of inferring heterogeneous treatment effects from time-to-event data. While both the related problems of (i) estimating treatment effects for binary or continuous outcomes and (ii) predicting survival outcomes have been well studied in the recent machine learning literature, their combination -- albeit of high practical relevance -- has received considerably less attention. Wi… ▽ More

    Submitted 23 January, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  47. arXiv:2110.12884  [pdf, other

    cs.LG stat.ML

    DECAF: Generating Fair Synthetic Data Using Causally-Aware Generative Networks

    Authors: Boris van Breugel, Trent Kyono, Jeroen Berrevoets, Mihaela van der Schaar

    Abstract: Machine learning models have been criticized for reflecting unfair biases in the training data. Instead of solving for this by introducing fair learning algorithms directly, we focus on generating fair synthetic data, such that any downstream learner is fair. Generating fair synthetic data from unfair data - while remaining truthful to the underlying data-generating process (DGP) - is non-trivial.… ▽ More

    Submitted 4 November, 2021; v1 submitted 25 October, 2021; originally announced October 2021.

  48. Conservative Policy Construction Using Variational Autoencoders for Logged Data with Missing Values

    Authors: Mahed Abroshan, Kai Hou Yip, Cem Tekin, Mihaela van der Schaar

    Abstract: In high-stakes applications of data-driven decision making like healthcare, it is of paramount importance to learn a policy that maximizes the reward while avoiding potentially dangerous actions when there is uncertainty. There are two main challenges usually associated with this problem. Firstly, learning through online exploration is not possible due to the critical nature of such applications.… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

  49. arXiv:2108.03039  [pdf, other

    cs.LG stat.ML

    Identifiable Energy-based Representations: An Application to Estimating Heterogeneous Causal Effects

    Authors: Yao Zhang, Jeroen Berrevoets, Mihaela van der Schaar

    Abstract: Conditional average treatment effects (CATEs) allow us to understand the effect heterogeneity across a large population of individuals. However, typical CATE learners assume all confounding variables are measured in order for the CATE to be identifiable. This requirement can be satisfied by collecting many variables, at the expense of increased sample complexity for estimating CATEs. To combat thi… ▽ More

    Submitted 30 January, 2022; v1 submitted 6 August, 2021; originally announced August 2021.

    Comments: 20 pages, 2 figures, 9 tables

    Journal ref: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022

  50. arXiv:2107.13346  [pdf, other

    cs.LG stat.ME

    Doing Great at Estimating CATE? On the Neglected Assumptions in Benchmark Comparisons of Treatment Effect Estimators

    Authors: Alicia Curth, Mihaela van der Schaar

    Abstract: The machine learning toolbox for estimation of heterogeneous treatment effects from observational data is expanding rapidly, yet many of its algorithms have been evaluated only on a very limited set of semi-synthetic benchmark datasets. In this paper, we show that even in arguably the simplest setting -- estimation under ignorability assumptions -- the results of such empirical evaluations can be… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: Workshop on the Neglected Assumptions in Causal Inference at the International Conference on Machine Learning (ICML), 2021