Skip to main content

Showing 101–150 of 288 results for author: van der Schaar, M

.
  1. arXiv:2107.06317  [pdf, other

    cs.LG stat.ML

    Inverse Contextual Bandits: Learning How Behavior Evolves over Time

    Authors: Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Understanding a decision-maker's priorities by observing their behavior is critical for transparency and accountability in decision processes, such as in healthcare. Though conventional approaches to policy learning almost invariably assume stationarity in behavior, this is hardly true in practice: Medical practice is constantly evolving as clinical professionals fine-tune their knowledge over tim… ▽ More

    Submitted 8 June, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: In Proceedings of the 39th International Conference on Machine Learning

  2. arXiv:2106.05303  [pdf, other

    cs.LG cs.AI

    Explaining Time Series Predictions with Dynamic Masks

    Authors: Jonathan Crabbé, Mihaela van der Schaar

    Abstract: How can we explain the predictions of a machine learning model? When the data is structured as a multivariate time series, this question induces additional difficulties such as the necessity for the explanation to embody the time dependency and the large number of inputs. To address these challenges, we propose dynamic masks (Dynamask). This method produces instance-wise importance scores for each… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Presented at the Thirty-eighth International Conference on Machine Learning (ICML 2021)

  3. arXiv:2106.04240  [pdf, other

    cs.LG

    The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation

    Authors: Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, Mihaela van der Schaar

    Abstract: Understanding decision-making in clinical environments is of paramount importance if we are to bring the strengths of machine learning to ultimately improve patient outcomes. Several factors including the availability of public data, the intrinsically offline nature of the problem, and the complexity of human decision making, has meant that the mainstream development of algorithms is often geared… ▽ More

    Submitted 14 March, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

  4. arXiv:2106.03765  [pdf, other

    stat.ML cs.LG

    On Inductive Biases for Heterogeneous Treatment Effect Estimation

    Authors: Alicia Curth, Mihaela van der Schaar

    Abstract: We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments to obtain better estimates of conditional average treatment effects in finite samples. Especially when it is unknown whether a treatment has an effect at all, it is natural to hypothesize that the POs are similar - yet, some existing strategies for treatment effect estimatio… ▽ More

    Submitted 25 October, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: To Appear in the Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  5. arXiv:2106.02875  [pdf, other

    cs.LG stat.ML

    Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease Progression

    Authors: Zhaozhi Qian, William R. Zame, Lucas M. Fleuren, Paul Elbers, Mihaela van der Schaar

    Abstract: Modeling a system's temporal behaviour in reaction to external stimuli is a fundamental problem in many areas. Pure Machine Learning (ML) approaches often fail in the small sample regime and cannot provide actionable insights beyond predictions. A promising modification has been to incorporate expert domain knowledge into ML models. The application we consider is predicting the progression of dise… ▽ More

    Submitted 17 June, 2021; v1 submitted 5 June, 2021; originally announced June 2021.

  6. arXiv:2105.02522  [pdf, other

    stat.ML cs.LG math.DS

    Neural graphical modelling in continuous-time: consistency guarantees and algorithms

    Authors: Alexis Bellot, Kim Branson, Mihaela van der Schaar

    Abstract: The discovery of structure from time series data is a key problem in fields of study working with complex systems. Most identifiability results and learning algorithms assume the underlying dynamics to be discrete in time. Comparatively few, in contrast, explicitly define dependencies in infinitesimal intervals of time, independently of the scale of observation and of the regularity of sampling. I… ▽ More

    Submitted 3 February, 2022; v1 submitted 6 May, 2021; originally announced May 2021.

  7. arXiv:2103.15106  [pdf, other

    stat.ML cs.LG

    Deconfounded Score Method: Scoring DAGs with Dense Unobserved Confounding

    Authors: Alexis Bellot, Mihaela van der Schaar

    Abstract: Unobserved confounding is one of the greatest challenges for causal discovery. The case in which unobserved variables have a widespread effect on many of the observed ones is particularly difficult because most pairs of variables are conditionally dependent given any other subset, rendering the causal effect unidentifiable. In this paper we show that beyond conditional independencies, under the pr… ▽ More

    Submitted 25 May, 2021; v1 submitted 28 March, 2021; originally announced March 2021.

  8. arXiv:2102.11500  [pdf, other

    cs.LG cs.AI

    Model-Attentive Ensemble Learning for Sequence Modeling

    Authors: Victor D. Bourgin, Ioana Bica, Mihaela van der Schaar

    Abstract: Medical time-series datasets have unique characteristics that make prediction tasks challenging. Most notably, patient trajectories often contain longitudinal variations in their input-output relationships, generally referred to as temporal conditional shift. Designing sequence models capable of adapting to such time-varying distributions remains a prevailing problem. To address this we present Mo… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

  9. arXiv:2102.08921  [pdf, other

    cs.LG stat.ML

    How Faithful is your Synthetic Data? Sample-level Metrics for Evaluating and Auditing Generative Models

    Authors: Ahmed M. Alaa, Boris van Breugel, Evgeny Saveliev, Mihaela van der Schaar

    Abstract: Devising domain- and model-agnostic evaluation metrics for generative models is an important and as yet unresolved problem. Most existing metrics, which were tailored solely to the image synthesis setup, exhibit a limited capacity for diagnosing the different modes of failure of generative models across broader application domains. In this paper, we introduce a 3-dimensional evaluation metric, (… ▽ More

    Submitted 13 July, 2022; v1 submitted 17 February, 2021; originally announced February 2021.

  10. arXiv:2102.06483  [pdf, other

    cs.LG

    Scalable Bayesian Inverse Reinforcement Learning

    Authors: Alex J. Chan, Mihaela van der Schaar

    Abstract: Bayesian inference over the reward presents an ideal solution to the ill-posed nature of the inverse reinforcement learning problem. Unfortunately current methods generally do not scale well beyond the small tabular setting due to the need for an inner-loop MDP solver, and even non-Bayesian methods that do themselves scale often require extensive interaction with the environment to perform well, b… ▽ More

    Submitted 11 March, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  11. arXiv:2102.06271  [pdf, other

    cs.LG

    Selecting Treatment Effects Models for Domain Adaptation Using Causal Knowledge

    Authors: Trent Kyono, Ioana Bica, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Selecting causal inference models for estimating individualized treatment effects (ITE) from observational data presents a unique challenge since the counterfactual outcomes are never observed. The problem is challenged further in the unsupervised domain adaptation (UDA) setting where we only have access to labeled samples in the source domain, but desire selecting a model that achieves good perfo… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  12. arXiv:2102.03014  [pdf, other

    cs.LG

    A Variational Information Bottleneck Approach to Multi-Omics Data Integration

    Authors: Changhee Lee, Mihaela van der Schaar

    Abstract: Integration of data from multiple omics techniques is becoming increasingly important in biomedical research. Due to non-uniformity and technical limitations in omics platforms, such integrative analyses on multiple omics, which we refer to as views, involve learning from incomplete observations with various view-missing patterns. This is challenging because i) complex interactions within and acro… ▽ More

    Submitted 9 February, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: This paper is accepted to AISTATS 2021

  13. arXiv:2102.01577  [pdf, other

    stat.ML cs.LG

    Policy Analysis using Synthetic Controls in Continuous-Time

    Authors: Alexis Bellot, Mihaela van der Schaar

    Abstract: Counterfactual estimation using synthetic controls is one of the most successful recent methodological developments in causal inference. Despite its popularity, the current description only considers time series aligned across units and synthetic controls expressed as linear combinations of observed control units. We propose a continuous-time alternative that models the latent counterfactual path… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

  14. arXiv:2101.11769  [pdf, other

    stat.ML cs.LG

    Learning Matching Representations for Individualized Organ Transplantation Allocation

    Authors: Can Xu, Ahmed M. Alaa, Ioana Bica, Brent D. Ershoff, Maxime Cannesson, Mihaela van der Schaar

    Abstract: Organ transplantation is often the last resort for treating end-stage illness, but the probability of a successful transplantation depends greatly on compatibility between donors and recipients. Current medical practice relies on coarse rules for donor-recipient matching, but is short of domain knowledge regarding the complex factors underlying organ compatibility. In this paper, we formulate the… ▽ More

    Submitted 1 February, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted to AISTATS 2021

  15. arXiv:2101.10998  [pdf, ps, other

    cs.LG stat.AP stat.ML

    SDF-Bayes: Cautious Optimism in Safe Dose-Finding Clinical Trials with Drug Combinations and Heterogeneous Patient Groups

    Authors: Hyun-Suk Lee, Cong Shen, William Zame, Jang-Won Lee, Mihaela van der Schaar

    Abstract: Phase I clinical trials are designed to test the safety (non-toxicity) of drugs and find the maximum tolerated dose (MTD). This task becomes significantly more challenging when multiple-drug dose-combinations (DC) are involved, due to the inherent conflict between the exponentially increasing DC candidates and the limited patient budget. This paper proposes a novel Bayesian design, SDF-Bayes, for… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: Accepted to AISTATS 2021

  16. arXiv:2101.10943  [pdf, other

    stat.ML cs.LG

    Nonparametric Estimation of Heterogeneous Treatment Effects: From Theory to Learning Algorithms

    Authors: Alicia Curth, Mihaela van der Schaar

    Abstract: The need to evaluate treatment effectiveness is ubiquitous in most of empirical science, and interest in flexibly investigating effect heterogeneity is growing rapidly. To do so, a multitude of model-agnostic, nonparametric meta-learners have been proposed in recent years. Such learners decompose the treatment effect estimation problem into separate sub-problems, each solvable using standard super… ▽ More

    Submitted 25 February, 2021; v1 submitted 26 January, 2021; originally announced January 2021.

    Comments: To appear in the Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021

  17. arXiv:2101.10074  [pdf, other

    cs.CY cs.AI cs.LG

    Personalized Education in the AI Era: What to Expect Next?

    Authors: Setareh Maghsudi, Andrew Lan, Jie Xu, Mihaela van der Schaar

    Abstract: The objective of personalized learning is to design an effective knowledge acquisition track that matches the learner's strengths and bypasses her weaknesses to ultimately meet her desired goal. This concept emerged several years ago and is being adopted by a rapidly-growing number of educational institutions around the globe. In recent years, the boost of artificial intelligence (AI) and machine… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  18. arXiv:2012.04580  [pdf, ps, other

    cs.LG cs.CY

    Synthetic Data: Opening the data floodgates to enable faster, more directed development of machine learning methods

    Authors: James Jordon, Alan Wilson, Mihaela van der Schaar

    Abstract: Many ground-breaking advancements in machine learning can be attributed to the availability of a large volume of rich data. Unfortunately, many large-scale datasets are highly sensitive, such as healthcare data, and are not widely available to the machine learning community. Generating synthetic data with privacy guarantees provides one such solution, allowing meaningful research to be carried out… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  19. arXiv:2011.08596  [pdf, other

    cs.LG cs.AI

    Learning outside the Black-Box: The pursuit of interpretable models

    Authors: Jonathan Crabbé, Yao Zhang, William Zame, Mihaela van der Schaar

    Abstract: Machine Learning has proved its ability to produce accurate models but the deployment of these models outside the machine learning community has been hindered by the difficulties of interpreting these models. This paper proposes an algorithm that produces a continuous global interpretation of any given continuous black-box function. Our algorithm employs a variation of projection pursuit in which… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: presented in 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

    Journal ref: Advances in Neural Information Processing Systems 33 (2020) 17838-17849

  20. arXiv:2009.13180  [pdf, other

    cs.LG stat.ML

    CASTLE: Regularization via Auxiliary Causal Graph Discovery

    Authors: Trent Kyono, Yao Zhang, Mihaela van der Schaar

    Abstract: Regularization improves generalization of supervised models to out-of-sample data. Prior works have shown that prediction in the causal direction (effect from cause) results in lower testing error than the anti-causal direction. However, existing regularization methods are agnostic of causality. We introduce Causal Structure Learning (CASTLE) regularization and propose to regularize a neural netwo… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

  21. arXiv:2008.06461  [pdf, other

    stat.ME stat.ML

    Estimating Structural Target Functions using Machine Learning and Influence Functions

    Authors: Alicia Curth, Ahmed M. Alaa, Mihaela van der Schaar

    Abstract: We aim to construct a class of learning algorithms that are of practical value to applied researchers in fields such as biostatistics, epidemiology and econometrics, where the need to learn from incompletely observed information is ubiquitous. We propose a new framework for statistical machine learning of target functions arising as identifiable functionals from statistical models, which we call `… ▽ More

    Submitted 8 February, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

  22. arXiv:2007.13825  [pdf, other

    cs.LG stat.ML

    CPAS: the UK's National Machine Learning-based Hospital Capacity Planning System for COVID-19

    Authors: Zhaozhi Qian, Ahmed M. Alaa, Mihaela van der Schaar

    Abstract: The coronavirus disease 2019 (COVID-19) global pandemic poses the threat of overwhelming healthcare systems with unprecedented demands for intensive care resources. Managing these demands cannot be effectively conducted without a nationwide collective effort that relies on data to forecast hospital demands on the national, regional, hospital and individual levels. To this end, we developed the COV… ▽ More

    Submitted 27 July, 2020; originally announced July 2020.

  23. arXiv:2007.13531  [pdf, other

    cs.LG cs.AI stat.ML

    Learning "What-if" Explanations for Sequential Decision-Making

    Authors: Ioana Bica, Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

    Abstract: Building interpretable parameterizations of real-world decision-making on the basis of demonstrated behavior -- i.e. trajectories of observations and actions made by an expert maximizing some unknown reward function -- is essential for introspecting and auditing policies in different institutions. In this paper, we propose learning explanations of expert decisions by modeling their reward function… ▽ More

    Submitted 30 March, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: In Proc. 9th International Conference on Learning Representations (ICLR 2021)

  24. arXiv:2007.13481  [pdf, other

    cs.LG stat.ML

    Discriminative Jackknife: Quantifying Uncertainty in Deep Learning via Higher-Order Influence Functions

    Authors: Ahmed M. Alaa, Mihaela van der Schaar

    Abstract: Deep learning models achieve high predictive accuracy across a broad spectrum of tasks, but rigorously quantifying their predictive uncertainty remains challenging. Usable estimates of predictive uncertainty should (1) cover the true prediction targets with high probability, and (2) discriminate between high- and low-confidence prediction instances. Existing methods for uncertainty quantification… ▽ More

    Submitted 29 June, 2020; originally announced July 2020.

  25. arXiv:2007.12087  [pdf, other

    cs.LG cs.CR cs.CY stat.ML

    Hide-and-Seek Privacy Challenge

    Authors: James Jordon, Daniel Jarrett, **sung Yoon, Tavian Barnes, Paul Elbers, Patrick Thoral, Ari Ercole, Cheng Zhang, Danielle Belgrave, Mihaela van der Schaar

    Abstract: The clinical time-series setting poses a unique combination of challenges to data modeling and sharing. Due to the high dimensionality of clinical time series, adequate de-identification to preserve privacy while retaining data utility is difficult to achieve using common de-identification techniques. An innovative approach to this problem is synthetic data generation. From a technical perspective… ▽ More

    Submitted 24 July, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: 19 pages, 5 figures. Part of the NeurIPS 2020 competition track

  26. arXiv:2007.10653  [pdf, other

    stat.ML cs.LG

    Accounting for Unobserved Confounding in Domain Generalization

    Authors: Alexis Bellot, Mihaela van der Schaar

    Abstract: This paper investigates the problem of learning robust, generalizable prediction models from a combination of multiple datasets and qualitative assumptions about the underlying data-generating model. Part of the challenge of learning robust models lies in the influence of unobserved confounders that void many of the invariances and principles of minimum error presently used for this problem. Our a… ▽ More

    Submitted 3 February, 2022; v1 submitted 21 July, 2020; originally announced July 2020.

  27. arXiv:2006.14988  [pdf, other

    stat.ML cs.LG

    Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift

    Authors: Alex J. Chan, Ahmed M. Alaa, Zhaozhi Qian, Mihaela van der Schaar

    Abstract: Modern neural networks have proven to be powerful function approximators, providing state-of-the-art performance in a multitude of applications. They however fall short in their ability to quantify confidence in their predictions - this is crucial in high-stakes applications that involve critical decision-making. Bayesian neural networks (BNNs) aim at solving this problem by placing a prior distri… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

  28. arXiv:2006.14154  [pdf, other

    stat.ML cs.LG

    Strictly Batch Imitation Learning by Energy-based Distribution Matching

    Authors: Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

    Abstract: Consider learning a policy purely on the basis of demonstrated behavior -- that is, with no access to reinforcement signals, no knowledge of transition dynamics, and no further interaction with the environment. This *strictly batch imitation learning* problem arises wherever live experimentation is costly, such as in healthcare. One solution is simply to retrofit existing algorithms for apprentice… ▽ More

    Submitted 14 January, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: In Proc. 34th International Conference on Neural Information Processing Systems (NeurIPS 2020)

  29. arXiv:2006.14141  [pdf, other

    stat.ML cs.LG

    Inverse Active Sensing: Modeling and Understanding Timely Decision-Making

    Authors: Daniel Jarrett, Mihaela van der Schaar

    Abstract: Evidence-based decision-making entails collecting (costly) observations about an underlying phenomenon of interest, and subsequently committing to an (informed) decision on the basis of accumulated evidence. In this setting, active sensing is the goal-oriented problem of efficiently selecting which acquisitions to make, and when and what decision to settle on. As its complement, inverse active sen… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Journal ref: In Proc. 37th International Conference on Machine Learning (ICML 2020)

  30. arXiv:2006.14099  [pdf, other

    cs.LG stat.ML

    AutoCP: Automated Pipelines for Accurate Prediction Intervals

    Authors: Yao Zhang, William Zame, Mihaela van der Schaar

    Abstract: Successful application of machine learning models to real-world prediction problems, e.g. financial forecasting and personalized medicine, has proved to be challenging, because such settings require limiting and quantifying the uncertainty in the model predictions, i.e. providing valid and accurate prediction intervals. Conformal Prediction is a distribution-free approach to construct valid predic… ▽ More

    Submitted 13 September, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

  31. arXiv:2006.13707  [pdf, other

    cs.LG stat.ML

    Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions

    Authors: Ahmed M. Alaa, Mihaela van der Schaar

    Abstract: Recurrent neural networks (RNNs) are instrumental in modelling sequential and time-series data. Yet, when using RNNs to inform decision-making, predictions by themselves are not sufficient; we also need estimates of predictive uncertainty. Existing approaches for uncertainty quantification in RNNs are based predominantly on Bayesian methods; these are computationally prohibitive, and require major… ▽ More

    Submitted 27 June, 2020; v1 submitted 20 June, 2020; originally announced June 2020.

  32. arXiv:2006.08600  [pdf, other

    physics.med-ph cs.LG stat.ML

    Temporal Phenoty** using Deep Predictive Clustering of Disease Progression

    Authors: Changhee Lee, Mihaela van der Schaar

    Abstract: Due to the wider availability of modern electronic health records, patient care data is often being stored in the form of time-series. Clustering such time-series data is crucial for patient phenoty**, anticipating patients' prognoses by identifying "similar" patients, and designing treatment guidelines that are tailored to homogeneous patient subgroups. In this paper, we develop a deep learning… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  33. arXiv:2006.07917  [pdf, ps, other

    stat.ML cs.LG

    Robust Recursive Partitioning for Heterogeneous Treatment Effects with Uncertainty Quantification

    Authors: Hyun-Suk Lee, Yao Zhang, William Zame, Cong Shen, Jang-Won Lee, Mihaela van der Schaar

    Abstract: Subgroup analysis of treatment effects plays an important role in applications from medicine to public policy to recommender systems. It allows physicians (for example) to identify groups of patients for whom a given drug or treatment is likely to be effective and groups of patients for which it is not. Most of the current methods of subgroup analysis begin with a particular algorithm for estimati… ▽ More

    Submitted 17 October, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: 19 pages, 7 figures, NeurIPS 2020

  34. arXiv:2006.05026  [pdf, other

    cs.LG stat.AP stat.ME stat.ML

    Learning for Dose Allocation in Adaptive Clinical Trials with Safety Constraints

    Authors: Cong Shen, Zhiyang Wang, Sofia S. Villar, Mihaela van der Schaar

    Abstract: Phase I dose-finding trials are increasingly challenging as the relationship between efficacy and toxicity of new compounds (or combination of them) becomes more complex. Despite this, most commonly used methods in practice focus on identifying a Maximum Tolerated Dose (MTD) by learning only from toxicity events. We present a novel adaptive clinical trial methodology, called Safe Efficacy Explorat… ▽ More

    Submitted 15 June, 2020; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted to the 37th International Conference on Machine Learning (ICML 2020)

  35. arXiv:2005.08837  [pdf, other

    stat.AP cs.LG physics.soc-ph

    When and How to Lift the Lockdown? Global COVID-19 Scenario Analysis and Policy Assessment using Compartmental Gaussian Processes

    Authors: Zhaozhi Qian, Ahmed M. Alaa, Mihaela van der Schaar

    Abstract: The coronavirus disease 2019 (COVID-19) global pandemic has led many countries to impose unprecedented lockdown measures in order to slow down the outbreak. Questions on whether governments have acted promptly enough, and whether lockdown measures can be lifted soon have since been central in public discourse. Data-driven models that predict COVID-19 fatalities under different lockdown policy scen… ▽ More

    Submitted 3 June, 2020; v1 submitted 13 May, 2020; originally announced May 2020.

  36. arXiv:2005.04154  [pdf, ps, other

    eess.SP cs.LG eess.SY stat.ML

    A Non-Stationary Bandit-Learning Approach to Energy-Efficient Femto-Caching with Rateless-Coded Transmission

    Authors: Setareh Maghsudi, Mihaela van der Schaar

    Abstract: The ever-increasing demand for media streaming together with limited backhaul capacity renders develo** efficient file-delivery methods imperative. One such method is femto-caching, which, despite its great potential, imposes several challenges such as efficient resource management. We study a resource allocation problem for joint caching and transmission in small cell networks, where the system… ▽ More

    Submitted 13 April, 2020; originally announced May 2020.

  37. arXiv:2004.14700  [pdf, other

    stat.ME

    A primer on coupled state-switching models for multiple interacting time series

    Authors: Jennifer Pohle, Roland Langrock, Mihaela van der Schaar, Ruth King, Frants Havmand Jensen

    Abstract: State-switching models such as hidden Markov models or Markov-switching regression models are routinely applied to analyse sequences of observations that are driven by underlying non-observable states. Coupled state-switching models extend these approaches to address the case of multiple observation sequences whose underlying state variables interact. In this paper, we provide an overview of the m… ▽ More

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: 30 pages, 9 figures

  38. arXiv:2002.12326  [pdf, other

    cs.LG stat.ML

    Estimating the Effects of Continuous-valued Interventions using Generative Adversarial Networks

    Authors: Ioana Bica, James Jordon, Mihaela van der Schaar

    Abstract: While much attention has been given to the problem of estimating the effect of discrete interventions from observational data, relatively little work has been done in the setting of continuous-valued interventions, such as treatments associated with a dosage parameter. In this paper, we tackle this problem by building on a modification of the generative adversarial networks (GANs) framework. Our m… ▽ More

    Submitted 22 November, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

    Journal ref: Advances in Neural Information Processing Systems (2020)

  39. arXiv:2002.04083  [pdf, other

    cs.LG stat.ML

    Estimating Counterfactual Treatment Outcomes over Time Through Adversarially Balanced Representations

    Authors: Ioana Bica, Ahmed M. Alaa, James Jordon, Mihaela van der Schaar

    Abstract: Identifying when to give treatments to patients and how to select among multiple treatments over time are important medical problems with a few existing solutions. In this paper, we introduce the Counterfactual Recurrent Network (CRN), a novel sequence-to-sequence model that leverages the increasingly available patient observational data to estimate treatment effects over time and answer such medi… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Journal ref: In Proc. 8th International Conference on Learning Representations (ICLR 2020)

  40. arXiv:2001.08345  [pdf, other

    stat.ML cs.LG

    Target-Embedding Autoencoders for Supervised Representation Learning

    Authors: Daniel Jarrett, Mihaela van der Schaar

    Abstract: Autoencoder-based learning has emerged as a staple for disciplining representations in unsupervised and semi-supervised settings. This paper analyzes a framework for improving generalization in a purely supervised setting, where the target space is high-dimensional. We motivate and formalize the general framework of target-embedding autoencoders (TEA) for supervised prediction, learning intermedia… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Journal ref: In Proc. 8th International Conference on Learning Representations (ICLR 2020)

  41. arXiv:2001.04754  [pdf, other

    stat.ML cs.LG

    Learning Overlap** Representations for the Estimation of Individualized Treatment Effects

    Authors: Yao Zhang, Alexis Bellot, Mihaela van der Schaar

    Abstract: The choice of making an intervention depends on its potential benefit or harm in comparison to alternatives. Estimating the likely outcome of alternatives from observational data is a challenging problem as all outcomes are never observed, and selection bias precludes the direct comparison of differently intervened groups. Despite their empirical success, we show that algorithms that learn domain-… ▽ More

    Submitted 17 February, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Journal ref: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  42. arXiv:2001.03898  [pdf, other

    cs.LG stat.ML

    Stepwise Model Selection for Sequence Prediction via Deep Kernel Learning

    Authors: Yao Zhang, Daniel Jarrett, Mihaela van der Schaar

    Abstract: An essential problem in automated machine learning (AutoML) is that of model selection. A unique challenge in the sequential setting is the fact that the optimal model itself may vary over time, depending on the distribution of features and labels available up to each point in time. In this paper, we propose a novel Bayesian optimization (BO) algorithm to tackle the challenge of model selection in… ▽ More

    Submitted 14 February, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

    Journal ref: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  43. arXiv:2001.02585  [pdf, other

    cs.LG stat.ML

    Learning Dynamic and Personalized Comorbidity Networks from Event Data using Deep Diffusion Processes

    Authors: Zhaozhi Qian, Ahmed M. Alaa, Alexis Bellot, Jem Rashbass, Mihaela van der Schaar

    Abstract: Comorbid diseases co-occur and progress via complex temporal patterns that vary among individuals. In electronic health records we can observe the different diseases a patient has, but can only infer the temporal relationship between each co-morbid condition. Learning such temporal patterns from event data is crucial for understanding disease pathology and predicting prognoses. To this end, we dev… ▽ More

    Submitted 19 January, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

  44. arXiv:2001.02463  [pdf, other

    stat.ML cs.LG

    Contextual Constrained Learning for Dose-Finding Clinical Trials

    Authors: Hyun-Suk Lee, Cong Shen, James Jordon, Mihaela van der Schaar

    Abstract: Clinical trials in the medical domain are constrained by budgets. The number of patients that can be recruited is therefore limited. When a patient population is heterogeneous, this creates difficulties in learning subgroup specific responses to a particular drug and especially for a variety of dosages. In addition, patient recruitment can be difficult by the fact that clinical trials do not aim t… ▽ More

    Submitted 23 February, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: 18 pages, 5 figures, in Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020, Palermo, Italy

  45. arXiv:1912.09086  [pdf, other

    stat.ML cs.LG stat.AP

    A Bayesian Approach to Modelling Longitudinal Data in Electronic Health Records

    Authors: Alexis Bellot, Mihaela van der Schaar

    Abstract: Analyzing electronic health records (EHR) poses significant challenges because often few samples are available describing a patient's health and, when available, their information content is highly diverse. The problem we consider is how to integrate sparsely sampled longitudinal data, missing measurements informative of the underlying health status and fixed demographic information to produce est… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

    Comments: Presented as an abstract at the Machine Learning for Health Workshop, NeurIPS 2019

  46. arXiv:1911.12441  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Model Robustness Using Causal Knowledge

    Authors: Trent Kyono, Mihaela van der Schaar

    Abstract: For decades, researchers in fields, such as the natural and social sciences, have been verifying causal relationships and investigating hypotheses that are now well-established or understood as truth. These causal mechanisms are properties of the natural world, and thus are invariant conditions regardless of the collection domain or environment. We show in this paper how prior knowledge in the for… ▽ More

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: 14 pages, 12 figures

  47. arXiv:1907.04081  [pdf, other

    stat.ME stat.ML

    Kernel Hypothesis Testing with Set-valued Data

    Authors: Alexis Bellot, Mihaela van der Schaar

    Abstract: We present a general framework for hypothesis testing on distributions of sets of individual examples. Sets may represent many common data sources such as groups of observations in time series, collections of words in text or a batch of images of a given phenomenon. This observation pattern, however, differs from the common assumptions required for hypothesis testing: each set differs in size, may… ▽ More

    Submitted 2 February, 2021; v1 submitted 9 July, 2019; originally announced July 2019.

  48. arXiv:1907.04068  [pdf, other

    stat.ML cs.LG

    Conditional Independence Testing using Generative Adversarial Networks

    Authors: Alexis Bellot, Mihaela van der Schaar

    Abstract: We consider the hypothesis testing problem of detecting conditional dependence, with a focus on high-dimensional feature spaces. Our contribution is a new test statistic based on samples from a generative adversarial network designed to approximate directly a conditional distribution that encodes the null hypothesis, in a manner that maximizes power (the rate of true negatives). We show that such… ▽ More

    Submitted 18 December, 2019; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: Updated version published at NeurIPS 2019

  49. arXiv:1906.06796  [pdf, other

    cs.LG stat.ML

    ASAC: Active Sensing using Actor-Critic models

    Authors: **sung Yoon, James Jordon, Mihaela van der Schaar

    Abstract: Deciding what and when to observe is critical when making observations is costly. In a medical setting where observations can be made sequentially, making these observations (or not) should be an active choice. We refer to this as the active sensing problem. In this paper, we propose a novel deep learning framework, which we call ASAC (Active Sensing using Actor-Critic models) to address this prob… ▽ More

    Submitted 16 June, 2019; originally announced June 2019.

    Comments: Accepted in 2019 Machine Learning for Healthcare Conference

  50. arXiv:1905.12280  [pdf, other

    stat.ML cs.LG

    Lifelong Bayesian Optimization

    Authors: Yao Zhang, James Jordon, Ahmed M. Alaa, Mihaela van der Schaar

    Abstract: Automatic Machine Learning (Auto-ML) systems tackle the problem of automating the design of prediction models or pipelines for data science. In this paper, we present Lifelong Bayesian Optimization (LBO), an online, multitask Bayesian optimization (BO) algorithm designed to solve the problem of model selection for datasets arriving and evolving over time. To be suitable for "lifelong" Bayesian Opt… ▽ More

    Submitted 21 June, 2019; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: 17 pages, 8 figures