
The DeCAMFounder: Non-Linear Causal Discovery in the Presence of Hidden Variables

Authors: Raj Agrawal, Chandler Squires, Neha Prasad, Caroline Uhler

Abstract: Many real-world decision-making tasks require learning causal relationships between a set of variables. Traditional causal discovery methods, however, require that all variables are observed, which is often not feasible in practical scenarios. Without additional assumptions about the unobserved variables, it is not possible to recover any causal relationships from observational data. Fortunately, in many applied settings, additional structure among the confounders can be expected. In particular, pervasive confounding is commonly encountered and has been utilized for consistent causal estimation in linear causal models. In this paper, we present a provably consistent method to estimate causal relationships in the non-linear, pervasive confounding setting. The core of our procedure relies on the ability to estimate the confounding variation through a simple spectral decomposition of the observed data matrix. We derive a DAG score function based on this insight, prove its consistency in recovering a correct ordering of the DAG, and empirically compare it to previous approaches. We demonstrate improved performance on both simulated and real datasets by explicitly accounting for both confounders and non-linear effects.

Submitted 25 June, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

Comments: To appear in Journal of the Royal Statistical Society Series B

arXiv:2007.12098 [pdf, other]

Optimal Transport using GANs for Lineage Tracing

Authors: Neha Prasad, Karren Yang, Caroline Uhler

Abstract: In this paper, we present Super-OT, a novel approach to computational lineage tracing that combines a supervised learning framework with optimal transport based on Generative Adversarial Networks (GANs). Unlike previous approaches to lineage tracing, Super-OT has the flexibility to integrate paired data. We benchmark Super-OT based on single-cell RNA-seq data against Waddington-OT, a popular approach for lineage tracing that also employs optimal transport. We show that Super-OT achieves gains over Waddington-OT in predicting the class outcome of cells during differentiation, since it allows the integration of additional information during training.

Submitted 5 January, 2022; v1 submitted 23 July, 2020; originally announced July 2020.

Comments: 4 pages excluding references, 2 figures, 3 tables. Accepted at ICML 2020 Workshop on Computational Biology for Spotlight Presentation. Code can be found here: https://github.com/uhlerlab/superot

arXiv:1905.13167 [pdf, other]

Defining Admissible Rewards for High Confidence Policy Evaluation

Authors: Niranjani Prasad, Barbara E Engelhardt, Finale Doshi-Velez

Abstract: A key impediment to reinforcement learning (RL) in real applications with limited, batch data is defining a reward function that reflects what we implicitly know about reasonable behaviour for a task and allows for robust off-policy evaluation. In this work, we develop a method to identify an admissible set of reward functions for policies that (a) do not diverge too far from past behaviour, and (b) can be evaluated with high confidence, given only a collection of past trajectories. Together, these ensure that we propose policies that we trust to be implemented in high-risk settings. We demonstrate our approach to reward design on synthetic domains as well as in a critical care context, for a reward that consolidates clinical objectives to learn a policy for weaning patients from mechanical ventilation.

Submitted 30 May, 2019; originally announced May 2019.

arXiv:1808.04679 [pdf, other]

An Optimal Policy for Patient Laboratory Tests in Intensive Care Units

Authors: Li-Fang Cheng, Niranjani Prasad, Barbara E Engelhardt

Abstract: Laboratory testing is an integral tool in the management of patient care in hospitals, particularly in intensive care units (ICUs). There exists an inherent trade-off in the selection and timing of lab tests between considerations of the expected utility in clinical decision-making of a given test at a specific time, and the associated cost or risk it poses to the patient. In this work, we introduce a framework that learns policies for ordering lab tests which optimizes for this trade-off. Our approach uses batch off-policy reinforcement learning with a composite reward function based on clinical imperatives, applied to data that include examples of clinicians ordering labs for patients. To this end, we develop and extend principles of Pareto optimality to improve the selection of actions based on multiple reward function components while respecting typical procedural considerations and prioritization of clinical goals in the ICU. Our experiments show that we can estimate a policy that reduces the frequency of lab tests and optimizes timing to minimize information redundancy. We also find that the estimated policies typically suggest ordering lab tests well ahead of critical onsets--such as mechanical ventilation or dialysis--that depend on the lab results. We evaluate our approach by quantifying how these policies may initiate earlier onset of treatment.

Submitted 14 August, 2018; originally announced August 2018.

Comments: The first two authors contributed equally to this work. Preprint of an article submitted for consideration in Pacific Symposium on Biocomputing copyright 2018 [copyright World Scientific Publishing Company] [https://psb.stanford.edu/]

arXiv:1307.4048 [pdf, ps, other]

doi 10.1109/ASRU.2013.6707725

Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition

Authors: D. S. Pavan Kumar, N. Vishnu Prasad, Vikas Joshi, S. Umesh

Abstract: In this paper, a modification to the training process of the popular SPLICE algorithm has been proposed for noise robust speech recognition. The modification is based on feature correlations, and enables this stereo-based algorithm to improve the performance in all noise conditions, especially in unseen cases. Further, the modified framework is extended to work for non-stereo datasets where clean and noisy training utterances, but not stereo counterparts, are required. Finally, an MLLR-based computationally efficient run-time noise adaptation method in SPLICE framework has been proposed. The modified SPLICE shows 8.6% absolute improvement over SPLICE in Test C of Aurora-2 database, and 2.93% overall. Non-stereo method shows 10.37% and 6.93% absolute improvements over Aurora-2 and Aurora-4 baseline models respectively. Run-time adaptation shows 9.89% absolute improvement in modified framework as compared to SPLICE for Test C, and 4.96% overall w.r.t. standard MLLR adaptation on HMMs.

Submitted 15 July, 2013; originally announced July 2013.

Comments: Submitted to Automatic Speech Recognition and Understanding (ASRU) 2013 Workshop

Showing 1–5 of 5 results for author: Prasad, N