Search | arXiv e-print repository

Local Constraint-Based Causal Discovery under Selection Bias

Authors: Philip Versteeg, Cheng Zhang, Joris M. Mooij

Abstract: We consider the problem of discovering causal relations from independence constraints selection bias in addition to confounding is present. While the seminal FCI algorithm is sound and complete in this setup, no criterion for the causal interpretation of its output under selection bias is presently known. We focus instead on local patterns of independence relations, where we find no sound method f… ▽ More We consider the problem of discovering causal relations from independence constraints selection bias in addition to confounding is present. While the seminal FCI algorithm is sound and complete in this setup, no criterion for the causal interpretation of its output under selection bias is presently known. We focus instead on local patterns of independence relations, where we find no sound method for only three variable that can include background knowledge. Y-Structure patterns are shown to be sound in predicting causal relations from data under selection bias, where cycles may be present. We introduce a finite-sample scoring rule for Y-Structures that is shown to successfully predict causal relations in simulation experiments that include selection mechanisms. On real-world microarray data, we show that a Y-Structure variant performs well across different datasets, potentially circumventing spurious correlations due to selection bias. △ Less

Submitted 3 March, 2022; originally announced March 2022.

Comments: Accepted at the 1st Conference on Causal Learning and Reasoning

arXiv:1910.02505 [pdf, other]

doi 10.1109/BIBM47256.2019.8983232

Boosting Local Causal Discovery in High-Dimensional Expression Data

Authors: Philip Versteeg, Joris M. Mooij

Abstract: We study the performance of Local Causal Discovery (LCD), a simple and efficient constraint-based method for causal discovery, in predicting causal effects in large-scale gene expression data. We construct practical estimators specific to the high-dimensional regime. Inspired by the ICP algorithm, we use an optional preselection method and two different statistical tests. Empirically, the resultin… ▽ More We study the performance of Local Causal Discovery (LCD), a simple and efficient constraint-based method for causal discovery, in predicting causal effects in large-scale gene expression data. We construct practical estimators specific to the high-dimensional regime. Inspired by the ICP algorithm, we use an optional preselection method and two different statistical tests. Empirically, the resulting LCD estimator is seen to closely approach the accuracy of ICP, the state-of-the-art method, while it is algorithmically simpler and computationally more efficient. △ Less

Submitted 1 November, 2019; v1 submitted 6 October, 2019; originally announced October 2019.

Comments: Accepted at BIBM / CABB 2019

Journal ref: 2019 IEEE Intl. Conf. Bioinf. and Biomed. (BIBM 2019) pp. 2599-2604

arXiv:1707.06422 [pdf, other]

Domain Adaptation by Using Causal Inference to Predict Invariant Conditional Distributions

Authors: Sara Magliacane, Thijs van Ommen, Tom Claassen, Stephan Bongers, Philip Versteeg, Joris M. Mooij

Abstract: An important goal common to domain adaptation and causal inference is to make accurate predictions when the distributions for the source (or training) domain(s) and target (or test) domain(s) differ. In many cases, these different distributions can be modeled as different contexts of a single underlying system, in which each distribution corresponds to a different perturbation of the system, or in… ▽ More An important goal common to domain adaptation and causal inference is to make accurate predictions when the distributions for the source (or training) domain(s) and target (or test) domain(s) differ. In many cases, these different distributions can be modeled as different contexts of a single underlying system, in which each distribution corresponds to a different perturbation of the system, or in causal terms, an intervention. We focus on a class of such causal domain adaptation problems, where data for one or more source domains are given, and the task is to predict the distribution of a certain target variable from measurements of other variables in one or more target domains. We propose an approach for solving these problems that exploits causal inference and does not rely on prior knowledge of the causal graph, the type of interventions or the intervention targets. We demonstrate our approach by evaluating a possible implementation on simulated and real world data. △ Less

Submitted 29 October, 2018; v1 submitted 20 July, 2017; originally announced July 2017.

Comments: Camera-ready version, to be published in the proceedings of Neural Information Processing Systems 2018 (NIPS*2018)

Journal ref: Advances in Neural Information Processing Systems 31 (NeurIPS*2018), 10869-10879

Showing 1–3 of 3 results for author: Versteeg, P