doi 10.1613/jair.1.15579

Simulating counterfactuals

Authors: Juha Karvanen, Santtu Tikka, Matti Vihola

Abstract: Counterfactual inference considers a hypothetical intervention in a parallel world that shares some evidence with the factual world. If the evidence specifies a conditional distribution on a manifold, counterfactuals may be analytically intractable. We present an algorithm for simulating values from a counterfactual distribution where conditions can be set on both discrete and continuous variables. We show that the proposed algorithm can be presented as a particle filter leading to asymptotically valid inference. The algorithm is applied to fairness analysis in credit-scoring.

Submitted 26 March, 2024; v1 submitted 27 June, 2023; originally announced June 2023.

Journal ref: Journal of Artificial Intelligence Research 80, 835-857, 2024

arXiv:2206.06699 [pdf, ps, other]

Generalizing experimental findings: identification beyond adjustments

Authors: Juha Karvanen

Abstract: We aim to generalize the results of a randomized controlled trial (RCT) to a target population with the help of some observational data. This is a problem of causal effect identification with multiple data sources. Challenges arise when the RCT is conducted in a context that differs from the target population. Earlier research has focused on cases where the estimates from the RCT can be adjusted by observational data in order to remove the selection bias and other domain-specific differences. We consider examples where the experimental findings cannot be generalized by an adjustment and show that the generalization may still be possible by other identification strategies that can be derived by applying do-calculus. The obtained identifying functionals for these examples contain trapdoor variables of a new type. The value of a trapdoor variable needs to be fixed in the estimation and the choice of the value may have a major effect on the bias and accuracy of estimates, which is also seen in simulations. The presented results expand the scope of settings where the generalization of experimental findings is doable.

Submitted 14 June, 2022; originally announced June 2022.

MSC Class: 62D20; 62H12; 62H22

arXiv:2111.04513 [pdf, ps, other]

Clustering and Structural Robustness in Causal Diagrams

Authors: Santtu Tikka, Jouni Helske, Juha Karvanen

Abstract: Graphs are commonly used to represent and visualize causal relations. For a small number of variables, this approach provides a succinct and clear view of the scenario at hand. As the number of variables under study increases, the graphical approach may become impractical, and the clarity of the representation is lost. Clustering of variables is a natural way to reduce the size of the causal diagram, but it may erroneously change the essential properties of the causal relations if implemented arbitrarily. We define a specific type of cluster, called transit cluster, that is guaranteed to preserve the identifiability properties of causal effects under certain conditions. We provide a sound and complete algorithm for finding all transit clusters in a given graph and demonstrate how clustering can simplify the identification of causal effects. We also study the inverse problem, where one starts with a clustered graph and looks for extended graphs where the identifiability properties of causal effects remain unchanged. We show that this kind of structural robustness is closely related to transit clusters.

Submitted 15 August, 2023; v1 submitted 8 November, 2021; originally announced November 2021.

Comments: This is the version published in JMLR

Journal ref: Journal of Machine Learning Research, 24(195):1-32, 2023

arXiv:2009.09768 [pdf, other]

Identifying Causal Effects via Context-specific Independence Relations

Authors: Santtu Tikka, Antti Hyttinen, Juha Karvanen

Abstract: Causal effect identification considers whether an interventional probability distribution can be uniquely determined from a passively observed distribution in a given causal structure. If the generating system induces context-specific independence (CSI) relations, the existing identification procedures and criteria based on do-calculus are inherently incomplete. We show that deciding causal effect non-identifiability is NP-hard in the presence of CSIs. Motivated by this, we design a calculus and an automated search procedure for identifying causal effects in the presence of CSIs. The approach is provably sound and it includes standard do-calculus as a special case. With the approach we can obtain identifying formulas that were unobtainable previously, and demonstrate that a small number of CSI relations may be sufficient to turn a previously non-identifiable instance into an identifiable one.

Submitted 21 September, 2020; originally announced September 2020.

Comments: Appeared at 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

Journal ref: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

arXiv:1902.01073 [pdf, other]

doi 10.18637/jss.v099.i05

Causal Effect Identification from Multiple Incomplete Data Sources: A General Search-based Approach

Authors: Santtu Tikka, Antti Hyttinen, Juha Karvanen

Abstract: Causal effect identification considers whether an interventional probability distribution can be uniquely determined without parametric assumptions from measured source distributions and structural knowledge on the generating system. While complete graphical criteria and procedures exist for many identification problems, there are still challenging but important extensions that have not been considered in the literature. To tackle these new settings, we present a search algorithm directly over the rules of do-calculus. Due to the generality of do-calculus, the search is capable of taking more advanced data-generating mechanisms into account along with an arbitrary type of both observational and experimental source distributions. The search is enhanced via a heuristic and search space reduction techniques. The approach, called do-search, is provably sound, and it is complete with respect to identifiability problems that have been shown to be completely characterized by do-calculus. When extended with additional rules, the search is capable of handling missing data problems as well. With the versatile search, we are able to approach new problems such as combined transportability and selection bias, or multiple sources of selection bias. We perform a systematic analysis of bivariate missing data problems and study causal inference under case-control design. We also present the R package dosearch that provides an interface for a C++ implementation of the search.

Submitted 27 August, 2021; v1 submitted 4 February, 2019; originally announced February 2019.

Comments: This is the version published in the Journal of Statistical Software

Journal ref: Journal of Statistical Software, 99(5):1-40, 2021

arXiv:1806.07172 [pdf, ps, other]

doi 10.1016/j.ijar.2019.02.007

Surrogate Outcomes and Transportability

Authors: Santtu Tikka, Juha Karvanen

Abstract: Identification of causal effects is one of the most fundamental tasks of causal inference. We consider an identifiability problem where some experimental and observational data are available but neither data alone is sufficient for the identification of the causal effect of interest. Instead of the outcome of interest, surrogate outcomes are measured in the experiments. This problem is a generalization of identifiability using surrogate experiments and we label it as surrogate outcome identifiability. We show that the concept of transportability provides a sufficient criterion for determining surrogate outcome identifiability for a large class of queries.

Submitted 12 March, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

Comments: This is the version published in the International Journal of Approximate Reasoning

Journal ref: International Journal of Approximate Reasoning, 2019; 108: 21-37

arXiv:1806.07085 [pdf, ps, other]

Enhancing Identification of Causal Effects by Pruning

Authors: Santtu Tikka, Juha Karvanen

Abstract: Causal models communicate our assumptions about causes and effects in real-world phenomena. Often the interest lies in the identification of the effect of an action which means deriving an expression from the observed probability distribution for the interventional distribution resulting from the action. In many cases an identifiability algorithm may return a complicated expression that contains variables that are in fact unnecessary. In practice this can lead to additional computational burden and increased bias or inefficiency of estimates when dealing with measurement error or missing data. We present graphical criteria to detect variables which are redundant in identifying causal effects. We also provide an improved version of a well-known identifiability algorithm that implements these criteria.

Submitted 19 June, 2018; originally announced June 2018.

Comments: This is the version published in JMLR

Journal ref: Journal of Machine Learning Research (JMLR), 18(194):1-23, 2018

arXiv:1806.07082 [pdf, other]

Simplifying Probabilistic Expressions in Causal Inference

Authors: Santtu Tikka, Juha Karvanen

Abstract: Obtaining a non-parametric expression for an interventional distribution is one of the most fundamental tasks in causal inference. Such an expression can be obtained for an identifiable causal effect by an algorithm or by manual application of do-calculus. Often we are left with a complicated expression which can lead to biased or inefficient estimates when missing data or measurement errors are involved. We present an automatic simplification algorithm that seeks to eliminate symbolically unnecessary variables from these expressions by taking advantage of the structure of the underlying graphical model. Our method is applicable to all causal effect formulas and is readily available in the R package causaleffect.

Submitted 19 June, 2018; originally announced June 2018.

Comments: This is the version published in JMLR

Journal ref: Journal of Machine Learning Research (JMLR), 18(36):1-30, 2017

arXiv:1403.1124 [pdf, ps, other]

Estimating complex causal effects from incomplete observational data

Authors: Juha Karvanen

Abstract: Despite the major advances made in causal modeling, causality is still an unfamiliar topic for many statisticians. In this paper, it is demonstrated from beginning to end how causal effects can be estimated from observational data assuming that the causal structure is known. To make the problem more challenging, the causal effects are highly nonlinear and the data are missing at random. The tools used in the estimation include causal models with design, causal calculus, multiple imputation and generalized additive models. The main message is that a trained statistician can estimate causal effects by judiciously combining existing tools.

Submitted 2 July, 2014; v1 submitted 5 March, 2014; originally announced March 2014.

Showing 1–9 of 9 results for author: Karvanen, J