-
Risk ratio, odds ratio, risk difference... Which causal measure is easier to generalize?
Authors:
Bénédicte Colnet,
Julie Josse,
Gaël Varoquaux,
Erwan Scornet
Abstract:
There are many measures to report so-called treatment or causal effects: absolute difference, ratio, odds ratio, number needed to treat, and so on. The choice of a measure, eg absolute versus relative, is often debated because it leads to different impressions of the benefit or risk of a treatment. Besides, different causal measures may lead to various treatment effect heterogeneity: some input va…
▽ More
There are many measures to report so-called treatment or causal effects: absolute difference, ratio, odds ratio, number needed to treat, and so on. The choice of a measure, eg absolute versus relative, is often debated because it leads to different impressions of the benefit or risk of a treatment. Besides, different causal measures may lead to various treatment effect heterogeneity: some input variables may have an influence on some causal measures and no effect at all on others. In addition some measures - but not all - have appealing properties such as collapsibility, matching the intuition of a population summary. In this paper, we first review common causal measures and their pros and cons typically brought forward. Doing so, we clarify the notions of collapsibility and treatment effect heterogeneity, unifying existing definitions. Then, we show that for any causal measures there exists a generative model such that the conditional average treatment effect (CATE) captures the treatment effect. However, only the risk difference can disentangle the treatment effect from the baseline at both population and strata levels, regardless of the outcome type (continuous or binary). As our primary goal is the generalization of causal measures, we show that different sets of covariates are needed to generalize an effect to a target population depending on (i) the causal measure of interest, and (ii) the identification method chosen, that is generalizing either conditional outcome or local effects.
△ Less
Submitted 30 March, 2024; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Reweighting the RCT for generalization: finite sample error and variable selection
Authors:
Bénédicte Colnet,
Julie Josse,
Gaël Varoquaux,
Erwan Scornet
Abstract:
Randomized Controlled Trials (RCTs) may suffer from limited scope. In particular, samples may be unrepresentative: some RCTs over- or under- sample individuals with certain characteristics compared to the target population, for which one wants conclusions on treatment effectiveness. Re-weighting trial individuals to match the target population can improve the treatment effect estimation. In this w…
▽ More
Randomized Controlled Trials (RCTs) may suffer from limited scope. In particular, samples may be unrepresentative: some RCTs over- or under- sample individuals with certain characteristics compared to the target population, for which one wants conclusions on treatment effectiveness. Re-weighting trial individuals to match the target population can improve the treatment effect estimation. In this work, we establish the exact expressions of the bias and variance of such reweighting procedures -- also called Inverse Propensity of Sampling Weighting (IPSW) -- in presence of categorical covariates for any sample size. Such results allow us to compare the theoretical performance of different versions of IPSW estimates. Besides, our results show how the performance (bias, variance, and quadratic risk) of IPSW estimates depends on the two sample sizes (RCT and target population). A by-product of our work is the proof of consistency of IPSW estimates. Results also reveal that IPSW performances are improved when the trial probability to be treated is estimated (rather than using its oracle counterpart). In addition, we study choice of variables: how including covariates that are not necessary for identifiability of the causal effect may impact the asymptotic variance. Including covariates that are shifted between the two samples but not treatment effect modifiers increases the variance while non-shifted but treatment effect modifiers do not. We illustrate all the takeaways in a didactic example, and on a semi-synthetic simulation inspired from critical care medicine.
△ Less
Submitted 13 March, 2024; v1 submitted 16 August, 2022;
originally announced August 2022.
-
Causal effect on a target population: a sensitivity analysis to handle missing covariates
Authors:
Bénédicte Colnet,
Julie Josse,
Erwan Scornet,
Gaël Varoquaux
Abstract:
Randomized Controlled Trials (RCTs) are often considered the gold standard for estimating causal effect, but they may lack external validity when the population eligible to the RCT is substantially different from the target population. Having at hand a sample of the target population of interest allows us to generalize the causal effect. Identifying the treatment effect in the target population re…
▽ More
Randomized Controlled Trials (RCTs) are often considered the gold standard for estimating causal effect, but they may lack external validity when the population eligible to the RCT is substantially different from the target population. Having at hand a sample of the target population of interest allows us to generalize the causal effect. Identifying the treatment effect in the target population requires covariates to capture all treatment effect modifiers that are shifted between the two sets. Standard estimators then use either weighting (IPSW), outcome modeling (G-formula), or combine the two in doubly robust approaches (AIPSW). However such covariates are often not available in both sets. In this paper, after proving L1-consistency of these three estimators, we compute the expected bias induced by a missing covariate, assuming a Gaussian distribution, a continuous outcome, and a semi-parametric model. Under this setting, we perform a sensitivity analysis for each missing covariate pattern and compute the sign of the expected bias. We also show that there is no gain in linearly imputing a partially-unobserved covariate. Finally we study the substitution of a missing covariate by a proxy. We illustrate all these results on simulations, as well as semi-synthetic benchmarks using data from the Tennessee Student/Teacher Achievement Ratio (STAR), and a real-world example from critical care medicine.
△ Less
Submitted 10 January, 2023; v1 submitted 13 May, 2021;
originally announced May 2021.
-
Capillary filtering of particles during dip coating
Authors:
Alban Sauret,
Adrien Gans,
Benedicte Colnet,
Guillaume Saingier,
Martin Z. Bazant,
Emilie Dressaire
Abstract:
An object withdrawn from a liquid bath is coated with a thin layer of liquid. Along with the liquid, impurities such as particles present in the bath can be transferred to the withdrawn substrate. Entrained particles locally modify the thickness of the film, hence altering the quality and properties of the coating. In this study, we show that it is possible to entrain the liquid alone and avoid co…
▽ More
An object withdrawn from a liquid bath is coated with a thin layer of liquid. Along with the liquid, impurities such as particles present in the bath can be transferred to the withdrawn substrate. Entrained particles locally modify the thickness of the film, hence altering the quality and properties of the coating. In this study, we show that it is possible to entrain the liquid alone and avoid contamination of the substrate, at sufficiently low withdrawal velocity in diluted suspensions. Using a model system consisting of a plate exiting a liquid bath, we observe that particles can remain trapped in the meniscus which exerts a resistive capillary force to the entrainment. We characterize different entrainment regimes as the withdrawal velocity increases: from a pure liquid film, to a liquid film containing clusters of particles, and eventually individual particles. This capillary filtration is an effective barrier against the contamination of substrates withdrawn from a polluted bath and finds application against biocontamination.
△ Less
Submitted 27 November, 2020;
originally announced November 2020.
-
Causal inference methods for combining randomized trials and observational studies: a review
Authors:
Bénédicte Colnet,
Imke Mayer,
Guanhua Chen,
Awa Dieng,
Ruohong Li,
Gaël Varoquaux,
Jean-Philippe Vert,
Julie Josse,
Shu Yang
Abstract:
With increasing data availability, causal effects can be evaluated across different data sets, both randomized controlled trials (RCTs) and observational studies. RCTs isolate the effect of the treatment from that of unwanted (confounding) co-occurring effects but they may suffer from unrepresentativeness, and thus lack external validity. On the other hand, large observational samples are often mo…
▽ More
With increasing data availability, causal effects can be evaluated across different data sets, both randomized controlled trials (RCTs) and observational studies. RCTs isolate the effect of the treatment from that of unwanted (confounding) co-occurring effects but they may suffer from unrepresentativeness, and thus lack external validity. On the other hand, large observational samples are often more representative of the target population but can conflate confounding effects with the treatment of interest. In this paper, we review the growing literature on methods for causal inference on combined RCTs and observational studies, striving for the best of both worlds. We first discuss identification and estimation methods that improve generalizability of RCTs using the representativeness of observational data. Classical estimators include weighting, difference between conditional outcome models, and doubly robust estimators. We then discuss methods that combine RCTs and observational data to either ensure uncounfoundedness of the observational analysis or to improve (conditional) average treatment effect estimation. We also connect and contrast works developed in both the potential outcomes literature and the structural causal model literature. Finally, we compare the main methods using a simulation study and real world data to analyze the effect of tranexamic acid on the mortality rate in major trauma patients. A review of available codes and new implementations is also provided.
△ Less
Submitted 10 January, 2023; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Dip-coating of suspensions
Authors:
A. Gans,
E. Dressaire,
B. Colnet,
G. Saingier,
M. Z. Bazant,
A. Sauret
Abstract:
Withdrawing a plate from a suspension leads to the entrainment of a coating layer of fluid and particles on the solid surface. In this article, we study the Landau-Levich problem in the case of a suspension of non-Brownian particles at moderate volume fraction $10\% < φ< 41\%$. We observe different regimes depending on the withdrawal velocity $U$, the volume fraction of the suspension $φ$, and the…
▽ More
Withdrawing a plate from a suspension leads to the entrainment of a coating layer of fluid and particles on the solid surface. In this article, we study the Landau-Levich problem in the case of a suspension of non-Brownian particles at moderate volume fraction $10\% < φ< 41\%$. We observe different regimes depending on the withdrawal velocity $U$, the volume fraction of the suspension $φ$, and the diameter of the particles $2\,a$. Our results exhibit three coating regimes. (i) At small enough capillary number $Ca$, no particles are entrained, and only a liquid film coats the plate. (ii) At large capillary number, we observe that the thickness of the entrained film of suspension is captured by the Landau-Levich law using the effective viscosity of the suspension $η(φ)$. (iii) At intermediate capillary numbers, the situation becomes more complicated with a heterogeneous coating on the substrate. We rationalize our experimental findings by providing the domain of existence of these three regimes as a function of the fluid and particles properties.
△ Less
Submitted 22 March, 2019;
originally announced March 2019.