-
FedECA: A Federated External Control Arm Method for Causal Inference with Time-To-Event Data in Distributed Settings
Authors:
Jean Ogier du Terrail,
Quentin Klopfenstein,
Honghao Li,
Imke Mayer,
Nicolas Loiseau,
Mohammad Hallal,
Félix Balazard,
Mathieu Andreux
Abstract:
External control arms (ECA) can inform the early clinical development of experimental drugs and provide efficacy evidence for regulatory approval in non-randomized settings. However, the main challenge of implementing ECA lies in accessing real-world data or historical clinical trials. Indeed, data sharing is often not feasible due to privacy considerations related to data leaving the original collection centers, along with pharmaceutical companies' competitive motives. In this paper, we leverage a privacy-enhancing technology called federated learning (FL) to remove some of the barriers to data sharing. We introduce a federated learning inverse probability of treatment weighted (IPTW) method for time-to-event outcomes called FedECA which eases the implementation of ECA by limiting patients' data exposure. We show with extensive experiments that FedECA outperforms its closest competitor, matching-adjusted indirect comparison (MAIC), in terms of statistical power and ability to balance the treatment and control groups. To encourage the use of such methods, we publicly release our code which relies on Substra, an open-source FL software with proven experience in privacy-sensitive contexts.
Submitted 20 December, 2023; v1 submitted 28 November, 2023;
originally announced November 2023.
-
Generalizing treatment effects with incomplete covariates: identifying assumptions and multiple imputation algorithms
Authors:
Imke Mayer,
Julie Josse,
Traumabase Group
Abstract:
We focus on the problem of generalizing a causal effect estimated on a randomized controlled trial (RCT) to a target population described by a set of covariates from observational data. Available methods such as inverse propensity sampling weighting are not designed to handle missing values, which are however common in both data sources. In addition to coupling the assumptions for causal effect identifiability and for the missing values mechanism and to defining appropriate estimation strategies, one difficulty is to consider the specific structure of the data with two sources and treatment and outcome only available in the RCT. We propose three multiple imputation strategies to handle missing values when generalizing treatment effects, each handling the multi-source structure of the problem differently (separate imputation, joint imputation with fixed effect, joint imputation ignoring source information). As an alternative to multiple imputation, we also propose a direct estimation approach that treats incomplete covariates as semi-discrete variables. The multiple imputation strategies and the latter alternative rely on different sets of assumptions concerning the impact of missing values on identifiability. We discuss these assumptions and assess the methods through an extensive simulation study. This work is motivated by the analysis of a large registry of over 20,000 major trauma patients and an RCT studying the effect of tranexamic acid administration on mortality in major trauma patients admitted to ICU. The analysis illustrates how the missing values handling can impact the conclusion about the effect generalized from the RCT to the target population.
Submitted 24 February, 2023; v1 submitted 26 April, 2021;
originally announced April 2021.
-
The influence of cations on the dipole moments of neighboring polar molecules
Authors:
Imre Bakó,
Dániel Csókás,
István Mayer,
Szilvia Pothoczki,
László Pusztai
Abstract:
It is shown that the dipole moment of polar (water, methanol, formamide, acetone and acetonitrile) molecules in the neighborhood of a cation is increased primarily by polarization from the bare electrostatic charge of the cation, although the effective value of the latter is somewhat reduced by "back donation" of electrons from neighbouring polar molecules. In other words, the classical picture may be viewed as if a point charge slightly smaller than the nominal charge of the cation were placed at the cation site. It was found that the geometrical arrangement of the polar molecules in the first solvation shell is such that their mutual polarization reduces the dipole moments of individual molecules, so that in some cases they become smaller than the dipole moment of the free protic or aprotic molecule. We conjecture that this behavior is essentially a manifestation of the Le Chatelier-Braun principle.
Submitted 30 March, 2021;
originally announced March 2021.
-
Causal inference methods for combining randomized trials and observational studies: a review
Authors:
Bénédicte Colnet,
Imke Mayer,
Guanhua Chen,
Awa Dieng,
Ruohong Li,
Gaël Varoquaux,
Jean-Philippe Vert,
Julie Josse,
Shu Yang
Abstract:
With increasing data availability, causal effects can be evaluated across different data sets, both randomized controlled trials (RCTs) and observational studies. RCTs isolate the effect of the treatment from that of unwanted (confounding) co-occurring effects but they may suffer from unrepresentativeness, and thus lack external validity. On the other hand, large observational samples are often more representative of the target population but can conflate confounding effects with the treatment of interest. In this paper, we review the growing literature on methods for causal inference on combined RCTs and observational studies, striving for the best of both worlds. We first discuss identification and estimation methods that improve generalizability of RCTs using the representativeness of observational data. Classical estimators include weighting, difference between conditional outcome models, and doubly robust estimators. We then discuss methods that combine RCTs and observational data to either ensure unconfoundedness of the observational analysis or to improve (conditional) average treatment effect estimation. We also connect and contrast works developed in both the potential outcomes literature and the structural causal model literature. Finally, we compare the main methods using a simulation study and real world data to analyze the effect of tranexamic acid on the mortality rate in major trauma patients. A review of available codes and new implementations is also provided.
Submitted 10 January, 2023; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Topological defects and SUSY RG flow
Authors:
Ilka Brunner,
Ingrid Mayer,
Cornelius Schmidt-Colinet
Abstract:
We study the effect of bulk perturbations of N=(2,2) superconformal minimal models on topological defects. In particular, symmetries and more general topological defects which survive the flow to the IR are identified. Our method is to consider the topological subsector and make use of the Landau-Ginzburg formulation to describe RG flows and topological defects in terms of matrix factorizations.
Submitted 5 July, 2020;
originally announced July 2020.
-
MissDeepCausal: Causal Inference from Incomplete Data Using Deep Latent Variable Models
Authors:
Imke Mayer,
Julie Josse,
Félix Raimundo,
Jean-Philippe Vert
Abstract:
Inferring causal effects of a treatment, intervention or policy from observational data is central to many applications. However, state-of-the-art methods for causal inference seldom consider the possibility that covariates have missing values, which is ubiquitous in many real-world analyses. Missing data greatly complicate causal inference procedures as they require an adapted unconfoundedness hypothesis which can be difficult to justify in practice. We circumvent this issue by considering latent confounders whose distribution is learned through variational autoencoders adapted to missing values. They can be used as a pre-processing step prior to causal inference, but we also suggest embedding them in a multiple imputation strategy to take into account the variability due to missing values. Numerical experiments demonstrate the effectiveness of the proposed methodology, especially for non-linear models, compared to competitors.
Submitted 25 February, 2020;
originally announced February 2020.
-
Doubly robust treatment effect estimation with missing attributes
Authors:
Imke Mayer,
Erik Sverdrup,
Tobias Gauss,
Jean-Denis Moyer,
Stefan Wager,
Julie Josse
Abstract:
Missing attributes are ubiquitous in causal inference, as they are in most applied statistical work. In this paper, we consider various sets of assumptions under which causal inference is possible despite missing attributes and discuss corresponding approaches to average treatment effect estimation, including generalized propensity score methods and multiple imputation. Across an extensive simulation study, we show that no single method systematically outperforms others. We find, however, that doubly robust modifications of standard methods for average treatment effect estimation with missing data repeatedly perform better than their non-doubly robust baselines; for example, doubly robust generalized propensity score methods beat inverse-weighting with the generalized propensity score. This finding is reinforced in an analysis of an observational study on the effect on mortality of tranexamic acid administration among patients with traumatic brain injury in the context of critical care management. Here, doubly robust estimators recover confidence intervals that are consistent with evidence from randomized trials, whereas non-doubly robust estimators do not.
Submitted 22 May, 2020; v1 submitted 23 October, 2019;
originally announced October 2019.
-
R-miss-tastic: a unified platform for missing values methods and workflows
Authors:
Imke Mayer,
Aude Sportisse,
Julie Josse,
Nicholas Tierney,
Nathalie Vialaneix
Abstract:
Missing values are unavoidable when working with data. Their occurrence is exacerbated as more data from different sources become available. However, most statistical models and visualization methods require complete data, and improper handling of missing data results in information loss or biased analyses. Since the seminal work of Rubin (1976), a burgeoning literature on missing values has arisen, with heterogeneous aims and motivations. This led to the development of various methods, formalizations, and tools. For practitioners, it remains nevertheless challenging to decide which method is most suited for their problem, partially due to a lack of systematic covering of this topic in statistics or data science curricula.
To help address this challenge, we have launched the "R-miss-tastic" platform, which aims to provide an overview of standard missing values problems, methods, and relevant implementations of methodologies. Beyond gathering and organizing a large majority of the material on missing data (bibliography, courses, tutorials, implementations), "R-miss-tastic" covers the development of standardized analysis workflows. Indeed, we have developed several pipelines in R and Python to allow for hands-on illustration of and recommendations on missing values handling in various statistical tasks such as matrix completion, estimation and prediction, while ensuring reproducibility of the analyses. Finally, the platform is dedicated to users who analyze incomplete data, researchers who want to compare their methods and search for an up-to-date bibliography, and also teachers who are looking for didactic materials (notebooks, video, slides).
Submitted 17 June, 2024; v1 submitted 13 August, 2019;
originally announced August 2019.
-
Some remarks on the "needle radiation"
Authors:
Istvan Mayer
Abstract:
It is shown that the classical wave equation lacks solutions corresponding to the concept of "needle-radiation", while the simplest augmented version of the wave equation (essentially the Klein-Gordon equation), obtained by adding a linear term, does have such a solution.
Submitted 25 October, 2013; v1 submitted 10 September, 2013;
originally announced September 2013.
-
Is the Spreading of Quantum Mechanical Wave Packets Indeed Inevitable?
Authors:
István Mayer
Abstract:
It is demonstrated that, contrary to the common belief, it is possible to construct solutions of the non-relativistic Schrödinger equation of a free particle that do not exhibit dispersion. However, it seems that no normalizable wave packets can be built up by their use, so the spreading of the wave packets is indeed inevitable.
Submitted 9 October, 2012; v1 submitted 3 September, 2012;
originally announced September 2012.
-
A Connection between Special Theory of Relativity and Quantum Theory
Authors:
I. Mayer
Abstract:
The special theory of relativity does not predict the existence of photons (quanta of electromagnetic radiation). However, it is demonstrated here that it follows from the special theory of relativity that if photons do exist, and we know that they do, then their energy must be proportional to their frequency. This means that the Planck-Einstein formula E=hν follows just from some results of the special theory of relativity and the assumption of the particle-wave duality.
Submitted 30 August, 2012; v1 submitted 13 July, 2012;
originally announced July 2012.