Identification and estimation of mediational effects of longitudinal modified treatment policies

Brian Gilbert corresponding author: [email protected] Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine. Katherine L. Hoffman Department of Epidemiology, Mailman School of Public Health, Columbia University. Nicholas Williams Department of Epidemiology, Mailman School of Public Health, Columbia University. Kara E. Rudolph Department of Epidemiology, Mailman School of Public Health, Columbia University. Edward J. Schenck Department of Medicine, Division of Pulmonary and Critical Care Medicine, NewYork-Presbyterian Hospital/Weill Cornell Medical Center. Iván Díaz Division of Biostatistics, Department of Population Health, New York University Grossman School of Medicine.

(July 1, 2024)

Abstract

We demonstrate a comprehensive semiparametric approach to causal mediation analysis, addressing the complexities inherent in settings with longitudinal and continuous treatments, confounders, and mediators. Our methodology utilizes a nonparametric structural equation model and a cross-fitted sequential regression technique based on doubly robust pseudo-outcomes, yielding an efficient, asymptotically normal estimator without relying on restrictive parametric modeling assumptions. We are motivated by a recent scientific controversy regarding the effects of invasive mechanical ventilation (IMV) on the survival of COVID-19 patients, considering acute kidney injury (AKI) as a mediating factor. We highlight the possibility of “inconsistent mediation,” in which the direct and indirect effects of the exposure operate in opposite directions. We discuss the significance of mediation analysis for scientific understanding and its potential utility in treatment decisions.

Keywords— mediation, machine learning, longitudinal, treatment policies

1 Introduction

Causal mediation analysis, or the identification of the different pathways through which an exposure or treatment (we will use the terms “exposure” and “treatment” interchangeably) can operate, has been increasing in interest across the scientific literature (Nguyen et al., 2021). Although the concept of mediation has been active for at least a century (Wright, 1921), mediation analysis has only relatively recently been incorporated in formal causal models such as the framework of potential outcomes or structural causal models, which allow for nonparametric perspectives, or the study of mediation effects without reference to a particular (e.g., linear) statistical model (Imai et al., 2010).

Even so, the formulation of an estimand of interest in a mediation analysis is not always a simple task. The so-called “natural” mediation effects, where one considers an intervention that partially “blocks” the effect of an exposure by assigning mediator values to their counterfactual values under a fixed exposure level in order to isolate “direct” and “indirect” effects, have been well-studied, and conditions for their identification have been given (Robins and Greenland, 1992; Pearl, 2001). However, those identification conditions are quite strong and unlikely to be satisfied in many instances of real data-generating processes (Avin et al., 2005; Miles et al., 2015). To analyze mediation under more reasonable assumptions, alternative estimands based on stochastic draws from the mediator distribution have been proposed and labeled “interventional effects” (VanderWeele et al., 2014).

Mediation analysis in longitudinal settings is particularly challenging; some existing approaches have relied on parametric model specification (VanderWeele and Tchetgen, 2017; Tai et al., 2022), which leads to a lack of robustness. Zheng and van der Laan (2017) provided a robust machine-learning approach for the estimation of mediation effects with time-varying treatments, confounders, and mediators. This line of research was advanced by Díaz et al. (2023b) which allowed the (interventional) direct effect to include pathways through intermediate confounders. This formulation is desirable if the goal of mediation analysis is to separate effects according to whether they operate through the mediator or not, which is the point of view of the present article.

However, even these strategies that make use of flexible models for estimation still rely on the notion of a “static” treatment regimen, or one in which each value of treatment for all timepoints is explicitly specified in the estimand of interest, which is a significant limitation insofar as real-world decision-making rarely follows this form in the longitudinal setting. That is, even an externally-designed intervention is likely to be responsive, intentionally or otherwise, to variables that are encountered over the course of treatment. A few examples include the administration of antiretroviral drugs in HIV patients, which can be adapted to CD4 cell count measurements (Hernán et al., 2006), and the administration of antihypertensive drugs, which can be adapted to blood pressure measurements (Johnson et al., 2018).

Aside from their lack of applicability, studies of static intervention face various statistical challenges. First, identification of the causal estimand relies on the positivity assumption (i.e., that there is a positive probability of any treatment regime given any level of covariates), which is often implausible, especially for continuous treatments given at multiple timepoints. Likewise, for continuous treatments, the estimation of a dose-response curve (the function of exposure values that yields the population-average outcome conditional on receiving each value) involves complications in terms of convergence rates (standard $\sqrt{n}$ rates are not possible, see Kennedy, 2018) and of the summarization and interpretation of such a high-dimensional target parameter.

By contrast, so-called “dynamic” treatment regimes allow exposure values to change based on observed covariates (Robins et al., 2004). These can be generalized to “stochastic” interventions which allow for randomness in the exposure values, even conditional on all other variables. Furthermore, it may often be of interest to consider a treatment regime that assigns treatment values depending on what treatment value would have occurred in the absence of intervention, or the “natural” value of treatment (Young et al., 2014). Such a regime, which we call a “modified treatment policy,” was considered in Robins et al. (2004), with formalization and estimation strategies for single-timepoint studies to follow in Díaz and van der Laan (2012) and Haneuse and Rotnitzky (2013). Díaz et al. (2021) provide a framework for robust estimation of the effects of modified treatment policies for time-varying treatments.

We synthesize these various strands of the statistical literature into a new strategy for mediation analysis in longitudinal settings, following Díaz and Hejazi (2020) and Hejazi et al. (2023), who studied mediation analysis for single-timepoint stochastic interventions. Specifically, we consider the direct and indirect effects of longitudinal modified treatment policies under interventional mediator distributions, which have not been considered in previous literature. The methods we propose are motivated by an issue that was relevant in the early landscape of COVID-19 management: the use of invasive mechanical ventilation (IMV) in cases of acute respiratory distress syndrome (ARDS). ARDS is a form of acute hypoxic respiratory failure, which is one of the key time-varying features of a patient’s course with severe COVID (see, e.g., Wang et al. (2021)). In the beginning stages of the pandemic, IMV was employed early, or at a lower supplemental oxygen delivery threshold, to manage ARDS, but its usage has been associated with several iatrogenic risks such as ventilator-associated pneumonia (Wicky et al., 2021) and barotrauma (Shrestha et al., 2022). Of interest to the present study is acute kidney injury (AKI), a critical condition that complicates ICU stays and is associated with mortality (Vemuri et al., 2022). As more data emerged and practitioners became more familiar with the pace of disease progression, guidelines shifted towards delaying intubation due to high mortality rates among mechanically ventilated patients and potential secondary complications. To navigate these treatment dynamics, we aim to elucidate the direct and indirect effects of IMV on patient mortality with respect to the AKI pathway. This application offers a compelling case for the use of our methods in understanding the effects of time-varying treatments and the causal pathways involved. We emphasize the phenomenon of “inconsistent mediation” (a situation in which the direct and indirect effects are qualitatively in opposite directions). In the current setting, this would involve (on average) IMV causing death through AKI while (on average) preventing death through other mechanisms. This dichotomy may be relevant for research and clinical practice, if interventions could be in place to mitigate IMV-induced AKI. While this is an active area of research, potential interventions include the administration of diuretics to modulate intravascular volume status (Grams et al., 2011; Glassford and Bellomo, 2011) and pharmacological options as explored by Pickkers et al. (2022). Finally, we also investigate the relationship between baseline covariates and the magnitude of these direct and indirect effects; such relationships can suggest biological pathways as well as potentially inform future treatments.

The current paper is structured as follows. Section 2 introduces the causal framework and notation we will use throughout. Section 3 presents the theory and techniques we will use to learn mediational effects of longitudinal treatment policies. Section 4 gives our analysis of the intubation data, and Section 5 concludes.

2 Model setup

2.1 Notation

Our notation largely mirrors that of Díaz et al. (2021) and Díaz et al. (2023b). We consider data $X_{1},\ldots,X_{n}$ , an i.i.d. sample from a distribution $\mathsf{P}$ . The data are longitudinal, with $X=\allowbreak(L_{1},A_{1},Z_{1},M_{1},\allowbreak L_{2},\ldots,A_{\tau},Z_{% \tau},\allowbreak M_{\tau},L_{\tau+1})$ , where the variables $A$ are the exposure of interest, $L$ are covariates, $M$ are mediators, and $Z$ are intermediate confounders. An intermediate confounder is a variable affected by exposure and which affects the mediator and outcome; see VanderWeele et al. (2014). We let $Y=L_{\tau+1}$ denote the outcome of interest. As in our illustrative application, we will assume that the mediator takes values in a finite set; the supports of all other variables are unrestricted.

For any symbol $W$ , we let $\bar{W}_{t}=(W_{1},\ldots,W_{t})$ (past history) and $\underaccent{\bar}{W}_{t}=(W_{t},\ldots,W_{\tau})$ (future history). We let $\bar{W}=\bar{W}_{\tau}=\underaccent{\bar}{W}_{1}$ , the full history. We also define $H_{A,t}=(\bar{L}_{t},\bar{M}_{t-1},\bar{Z}_{t-1},\bar{A}_{t-1})$ ; this is the history of all variables prior to $A_{t}$ . Similarly, $H_{Z,t}$ is the history of all variables prior to $Z_{t}$ , and $H_{M,t}$ and $H_{L,t}$ are defined likewise. Finally, $\mathsf{g}_{A,t}(a_{t}\mid h_{A,t})$ is the probability density or probability mass function of treatment at time $t$ , given the variable history $H_{A,t}=h_{A,t}$ . Similarly, $\mathsf{g}_{M,t}(m_{t},h_{M,t})$ gives the conditional probability mass function of the time-varying mediator.

2.2 Structural equation model and definition of causal effects

We consider causal effects defined within a nonparametric structural equation model, which consists of exogenous and endogenous variables and unknown deterministic functions governing the causal relations between them, together with independence assumptions on the exogenous variables (Pearl, 2000). We allow for arbitrary causal relations among observed variables that respect their time-ordering; this can be visualized in the directed acyclic graph (DAG) given in Figure 1, reproduced from Díaz et al. (2023b).

Figure 1: Causal DAG for the model of time-varying mediation. For clarity, the symbol

\boldsymbol{\mapsto}

is used to indicate arrows from all previous nodes (to the left) to all following nodes (to the right).

To formalize the dependence structure, we posit functions $f$ such that for each $t$ , we have $A_{t}=f_{A,t}(H_{A,t},U_{A,t})$ , $Z_{t}=f_{Z,t}(H_{Z,t},U_{Z,t})$ , $M_{t}=f_{M,t}(H_{M,t},U_{M,t})$ , and $L_{t}=f_{L,t}(H_{L,t},U_{L,t})$ . The variables $U=(U_{A,t},U_{Z,t},U_{M,t},U_{L,t},U_{Y}:t\in\{1,\ldots,\tau\})$ are unobserved “exogenous” variables. In the general model, there are no restrictions on the distribution of $U$ , but in Section 3, they will be subject to assumptions to enable identification of the estimands of interest.

This setup allows us to speak of causal effects as the interventions on some functions $f$ while kee** the remaining structure of the model intact. For example, the simple intervention to set $A_{1}$ to the value $a_{1}$ replaces $f_{A,1}$ with the constant function $a_{1}$ . In this paper, we are interested in the effects of interventions on the treatment variables $\bar{A}$ , and we will analyze its decomposition through pathways that include mediating variables $\bar{M}$ and through pathways that do not. For this purpose, we will consider interventions on the functions $f_{A,t}$ and $f_{M,t}$ for $t=1,\ldots,\tau$ .

For interventions on the treatment variable, we will consider longitudinal modified treatment policies (LMTPs) (Díaz et al., 2021), which are hypothetical interventions defined in terms of a sequence of functions $\mathbbm{d}=(\mathbbm{d}_{1},\ldots,\mathbbm{d}_{\tau})$ , where $\mathbbm{d}_{t}$ is a function $\mathbbm{d}_{t}(a_{t},h_{A,t},\epsilon_{t})$ that depends on a treatment value $a_{t}$ , a history value $h_{A,t}$ , and possibly a randomizer $\epsilon_{t}$ . The intervention is defined by removing the equation $A_{t}=f_{A_{t}}(H_{t},U_{A,t})$ sequentially (from $t=1$ to $t=\tau$ ) from the structural model, and replacing it with evaluations of the functions $\mathbbm{d}_{t}$ as follows. At the first timepoint, the LMTP assigns the exposure as a new random variable $A_{1}^{\mathbbm{d}}=\mathbbm{d}_{t}(A_{1},H_{A,1},\epsilon_{1})$ . This generates a counterfactual treatment $A_{2}(\bar{A}_{1}^{\mathbbm{d}})=f_{A_{2}}(H_{A,2}(\bar{A}_{1}^{\mathbbm{d}}),% U_{A,1})$ that would have been observed if the intervention had been discontinued right before $t=2$ . Then, the intervention at $t=2$ assigns treatment as $A_{2}^{\mathbbm{d}}=\mathbbm{d}_{2}(A_{2}(\bar{A}_{1}^{\mathbbm{d}}),H_{A,2}(% \bar{A}_{1}^{\mathbbm{d}}),\epsilon_{2})$ . In general, an intervention that sets the exposure values up to time $t-1$ to $\bar{A}_{t-1}^{\mathbbm{d}}$ generates counterfactual variables $L_{t}(\bar{A}_{t-1}^{\mathbbm{d}})=f_{L_{t}}(A_{t-1}^{\mathbbm{d}},H_{t-1}(% \bar{A}_{t-2}^{\mathbbm{d}}),U_{L,t})$ , $M_{t}(\bar{A}_{t-1}^{\mathbbm{d}})$ , and $Z_{t}(\bar{A}_{t-1}^{\mathbbm{d}})$ . In addition, $A_{t}(\bar{A}_{t-1}^{\mathbbm{d}})=f_{A_{t}}(H_{t}(\bar{A}_{t-1}^{\mathbbm{d}}% ),U_{A,t})$ , $A_{t}(\bar{A}_{t-1}^{\mathbbm{d}})$ , termed the “natural value of treatment” (Richardson and Robins, 2013; Young et al., 2014), “represents the value of treatment that would have been observed at time $t$ under an intervention carried out up until time $t-1$ but discontinued thereafter” (Díaz et al., 2021). The intervention at time $t$ is given by $A_{t}^{\mathbbm{d}}=\mathbbm{d}_{t}(A_{t}(\bar{A}_{t-1}^{\mathbbm{d}}),H_{A,t}% (\bar{A}_{t-1}^{\mathbbm{d}}),\epsilon_{t})$ .

The intervention on $\bar{M}$ is defined in terms of a stochastic intervention. Specifically, let $M_{t}(\bar{A}^{\mathbbm{d}})=f_{M,t}(A_{t}^{\mathbbm{d}},H_{M,t}(\bar{A}_{t}^{% \mathbbm{d}}),U_{M,t})$ denote the mediator at time $t$ observed under the above LMTP, and let $\bar{J}(\bar{A}^{\mathbbm{d}})$ denote a random draw from the distribution of $\bar{M}(\bar{A}^{\mathbbm{d}})$ , potentially conditioning on baseline variables $L_{1}$ . We consider interventions where $\bar{M}$ is replaced by this random draw.

A causal “effect” is typically conceived as some difference in outcome distributions under different interventions. To simplify notation throughout, we will consider two user-specified, fixed interventions $\mathbbm{d}^{\star}=(\mathbbm{d}^{\star}_{1},\ldots,\mathbbm{d}^{\star}_{\tau})$ and $\mathbbm{d}^{\prime}=(\mathbbm{d}^{\prime}_{1},\ldots,\mathbbm{d}^{\prime}_{% \tau})$ , though the generalization of the theory to arbitrary $\mathbbm{d}$ is immediate. We define the randomized modified treatment policy effect as $\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d}^{% \prime}}))-Y(\bar{A}^{\mathbbm{d}^{\star}},\bar{J}(\bar{A}^{\mathbbm{d}^{\star% }}))]$ and decompose it into an interventional direct effect

DE=\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d}^{% \star}}))-Y(\bar{A}^{\mathbbm{d}^{\star}},\bar{J}(\bar{A}^{\mathbbm{d}^{\star}% }))]

(1)

and indirect effect

IE=\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d}^{% \prime}}))-Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d}^{% \star}}))],

(2)

which sum to the “overall” or “total” effect comparing $\mathbbm{d}^{\prime}$ to $\mathbbm{d}^{\star}$ . Thus, the task at hand is to estimate expressions of the form $\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d}^{% \star}}))]$ , for general $\mathbbm{d}^{\star}$ and $\mathbbm{d}^{\prime}$ .

3 Identification and estimation of interventional mediation effects

3.1 Assumptions and sequential regression

In what follows, it will be necessary to reference variable histories under interventions. For example, we let $H_{L,t}^{\prime}=(\bar{L}_{t-1},\bar{M}_{t-1},\bar{Z}_{t-1},\bar{A}_{t-1}^{d^{% \prime}})$ . Intervened histories for other variables are defined analogously, as are counterfactual histories under the other intervention $\mathbbm{d}^{\star}$ , denoted with $H^{\star}$ . Similarly, to ease notation, we let $A_{t}^{\prime}=\mathbbm{d}_{t}^{\prime}(A_{t},H_{A,t},\epsilon_{t})$ , with $A_{t}^{\star}$ defined similarly.

To identify the parameter, $\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d}^{% \star}}))]$ , the following assumptions, which combine elements of the identification assumptions from Díaz et al. (2021) and Díaz et al. (2023b), are sufficient. Note that A1 involves future histories rather than single timepoints only.

A1Conditional exchangeability of treatment and mediator.

Assume:

(i)

$U_{A,t}\mbox{$\perp\!\!\!\perp$}(\underline{U}_{Z,t},\underline{U}_{M,t},% \underline{U}_{L,{t+1}},\underline{U}_{A,t+1})\mid H_{t}$ for all $t\in\{1,\ldots,\tau\}$ ;
(ii)

$U_{M,t}\mbox{$\perp\!\!\!\perp$}(\underline{U}_{L,{t+1}},\underline{U}_{A,t+1}% ,\underline{U}_{Z,{t+1}})\mid H_{t}$ for all $t\in\{1,\ldots,\tau\}$ .

A2Positivity of treatment and mediator assignment.

Assume:

(i)

If $(a_{t},h_{A,t})\in\mathop{\mathrm{supp}}\{A_{t},H_{A,t}\}$ then $(\mathbbm{d}^{\star}(a_{t},h_{A,t}),h_{A,t})\in\mathop{\mathrm{supp}}\{A_{t},H% _{A,t}\}$ for $t\in\{1,\ldots,\tau\}$ , and similarly for $\mathbbm{d}^{\prime}$ ;
(ii)

If $h_{M,t}\in\mathop{\mathrm{supp}}\{H_{M,t}\}$ then $(m_{t},h_{M,t})\in\mathop{\mathrm{supp}}\{M_{t},H_{M,t}\}$ for $t\in\{1,\ldots,\tau\}$ and $m_{t}\in\mathop{\mathrm{supp}}\{M_{t}\}$ .

If $A_{t}$ is not a discrete random variable, one further condition is required.

A3Piecewise-smooth invertibililty of treatment modification.

If $A_{t}$ is not a discrete random variable, the conditional support of $A_{t}$ given $H_{A,t}=h_{A,t}$ admits a partition into subintervals where the restrictions of $\mathbbm{d}^{\prime}(\cdot,h_{a,t})$ and $\mathbbm{d}^{\star}(\cdot,h_{a,t})$ to each subinterval have differentiable inverse functions.

Our identification strategy makes use of a representation of longitudinal causal estimands by means of “sequential regression” as proposed by Bang and Robins (2005) and further investigated by van der Laan and Gruber (2012); Luedtke et al. (2017), and Rotnitzky et al. (2017). This approach considers a recursive regression of outcomes (or pseudo-outcomes) onto previous variables, starting at $t=\tau$ and continuing backward to $t=0$ . As in Díaz et al. (2023b), we split the recursion over the two non-interventional variables $L$ and $Z$ as follows. We consider fixed, pre-specified interventions $\mathbbm{d}^{\prime}$ , $\mathbbm{d}^{\star}$ , and $\bar{m}$ . Starting with, $\mathsf{Q}_{Z,\tau+1}=Y$ , define

	$\displaystyle\mathsf{Q}_{L,t}(\bar{h}_{M,t},\underaccent{\bar}{m}_{t})$	$\displaystyle=\mathsf{E}[\mathsf{Q}_{Z,t+1}(A{{}^{\prime}}_{t+1},H_{A,t+1},% \underaccent{\bar}{m}_{t+1})\mid M_{t}=m_{t},H_{M,t}=h_{M,t}]$		(3)
	$\displaystyle\mathsf{Q}_{Z,t}(a_{t},\bar{h}_{A,t},\underaccent{\bar}{m}_{t})$	$\displaystyle=\mathsf{E}[\mathsf{Q}_{L,t}(\bar{H}_{M,t},\underaccent{\bar}{m}_% {t})\mid A_{t}=a_{t},H_{A,t}=h_{A,t}].$		(4)

Similarly, let $\mathsf{Q}_{M,\tau+1}=1$ and recursively define

\mathsf{Q}_{M,t}(a_{t},\bar{h}_{A,t},\underaccent{\bar}{m}_{t})=\mathsf{E}[% \mathds{1}\{M_{t}=m_{t}\}\mathsf{Q}_{M,t+1}(A_{t}^{\star},\bar{H}_{A,t+1},% \underaccent{\bar}{m}_{t+1})\mid A_{t}=a_{t},H_{A,t}=h_{A,t}].

When the meaning is clear, noting that the only non-random component of the functions $\mathsf{Q}$ are the user-specified $\mathbbm{d}^{\prime},\mathbbm{d}^{\star}$ , and $\bar{m}$ , we may use the abbreviations $\mathsf{Q}_{L,t}^{\prime}(\underaccent{\bar}{m}_{t})=\mathsf{Q}_{L,t}(\bar{H}_% {M,t},\underaccent{\bar}{m}_{t})$ , $\mathsf{Q}_{Z,t}^{\prime}(\underaccent{\bar}{m}_{t})=\mathsf{Q}_{Z,t}(A^{{}^{% \prime}}_{t},\bar{H}_{A,t},\underaccent{\bar}{m}_{t})$ , and $\mathsf{Q}_{M,t}^{\star}(\underaccent{\bar}{m}_{t})=\mathsf{Q}_{M,t}(A_{t}^{% \star},\bar{H}_{A,t},\underaccent{\bar}{m}_{t})$ . We give the following identification result, which is a generalization of Theorem 1 of Díaz et al. (2023b).

Theorem 1 (Identification).

Under Assumptions A1 and A2, the interventional parameter $\theta=\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d% }^{\star}}))]$ is identified as $\theta=\sum_{\bar{m}}\varphi(\bar{m})\lambda(\bar{m})$ , where $\varphi(\bar{m})=\mathsf{Q}_{L,0}^{\prime}(\bar{m})$ and $\lambda(\bar{m})=\mathsf{Q}_{M,0}^{\star}(\bar{m})$ .

Proof See Supplement Section S2. ∎

3.2 Inverse probability weighted identification

As is often the case in causal inference problems, it is also possible to identify effects of interest by using the treatment rather than the outcome mechanism. Let $\mathsf{g}_{t}^{\prime}(a_{t}\mid h_{A,t})$ and $\mathsf{g}_{t}^{\star}(a_{t}\mid h_{A,t})$ denote the density functions of $A^{\prime}_{t}=\mathbbm{d}^{\prime}(A_{t},H_{A,t},\epsilon_{t})$ and $A^{\star}_{t}=\mathbbm{d}^{\star}(A_{t},H_{A,t},\epsilon_{t})$ , respectively, conditional on $H_{A,t}=h_{A,t}$ . Then we define

\mathsf{G}_{A,t}^{\prime}(A_{t},H_{A,t})=\frac{\mathsf{g}_{t}^{\prime}(A_{t}% \mid H_{A,t})}{\mathsf{g}_{t}(A_{t}\mid H_{A,t})},\,\,\mathsf{G}_{A,t}^{\star}% (A_{t},H_{A,t})=\frac{\mathsf{g}_{t}^{\star}(A_{t}\mid H_{A,t})}{\mathsf{g}_{t% }(A_{t}\mid H_{A,t})},\,\,\,\mathsf{G}_{M,t}(H_{M,t},m_{t})=\frac{\mathds{1}\{% M_{t}=m_{t}\}}{\mathsf{g}_{M,t}(M_{t}\mid H_{M,t})},

as well as $\mathsf{K}_{l,u}^{\prime}=\prod_{r=l}^{u}\mathsf{G}_{A,r}^{\prime}$ , $\mathsf{K}_{l,u}^{\star}=\prod_{r=l}^{u}\mathsf{G}_{A,r}^{\star}$ , $\mathsf{H}_{l,u}=\prod_{r=l}^{u}\mathsf{G}_{M,r}$ . (For notational brevity, we have omitted the dependence on $A_{t},$ , $H_{A,t}$ , $H_{M,t}$ , and $m_{t}$ .) Then, the parameters from Theorem 1 can also be identified as $\varphi(\bar{m})=\mathsf{E}\{\mathsf{K}_{1,\tau}^{\prime}\mathsf{H}_{1,\tau}Y\}$ ; $\lambda(\bar{m})=\mathsf{E}\{\mathsf{K}_{1,\tau}^{\star}\mathds{1}\{\bar{M}=% \bar{m}\}\}$ ; see Supplement Section S3 for the derivation.

3.3 Estimation

The foregoing analysis allows us to construct estimators of the parameter $\theta$ given in Theorem 1 with desirable statistical properties such as asymptotic normality and efficiency. The theory of semiparametric inference allows for the analysis of estimators of parameters (e.g., causal effects) without restricting $\mathsf{P}$ to a parametric model. Standard references include van der Vaart (1998) and Bickel et al. (1997). In our case, we are particularly interested in nonparametric inference which places no restrictions on $\mathsf{P}$ other than the identification assumptions, particularly those related to positivity.

A given estimator $\hat{\theta}$ of $\theta$ is consistent and asymptotically linear if for some function $\mathsf{S}$ (which may depend on nuisance parameters) of the observations $O$ we have

\sqrt{n}(\hat{\theta}-\theta)=\frac{1}{\sqrt{n}}\sum_{i=1}^{n}\mathsf{S}(O_{i}% )+o_{P}(1)

By the central limit theorem, such an estimator converges to a normal random variable with variance $\operatorname{\mathsf{Var}}[\mathsf{S}(O)]$ at the rate of $\sqrt{n}$ . The function $\mathsf{S}$ is called the influence function of the estimator, and this convergence allows the construction of Wald-type confidence intervals. A central object in semiparametric theory is the efficient influence function, the variance of which provides an asymptotic lower bound on the variance of consistent regular and asymptotically linear estimators of a given parameter. Thus, if one can prove that a proposed estimator is asymptotically linear and has variance equal to the efficient influence function (this occurs when $\mathsf{S}$ is the efficient influence function), then that estimator is a strong candidate for use in data analyses.

A considerable body of literature has studied semiparametric estimation of total causal effects of static and dynamic interventions using sequential regression (e.g., Robins, 2000; van der Laan and Robins, 2003; Bang and Robins, 2005; van der Laan and Rubin, 2006; van der Laan and Rose, 2011; Luedtke et al., 2017; Rotnitzky et al., 2017; van der Laan and Rose, 2018). Typically, the approach involves the estimation of both the regression functions (in this case, the prediction of $Z,M,$ and $L$ conditional on their past) and the probability of treatment. These procedures have the property that the rate of convergence of the estimator for the treatment effect is equal to the product of the rates of convergences of the outcome and treatment estimators; this allows flexible machine learning models that converge at rates slower than “parametric” $\sqrt{n}$ rate, as long as their product converges at $\sqrt{n}$ rate (e.g., each nuisance parameter can be estimated at $n^{1/4}$ rate). In particular, if either model is consistent, then the treatment effect estimator is consistent.

As given below, the algebraic form of the efficient influence function for our parameter is identical to that of the parameter studied in Díaz et al. (2023b), except that the definitions of the IPW functions ( $\mathsf{G}^{\prime}$ and $\mathsf{G}^{\star}$ ) and the iterative regressions ( $\mathsf{Q}$ -functions) change to accommodate the LMTP generalization. The proofs and arguments for all the following claims mirror those of Díaz et al. (2023b) and are therefore omitted here. To give an expression for the efficient influence function for $\theta$ , we must introduce the following functions, which can be understood intuitively as doubly robust analogs of the $\mathsf{Q}$ - functions (see Rubin and van der Laan (2007)).

\mathsf{D}_{L,t}(\underaccent{\bar}{X}_{t},\underaccent{\bar}{m}_{t})=\sum_{s=% t}^{\tau}\mathsf{K}^{{}^{\prime}}_{t+1,s}\mathsf{H}_{t,s}\{\mathsf{Q}_{Z,s+1}-% \mathsf{Q}_{L,s}\}+\sum_{s=t+1}^{\tau}\mathsf{K}^{{}^{\prime}}_{t+1,s}\mathsf{% H}_{t,s-1}\{\mathsf{Q}_{L,s}-\mathsf{Q}_{Z,s}\}+\mathsf{Q}_{L,t}

(5)

\mathsf{D}_{Z,t}(\underaccent{\bar}{X}_{t},\underaccent{\bar}{m}_{t})=\sum_{s=% t}^{\tau}\mathsf{K}^{{}^{\prime}}_{t,s}\mathsf{H}_{t,s}\{\mathsf{Q}_{Z,s+1}-% \mathsf{Q}_{L,s}\}+\sum_{s=t}^{\tau}\mathsf{K}^{{}^{\prime}}_{t,s}\mathsf{H}_{% t,s-1}\{\mathsf{Q}_{L,s}-\mathsf{Q}_{Z,s}\}+\mathsf{Q}_{Z,t}

(6)

\mathsf{D}_{M,t}(\underaccent{\bar}{X}_{t},\underaccent{\bar}{m}_{t})=\sum_{s=% t}^{\tau}\mathsf{K}^{*}_{t,s}[\prod_{k=t}^{s-1}\mathds{1}(M_{k}=m_{k})][% \mathds{1}(M_{s}=m_{s})\mathsf{Q}_{M,s+1}-Q_{M,s}]+Q_{M,t}

(7)

Then, we have the following:

Theorem 2 (Efficient influence function for $\theta$ ).

The efficient influence function for $\theta$ in the nonparametric model is given by

\mathsf{S}(X,\eta)=\sum_{\bar{m}\in\bar{\mathcal{M}}}\left[\{\mathsf{D}_{Z,1}(% X,\bar{m};\eta)-\varphi(\bar{m})\}\lambda(\bar{m})+\{\mathsf{D}_{M,1}(X,\bar{m% };\eta)-\lambda(\bar{m})\}\varphi(\bar{m})\right].

Proof See the proof of Theorem 2 of Díaz et al. (2023b). ∎

The algorithm used to estimate $\theta$ is given as Algorithm 1 in Díaz et al. (2023b), where in our case the IPW weights are replaced with probability ratios as described in Section 3.2. The algorithm consists of the following steps. First, we set $\hat{\mathsf{D}}_{Z,\tau+1}=Y$ and $\hat{\mathsf{D}}_{M,\tau+1}=1$ . Then, beginning at $t=\tau$ , we calculate $\hat{\mathsf{Q}}_{L,t}$ as the nonparametric regression prediction of $\hat{\mathsf{D}}_{Z,t+1}$ , onto preceding variables. Then, we nonparametrically estimate the functions $\mathsf{g}$ and $\mathsf{G}$ (as in Section 3.2), and $\hat{\mathsf{D}}_{L,t}$ is calculated by plugging in all the nuisance functions estimated so far into its definition (in equation 5). Similarly, $\hat{\mathsf{Q}}_{Z,t}$ comes from a nonparametric regression of $\hat{\mathsf{D}}_{L,t}$ , and $\hat{\mathsf{Q}}_{M,t}$ comes from a nonparametric regression of $\mathbbm{I}(M_{t}=m_{t})\hat{\mathsf{D}}_{M,t+1}$ . Then $\hat{\mathsf{D}}_{Z,t}$ is estimated from equation 6 and $\hat{\mathsf{D}}_{M,t}$ is estimated from equation 7. This is repeated for $t=\tau-1,\tau-2,\dots,1$ . Finally, the mean of $\hat{\mathsf{D}}_{Z,1}$ is used as an estimator for $\phi$ , and the mean of $\hat{\mathsf{D}}_{M,1}$ is taken as an estimator of $\lambda$ , which are plugged into the identifying formula of Theorem 1 to estimate $\theta$ . (Note that the use of the $\mathsf{D}$ -functions in this way is motivated by Lemma 1 of Díaz et al. (2023b).) The estimated variance of the efficient influence function (Theorem 2) is used to estimate the standard error. The procedure makes use of cross-fitting in all regression functions to ensure that the asymptotic linearity result above holds without restrictions on the complexity of the form of the machine learning estimators, thereby yielding valid statistical inference under mild conditions (Zheng and van der Laan, 2011; Chernozhukov et al., 2018).

Our estimation algorithm has been implemented in the open-source R package lcmmtp (Williams and Díaz, 2023).

4 Motivating application

4.1 Background

Acute respiratory distress syndrome (ARDS) is a major cause of morbidity and mortality among COVID-19 patients. Treating ARDS often requires the use of respiratory support devices, ranging from nasal cannulae and face masks to more invasive methods such as mechanical ventilation via endotracheal tubes (“invasive mechanical ventilation” or IMV) (Hasan et al., 2020). Given the physiological links between the lungs and kidneys, it has been suggested that mechanical ventilation may cause acute kidney injury/failure (AKI); possible mechanisms include oxygen toxicity and capillary endothelial damage leading to inflammation, hypotension, and sepsis (Durdevic et al., 2020). AKI, which can render kidneys incapable of appropriately clearing critical toxins from the body or maintaining appropriate blood and interstitial volume, complicates about 30% of ICU admissions and increases the risk of in-hospital mortality and long-term morbidity and mortality (Kes and Jukić, 2010). While IMV may often be necessary to support respiration even if it can lead to AKI (Husain-Syed et al., 2016), the decision whether and when to intubate must be based on clinical factors such as oxygen saturation, dyspnea, respiratory rate, and chest radiography, weighing in the benefit of reducing ARDS-related mortality with the possibility of harm through AKI (Tobin, 2020; Thomson and Calligaro, 2021).

In the early weeks of the COVID-19 pandemic, recommendations from international health organizations advocated for early intubation of patients in an effort to safeguard healthcare workers from contracting infection and to reduce complications resulting from late intubations when the patient is unresponsive or apneic (Papoutsi et al., 2021). Over time, however, as clinicians became more familiar with the progression of the disease, guidance evolved toward postponing intubation, partially driven by reports showing high mortality rates for mechanically ventilated patients, potentially attributable to heightened risk of secondary infections, lung injury, and damage to other organs including the kidneys, due to ventilation (Tobin, 2006; Bavishi et al., 2021).

We illustrate our methods in this application with the goal of providing insight into effects operating through the pathway IMV $\to$ AKI $\to$ Death and effects operating directly through IMV $\to$ Death. We answer the following question, which has been cited by an expert panel on lung-kidney interactions in critically ill patients as an area that demands further research (Joannidis et al., 2020): What is the effect of invasive mechanical ventilation on death among COVID-19 patients, and how much of it operates through causing acute kidney injury?

We use a dataset consisting of approximately 3,300 patients who did not have a previous history of chronic kidney disease (CKD) and who were hospitalized with COVID-19 at the NewYork-Presbyterian Cornell, Queens, or Lower Manhattan hospitals between March 3^rd and May 15^th, 2020. The analytical dataset was created in a two-step approach. First, data pertaining to demographics, comorbidities, intubation, mortality, and discharge were gathered from electronic health records chart reviews and stored in a secure REDCap database (Goyal et al., 2020). To this data were added items from the Weill Cornell Critical carE Database for Advanced Research (WC-CEDAR), a data repository containing patient information collected over the course of care, including procedures, diagnoses, medications, and laboratory data (Schenck et al., 2021). Note that while all patients were hospitalized with COVID-19, they did not all necessarily suffer from ARDS specifically.

This data was previously described and studied in Díaz et al. (2023a) and Hoffman et al. (2024). The latter considered only effects on mortality without regard to AKI. The former treated AKI and other-cause mortality as competing risks; we take a different perspective by considering AKI as a mediator in the pathway from IMV to mortality. This orientation presents a more intricate view of the causal connections among these variables. Essentially, AKI and mortality are not only competing risks; they can jointly occur, with AKI potentially influencing mortality. By examining AKI’s mediating role, we aim to unravel the indirect ways in which IMV may contribute to mortality via AKI. Should this causal pathway exist, it could provide novel opportunities for medical interventions aimed at preventing or treating AKI, subsequently reducing the mortality rate. (As mentioned in Section 1, these options include the modulation of intravascular volume status (Grams et al., 2011; Glassford and Bellomo, 2011) and various pharmacological interventions (Pickkers et al., 2022).) Further, we analyze how various baseline variables are associated with the estimated direct and indirect effects and discuss how such considerations might inform treatment decisions regarding IMV.

We note that this data, inasmuch as it includes treatment decisions made in the face of considerable uncertainty and unstable guidelines regarding a novel disease, is especially valuable from the point of view of causal inference for observational data. This is because causal inference relies on the assumptions of positivity (that individuals at any covariate level have positive probability of being assigned to either treatment condition) and ignorability (that the treatment-outcome relationship is not confounded by unmeasured variables). If treatment decisions are made based on a generally agreed-upon function of clinical indicators, then one or the other of these conditions is likely to be violated; if all the clinical indicators are included in the data, then positivity is doubtful; if some of the clinical indicators are omitted, then ignorability is doubtful. By contrast, in the current context, there can be enough variation in treatment decision-making to preserve positivity even with many important measured covariates. Practical violations of positivity can be diagnosed from observed data by looking at whether there is overlap in the covariate distributions corresponding to different treatment options (Zhu et al., 2021). Lack of ignorability cannot generally be diagnosed by observed data alone, but “confounding by indication,” where non-adjustment variables influence both treatment indication and outcomes, has been noted as a common issue in observational studies of medical treatments (Kyriacou and Lewis, 2016). For example, observational studies that link selective serotonin reuptake inhibitors to suicidal side effects must account for the fact that patients prescribed treatment for depression are already at a higher risk for such outcomes (Lee and Chang, 2022).

4.2 Time-varying variables, mediator, and modified treatment policy

This study uses data at the daily level, and study time begins on the day of hospitalization. The treatment variable is categorized into three levels: no supplemental oxygen, supplemental oxygen not including IMV, and IMV. The main goal is to estimate the overall causal effect of IMV on mortality/survival rates and to decompose it into effects operating through AKI and effects operating independently of it. To measure the total effect of IMV on survival, we posit an intervention that would delay IMV by one day among patients who received IMV for more than one day and would prevent IMV for patients who received IMV for exactly one day. We refer to this intervention as a “one-day delay in intubation.” Formally, for $A_{t}\in\{0,1,2\}$ corresponding to the end of day $t$ , $0$ indicates no supplemental oxygen, $1$ indicates supplemental oxygen not including IMV, and $2$ indicates IMV. We consider the following intervention:

\mathbbm{d}_{t}(a_{t},h_{t})=\begin{cases}1&\text{ if }a_{t}=2\text{ and }a_{s% }\leq 1\text{ for all }s<t,\\ a_{t}&\text{ otherwise.}\end{cases}

(8)

The structure of the data is as follows. Baseline confounders include age, sex, race, ethnicity, body mass index (BMI), hospital location, home oxygen status, and comorbidities (e.g., hypertension, history of stroke, diabetes mellitus, coronary artery disease, active cancer, cirrhosis, asthma, chronic obstructive pulmonary disease, interstitial lung disease (ILD), HIV infection, and immunosuppression), and are included in $L_{1}$ . Time-dependent confounders $Z_{t}$ and $L_{t}$ include vital signs (e.g., highest and lowest respiratory rate, oxygen saturation, temperature, heart rate, and blood pressure), laboratory results (e.g., C-Reactive Protein, BUN creatinine ratio, activated partial thromboplastin time, creatinine, lymphocytes, neutrophils, bilirubin, platelets, D-dimer, glucose, arterial partial pressure of oxygen, and arterial partial pressure of carbon dioxide). In cases of missing baseline confounders, mean substitution with an additional variable to indicate missingness is used. For missing data at later timepoints, the last observation is carried forward. Patients are censored at their day of hospital discharge, as AKI and vital status were unknown after this point.

In constructing the analytical dataset, it is important that the variables preserve the time-ordering of the DAG in Figure 1. Therefore, we constructed the dataset using the following procedure. First, we categorize patients into three groups: those whose first event was an intubation, those whose first event was AKI, and those who had neither event. For patients in the first group, we anchor the creation of their record on the time at which the intubation event occurred, denoted $t_{I}$ . The variables $L_{t_{I-1}}$ and $Z_{t_{I-1}}$ are then recorded as those observed in approximately 24-hour windows before $t_{I}$ , where $L_{t_{I-1}}$ contains variables measured in the first half of this approximate 24-hour interval, and $Z_{t_{I-1}}$ contains variables measured in the second half of this approximate 24-hour window. The time interval is approximate because the difference in hours between hospital admission and intubation is not necessarily a multiple of 24. The mediator $M_{t_{I-1}}$ is then set as $0$ , and the exposure $A_{t_{I-1}}$ is recorded as the respiratory support in the previous 24-hour window. This process is iterated until we have data for all the timepoints prior to $t_{I}$ . For times greater than or equal to $t_{I}$ , we follow a similar approach, but in addition to capturing $Z_{t}$ , $L_{t}$ , and $A_{t}$ , we measure whether the patient was diagnosed with AKI at every time window. AKI was defined using creatinine values in accordance with the “Kidney Disease: Improving Global Outcomes” definition (KDIGO) definition (Khwaja, 2012). Either of the following criteria was required: (a) serum creatinine change of greater than or equal to 0.3 mg/dL within 48 h, or (b) serum creatinine greater than or equal to 1.5 times the baseline serum creatinine known or assumed to have occurred within the past 7 days.

If and when a patient is diagnosed with AKI (call this time $t_{M}$ ), we begin a new 24-hour interval, i.e. $L_{t_{M}}$ , to ensure that the temporality of variables measured after $t_{M}$ is maintained. A similar procedure was used for patients whose first event was the diagnosis of AKI. Data for patients in the third group, who had neither event, was simply divided into approximate 24-hour time intervals using the anchor times of hospital admission and censoring or death date. During this process, we ensured that measurement of $L_{t}$ precedes $Z_{t}$ within each time interval. A GitHub repository of analysis code is available at https://github.com/CI-NYC/lcmmtp-application.

4.3 Results

We used our proposed methods to estimate the total, direct, and indirect effects on mortality of invasive mechanical ventilation through acute kidney failure. All regressions were estimated using lcmmtp (Williams and Díaz, 2023) and Super Learner (Van der Laan et al., 2007), building a convex combination of three predictors: generalized linear models with $\ell_{1}$ penalty, multivariate adaptive regression splines, and extreme gradient boosting of regression trees. To speed up computations, we used three folds for cross-fitting and used four-day mortality as our outcome of interest. Some observations had large weights $\mathsf{K}_{t,s}$ and $\mathsf{H}_{t,s}$ , so we truncated the corresponding estimates at their 99% quantiles to avoid empirical violations of the positivity assumption. The results of the main analysis are presented in Table 1, where we contrast the mortality under an intervention to prevent censoring versus the mortality under an intervention to prevent censoring and to delay intubation by one day as specified above. Specifically, we estimate the total effect defined as $\mathsf{E}[Y(\bar{A}^{\mathbbm{d}},\bar{J}(\bar{A}^{\mathbbm{d}}))-Y(\bar{A},% \bar{J}(\bar{A}))]$ , where $\mathbbm{d}$ is defined in (8) and $Y$ is four-day survival rate under an intervention to prevent censoring. The indirect effect is thus defined as $\mathsf{E}[Y(\bar{A},\bar{J}(\bar{A}^{\mathbbm{d}}))-Y(\bar{A},\bar{J}(\bar{A}% ))]$ , and the direct effect is $\mathsf{E}[Y(\bar{A}^{\mathbbm{d}},\bar{J}(\bar{A}^{\mathbbm{d}}))-Y(\bar{A},% \bar{J}(\bar{A}^{\mathbbm{d}}))]$ .

	Effect	Standard Error
Total	0.059	0.038
Direct (i.e., not through AKI)	-0.024	0.039
Indirect (i.e., through AKI)	0.083	0.027

Table 1: Total, direct, and indirect effect of delaying invasive mechanical ventilation by one day on survival operating through acute kidney failure (presented as intervention minus baseline; positive values indicate that delaying intubation is beneficial).

Although there is high uncertainty in the results, the point estimates support the hypothesis that invasive mechanical ventilation reduces mortality through mechanisms other than acute kidney failure. However, the effect through acute kidney failure is larger than the effect through all other mechanisms, resulting in an overall harmful effect of invasive mechanical ventilation on mortality. The estimated value of the total effect indicates that delaying intubation by one day would reduce mortality by 5.9 percentage points, with a standard error of 3.8 percentage points.

Next, we aim to identify treatment effect modifiers that can be used to predict which patients are at a higher risk of a harmful effect through AKI, and which patients are likely to benefit due to all other mechanisms. Rigorously estimating interpretable conditional average treatment effects, or treatment effects given some covariate values, is notoriously difficult, especially when continuous covariates are considered (see for example Kennedy et al. (2017) for a review of the state-of-the-art in estimation of conditional effects). For simplicity and interpretability, we can approach this by following a strategy presented in Rudolph and Díaz (2022). Specifically, we regress the influence function (see Theorem 2) for the interventional direct effect of intubation onto baseline covariates. (It is easily shown that the conditional (on covariates) expectation of the (uncentered) influence function for the ATE in standard settings is equal to the conditional average treatment effect; for the extension of this principle to the LMTP case see equation 6 of Díaz et al. (2021). To derive the influence function of the direct and indirect effects, note that the influence function for the difference of two parameters is equal to the difference of the influence functions of those parameters.)

Thus, we estimated univariate effect modification for each baseline variable by linear regression of the efficient influence function of the direct and indirect effect shown in equations 1 and 2 on each variable at a time, where we use the slope of the regression as a measure of effect modification. We show only the ten variables with the largest absolute value for univariate effect modification in Table 2. (If we were predominantly interested in the application of a treatment rule based on mediation effects, as described in Section 4.4, then it would be more appropriate to regress the influence function onto multiple covariates simultaneously; here we focus on the simple regressions to provide a more straightforward summary.)

Variable	Slope
ILD	0.34
Cancer	0.32
COPD	0.25
Lymph. count	0.23
CVA	0.16
CAD	0.15
Current smoker	-0.13
Cirrhosis	-0.12
Creatinine	0.10
Home oxygen	0.10

(a) Effect modifiers for the effect of delaying IMV on survival through AKI.

Variable	Slope
ILD	-1.06
Creatinine	0.69
Asthma	0.69
CVA	0.65
Cirrhosis	0.38
Immunosuppressed	-0.26
Bilirubin	0.25
Former smoker	0.22
Home oxygen	-0.18
White	-0.17

(b) Effect modifiers for the effect of delaying IMV on survival independent of AKI.

Table 2: Most important effect modifiers for the direct and indirect effect of invasive mechanical ventilation on survival through acute kidney failure, as measured by the slope of a simple linear regression of the efficient influence function. Variables with negative slopes indicate populations with lower effects (calculated as intervention minus baseline; positive values indicate that delaying intubation is relatively less harmful/more beneficial).

First, we note that this analysis is exploratory; some subsets may have relatively small sample sizes and we have not calculated standard errors. However, interstitial lung disease (ILD) is the most important variable for effect modification for both the indirect effect through AKI and the direct effect through other mechanisms. The negative effect modification parameter for ILD in Table 2 implies that delaying IMV in patients with ILD has a worse effect on survival (i.e., is more harmful/less beneficial) due to mechanisms other than AKI than in patients without ILD, whereas the positive effect modification parameter in Table 2 means that delaying IMV in patients with ILD has a better effect on survival (i.e., is less harmful/more beneficial) due to AKI than in patients without ILD. The former effect can be explained by the fact that delaying intubation may be riskier in patients with ILD; the latter effect (smaller in size) might be due to the fact that patients with ILD are more likely to experience mortality due to lung failure rather than AKI.

Likewise, Table 2 shows that delaying IMV in patients with cancer, chronic obstructive pulmonary disease (COPD), high lymphocyte count, costovertebral angle tenderness (CVA), coronary artery disease (CAD), high creatinine values, and those who used home oxygen, has a better effect on survival (i.e., is less harmful/more beneficial) due to AKI than in patients without those conditions. Delaying IMV in patients who have cirrhosis or who are current smokers has a worse effect on survival (i.e., is more harmful/less beneficial) due to AKI than in patients without those conditions.

The results in Table 2 show that delaying intubation in patients who are immunosuppressed, use home oxygen, or are white, has a worse effect on survival (i.e., is more harmful/less beneficial) due to mechanisms other than AKI than in patients outside those categories. Lastly, delaying intubation in patients with high creatinine values, asthma, CVA, cirrhosis, high bilirubin, and who are former smokers, has a better effect on survival (i.e., is less harmful/more beneficial) due to mechanisms other than AKI than in patients without those conditions.

We emphasize that given the high standard errors in the effect estimates, this summary should be seen as an exploratory demonstration of our methodology; further research is needed to establish firm scientific conclusions about the effects of the timing of IMV.

4.4 Mediation and treatment decisions

Here, we present a conjectured approach for how mediation analysis might inform treatment decisions. For the purpose of exposition, consider a setting with a single timepoint where we label the observed data as $O=(W,A,M,Y)$ , where $A$ is intubation status, $M$ is the occurrence of AKI, $W$ are a set of pre-treatment confounders, and $Y$ is survival. In studies of medical interventions, it is often of interest to decide an optimal treatment rule; that is, a function $d(W)$ map** covariates to treatment values that maximize the probability of survival of a patient presenting with measured variables $W$ . However, in the case of critical care for a relatively new and evolving disease, doctors would likely prefer to incorporate contextual indicators, patient-specific factors, and clinical judgment rather than to strictly rely on a mathematical rule constructed from previous data. (See Gallifant et al. (2022) for a critical review of existing academic literature promoting the use of artificial intelligence for mechanical ventilation decisions and Mathur and Burns (2019) for an overview of professional challenges that ICU physicians might face in applying algorithmic systems to their practice.) On the other hand, a mediation analysis of the kind we are proposing may be useful in providing a partial treatment rule, insofar as it might identify a subset of patients who should not receive intubation (i.e., those whose respiratory failure might recover without IMV and for whom IMV would introduce unnecessary risks), while leaving undecided the treatment given to patients outside this subset.

Specifically, we note again that the overall effect of intubation on survival can be represented as a combination of the effects through the mediating variable (AKI), and “direct” effects that do not operate through the mediator. (Because we are estimating “interventional” rather than “natural” effects (VanderWeele et al., 2014; Miles et al., 2015), this “overall effect” is not quite equal to the ordinary average treatment effect of intubation on survival, but it can be thought of in a similar way as a heuristic approximation; as described in Section 1, this substitution is done for the purpose of identification.) In principle, both the indirect and direct effects could be beneficial (increasing probability of survival) or harmful (decreasing probability of survival). When these effects are in contrary directions, it may be said that the process exhibits “inconsistent mediation” (MacKinnon et al., 2000). In the current clinical context, we might discount the existence of beneficial indirect effects (i.e., that intubation causes survival by preventing AKI), even at the individual level. Then, for any individual predicted to experience a harmful direct effect of treatment, the total effect of intubation on survival can be expected to be at least as poor. This line of reasoning provides a conservative basis for excluding a subset of patients from treatment.

By contrast, one could attempt to estimate the total effect for all patients and exclude any patients with a negative value, but this shuts out the possibility of additional decision-making input from physicians and ignores the possibility of ameliorating the indirect effects (i.e., that additional interventions could be put in place to mitigate the probability of AKI for patients under intubation).

While we are not advocating that such a treatment rule be applied immediately to patients similar to the ones under study, we believe the information gleaned from this analysis could be useful in future research. In particular, whether beneficial indirect effects could be plausible for some individuals, and what interventions could be put in place to prevent AKI in intubated individuals, do not appear to be well-established in the literature, but can be informed from such analyses. Aside from the current application, this framework could be useful for any treatment scenario where such conditions (the ability to mitigate harmful indirect effects and/or the absence of beneficial indirect effects) are fulfilled.

5 Discussion

Causal mediation analysis is a complex problem, especially in settings with time-varying and/or continuous exposures, mediators, and confounders; we have provided a framework and technique to handle these intricacies. We have argued that mediation analysis can be helpful in clinical decision-making and in formulating preliminary models of causal mechanisms. In addition, we have highlighted the medical importance of inconsistent mediation and its potential role in informing critical care. For this purpose, estimating conditional effects can help to identify the relevant subgroups and to understand the biological factors driving the inconsistency. The application to COVID-19 hospital data, while leaving much room for uncertainty, lends some credence to concerns regarding risks of early intubation through acute kidney injury. Further research might investigate approximate methods to ameliorate the computational costs associated with the sequential regression, which might grow quickly with the number of timepoints.

Supplementary Materials for
Identification and estimation of mediational effects of longitudinal modified treatment policies

S1 Technical note on notation

In the proofs that follow, any variable with index $t\leq 0$ should be interpreted as null, and a distribution conditional on null variables are marginal. Any summation expression with no terms (e.g., $\sum_{t+1}^{t}W_{t}$ ) are equal to zero, and any product with no factors is equal to one.

S2 Identification (Theorem 1)

Proof By the law of total expectation, we can write

\theta=\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{\mathbbm{d% }^{\star}}))]

=\mathsf{E}[\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{% \mathbbm{d}^{\star}}))\mid J(\bar{A}^{\mathbbm{d}^{\star}})]]

=\sum_{\bar{m}}\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{J}(\bar{A}^{% \mathbbm{d}^{\star}}))\mid\bar{J}(\bar{A}^{\mathbbm{d}^{\star}})=\bar{m}]*% \mathsf{P}(\bar{J}(\bar{A}^{\mathbbm{d}^{\star}})=\bar{m})

=\sum_{\bar{m}}\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{m})\mid\bar{J}% (\bar{A}^{\mathbbm{d}^{\star}})=\bar{m}]*\mathsf{P}(\bar{J}(\bar{A}^{\mathbbm{% d}^{\star}})=\bar{m})

Since $\bar{J}$ is defined to be distributed like $\bar{M}$ and independent of all data,

=\sum_{\bar{m}}\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{m})]*\mathsf{P% }(\bar{M}(\bar{A}^{\mathbbm{d}^{\star}})=\bar{m})

Given policies $\mathbbm{d}^{\prime}$ and $\mathbbm{d}^{\star}$ , we let $\varphi(\bar{m})=\mathsf{E}[Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{m})]$ and $\lambda(\bar{m})=\mathsf{P}(\bar{M}(\bar{A}^{\mathbbm{d}^{\star}})=\bar{m})$ and will show that $\varphi$ and $\lambda$ are equivalent to the expressions given in the theorem.

First, for $\varphi$ , we consider the relabeled dataset where for all $t\leq\tau$ , $(L_{t},Z_{t})=(C_{2t-1},C_{2t})$ and $(A_{t},M_{t})=(E_{2t-1},E_{2t})$ . In this case, the expression $Y(\bar{A}^{\mathbbm{d}^{\prime}},\bar{m})$ is the expected value of the counterfactual outcome of the longitudinal modified treatment policy setting the values of $E$ to the corresponding values $\bar{A}^{\mathbbm{d}^{\prime}}$ and $\bar{m}$ . Similarly to Díaz et al. (2023b), this expression is identified by the following strategy.

Let $\mu_{2\tau+1}=Y$ and for $t=2\tau,\dots,1$ recursively define

\mu_{t}(e_{t},h_{t})=\mathsf{E}[\mu_{t+1}(E{{}^{\prime}}_{t+1},H_{t+1})\mid E_% {t}=e_{t},H_{t}=h_{t}]

where $H_{t}$ denotes all variables prior to $E_{t}$ . Then by integration, $\mathsf{E}[Y(\mathsf{E}^{\mathbbm{d}^{\prime}})]=\mathsf{E}[\mu_{1}(E_{1}^{% \prime},C_{1})]$ .

Noting that the desired intervention on $\bar{m}$ is not stochastic (and thus the mediation component of the intervention $\mathbbm{d}^{\prime}$ can be passed into the $\mathsf{Q}$ -functions as the future histories $\underaccent{\bar}{m}_{t}$ ); thus one can see that substituting variables is sufficient to recognize, when $\mathbbm{d}^{\prime}$ is the intervention setting $E$ to the values of $\bar{A}^{\mathbbm{d}^{\prime}}$ and $\bar{m}$ , the relation $\mathsf{E}[\mu_{1}(E_{1}^{\prime},C_{1})]=\mathsf{Q}_{L,0}^{\prime}(\bar{m})$ , as in the Theorem.

For $\lambda(\bar{m})=\mathsf{P}(\bar{M}(\bar{A}^{\mathbbm{d}^{\star}})=\bar{m})$ , we again relabel the data, this time grou** $L,Z,M$ as $S_{t}=(L_{t},Z_{t-1},\bar{M}_{t-1})$ (and again suitably altering the meaning of $H_{t}$ to denote all variables before $A_{t}$ , or $H_{A,t}$ in the original labeling). Then $\bar{M}(\bar{A}^{\mathbbm{d}\star})$ is simply a projection of the outcome $S_{\tau+1}$ under the longitudinal modified treatment policy $\bar{A}^{\mathbbm{d}\star}$ . Denoting this projection as $S_{t}^{m}=\bar{M}_{t-1}$ , we can write $\mathsf{P}(\bar{M}(\bar{A}^{\mathbbm{d}\star})=\bar{m})=\mathsf{E}[\mathbbm{1}% (S_{\tau+1}^{m}=\bar{m})]$ and we can again apply a similar identification strategy.

As a shorthand, let $\mathbbm{1}_{\tau+1}^{\bar{m}}=\mathbbm{1}(S_{\tau+1}^{m}=\bar{m})$ . Let $\rho_{\tau+1}=\mathbbm{1}_{\tau+1}^{\bar{m}}$ and for $t=\tau,\dots,1$ recursively define

\rho_{t}(a_{t},h_{t})=\mathsf{E}[\rho_{t+1}(A^{\star}_{t+1},H_{t+1})\mid A_{t}% =a_{t},H_{t}=h_{t}]

Then by integration we have $\lambda(\bar{m})=\mathsf{E}[\rho_{1}(A_{1}^{\star},S_{1})]$ . We will show that this expression is equivalent to $\mathsf{Q}_{M,0}^{\star}(\bar{m})$ as in the Theorem.

First, by definition we have

\rho_{\tau}(a_{\tau},h_{\tau})=\mathsf{E}[\mathbbm{1}_{\tau+1}^{\bar{m}}\mid A% _{\tau}=a_{\tau},H_{t}=h_{\tau}]

The conditioning event fixes all mediator values but the final one, so we can write

=\mathbbm{1}(S_{\tau}^{m}=\bar{m}_{\tau-1})\mathsf{P}(M_{\tau}=m_{\tau}\mid A_% {\tau}=a_{\tau},H_{t}=h_{\tau})

=\mathbbm{1}(S_{\tau}^{m}=\bar{m}_{\tau-1})\mathsf{E}[\mathbbm{1}(M_{\tau}=m_{% \tau})\mid A_{\tau}=a_{\tau},H_{t}=h_{\tau}]

By definition,

=\mathbbm{1}(S_{\tau}^{m}=\bar{m}_{\tau-1})\mathsf{Q}_{M,\tau}^{*}(% \underaccent{\bar}{m}_{\tau})

Then for $t=\tau-1$ , we similarly have

\rho_{\tau-1}(a_{\tau-1},h_{\tau-1})=\mathsf{E}[\rho_{\tau}(A_{\tau}^{\star},H% _{\tau})\mid A_{\tau-1}=a_{\tau-1},H_{t}=h_{\tau-1}]

=\mathsf{E}[\mathbbm{1}(S_{\tau}^{m}=\bar{m}_{\tau-1})\mathsf{Q}_{M,\tau}^{*}(% \underaccent{\bar}{m}_{\tau})\mid A_{\tau-1}=a_{\tau-1},H_{t}=h_{\tau-1}]

=\mathbbm{1}(S_{\tau-1}^{m}=\bar{m}_{\tau-2})\mathsf{E}[\mathbbm{1}(M_{\tau-1}% =m_{\tau-1})\mathsf{Q}_{M,\tau}^{*}(\underaccent{\bar}{m}_{\tau})\mid A_{\tau-% 1}=a_{\tau-1},H_{t}=h_{\tau-1}]

=\mathbbm{1}(S_{\tau-1}^{m}=\bar{m}_{\tau-2})\mathsf{Q}_{M,\tau-1}^{\star}(% \underaccent{\bar}{m}_{\tau-1})

Repeated iterations yield the pattern

\rho_{t}(a_{t},h_{t})=\mathbbm{1}(S_{t}^{m}=\bar{m}_{t-1})\mathsf{Q}_{M,t}^{% \star}(\underaccent{\bar}{m}_{t})

Thus $\lambda(\bar{m})=\mathsf{E}[\rho_{1}(A_{1}^{\star},S_{1})]=\mathsf{E}[\mathsf{% Q}_{M,1}^{\star}(\bar{m})]=\mathsf{Q}_{M,0}^{\star}(\bar{m})$ as desired. ∎

S3 IPW identification

Díaz et al. (2021) gives, for a general longitudinal modified treatment policy $\bar{A}^{\mathbbm{d}}$ and outcome $Y$ (without explicit mediation or intermediate confounders):

\mathsf{E}[Y(\bar{A}^{\mathbbm{d}})]=\mathsf{E}\left[Y\prod_{t=1}^{\tau}\frac{% g_{t}^{\mathbbm{d}}(a_{t}|h_{A,t})}{g_{t}(a_{t}|h_{A,t})}\right]

Applying this result after relabeling the data exactly as in the preceding Section S2 (respectively for each estimand) immediately yields the given IPW expressions for $\varphi$ and $\lambda$ .

References

Avin et al. (2005) Chen Avin, Ilya Shpitser, and Judea Pearl. Identifiability of path-specific effects. In IJCAI International Joint Conference on Artificial Intelligence, pages 357–363, 2005.
Bang and Robins (2005) Heejung Bang and James M Robins. Doubly robust estimation in missing data and causal inference models. Biometrics, 61(4):962–973, 2005.
Bavishi et al. (2021) Avni A Bavishi, Ruben J Mylvaganam, Rishi Agarwal, Ryan J Avery, and Michael J Cuttica. Timing of intubation in coronavirus disease 2019: A study of ventilator mechanics, imaging, findings, and outcomes. Critical Care Explorations, 3(5), 2021.
Bickel et al. (1997) Peter J Bickel, Chris AJ Klaassen, YA’Acov Ritov, and Jon A Wellner. Efficient and Adaptive Estimation for Semiparametric Models. Springer-Verlag, 1997.
Chernozhukov et al. (2018) Victor Chernozhukov, Denis Chetverikov, Mert Demirer, Esther Duflo, Christian Hansen, Whitney Newey, and James Robins. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68, 2018.
Díaz and Hejazi (2020) Iván Díaz and Nima S Hejazi. Causal mediation analysis for stochastic interventions. Journal of the Royal Statistical Society Series B: Statistical Methodology, 82(3):661–683, 2020.
Díaz and van der Laan (2012) Iván Díaz and Mark J van der Laan. Population Intervention Causal Effects Based on Stochastic Interventions. Biometrics, 68(2):541–549, 2012.
Díaz et al. (2021) Iván Díaz, Nicholas Williams, Katherine L Hoffman, and Edward J Schenck. Nonparametric causal effects based on longitudinal modified treatment policies. Journal of the American Statistical Association, pages 1–16, 2021.
Díaz et al. (2023a) Iván Díaz, Katherine L Hoffman, and Nima S Hejazi. Causal survival analysis under competing risks using longitudinal modified treatment policies. Lifetime Data Analysis, pages 1–24, 2023a.
Díaz et al. (2023b) Iván Díaz, Nicholas Williams, and Kara E Rudolph. Efficient and flexible mediation analysis with time-varying mediators, treatments, and confounders. Journal of Causal Inference, 11(1):20220077, 2023b.
Durdevic et al. (2020) Momcilo Durdevic, Dragana Durdevic, Maria Bernal Riera, Abhishek Nimkar, Andreea Constanta Stan, Amrah Hasan, Ashutossh Naaraayan, and Stephen Jesmajian. Progressive renal failure in patients with covid-19 after initiating mechanical ventilation: A case series. Chest, 158(4):A2629, 2020.
Gallifant et al. (2022) Jack Gallifant, Joe Zhang, Maria del Pilar Arias Lopez, Tingting Zhu, Luigi Camporota, Leo A Celi, and Federico Formenti. Artificial intelligence for mechanical ventilation: systematic review of design, reporting standards, and bias. British Journal of Anaesthesia, 128(2):343–351, 2022.
Glassford and Bellomo (2011) Neil J Glassford and Rinaldo Bellomo. Fluid therapy in acute kidney injury: the FACTTs. Nature Reviews Nephrology, 7(6):305–306, 2011.
Goyal et al. (2020) Parag Goyal, Justin J Choi, Laura C Pinheiro, Edward J Schenck, Ruijun Chen, Assem Jabri, Michael J Satlin, Thomas R Campion Jr, Musarrat Nahid, Joanna B Ringel, et al. Clinical characteristics of Covid-19 in New York City. New England Journal of Medicine, 382(24):2372–2374, 2020.
Grams et al. (2011) Morgan E Grams, Michelle M Estrella, Josef Coresh, Roy G Brower, Kathleen D Liu, and National Heart, Lung, and Blood Institute Acute Respiratory Distress Syndrome Network and others. Fluid balance, diuretic use, and mortality in acute kidney injury. Clinical journal of the American Society of Nephrology: CJASN, 6(5):966, 2011.
Haneuse and Rotnitzky (2013) Sebastian Haneuse and Andrea Rotnitzky. Estimation of the effect of interventions that modify the received treatment. Statistics in Medicine, 2013.
Hasan et al. (2020) Syed Shahzad Hasan, Toby Capstick, Raees Ahmed, Chia Siang Kow, Faizan Mazhar, Hamid A Merchant, and Syed Tabish Razi Zaidi. Mortality in COVID-19 patients with acute respiratory distress syndrome and corticosteroids use: a systematic review and meta-analysis. Expert Review of Respiratory Medicine, 14(11):1149–1163, 2020.
Hejazi et al. (2023) Nima S Hejazi, Kara E Rudolph, Mark J Van Der Laan, and Iván Díaz. Nonparametric causal mediation analysis for stochastic interventional (in) direct effects. Biostatistics, 24(3):686–707, 2023.
Hernán et al. (2006) Miguel A Hernán, Emilie Lanoy, Dominique Costagliola, and James M Robins. Comparison of dynamic treatment regimes via inverse probability weighting. Basic & clinical pharmacology & toxicology, 98(3):237–242, 2006.
Hoffman et al. (2024) Katherine L. Hoffman, Diego Salazar-Barreto, Nicholas Williams, Kara E. Rudolph, and Ivan Diaz. Studying continuous, time-varying, and/or complex exposures using longitudinal modified treatment policies, 2024. URL https://arxiv.longhoe.net/abs/2304.09460.
Husain-Syed et al. (2016) Faeq Husain-Syed, Arthur S Slutsky, and Claudio Ronco. Lung–kidney cross-talk in the critically ill patient. American Journal of Respiratory and Critical Care Medicine, 194(4):402–414, 2016.
Imai et al. (2010) Kosuke Imai, Luke Keele, and Dustin Tingley. A general approach to causal mediation analysis. Psychological methods, 15(4):309, 2010.
Joannidis et al. (2020) Michael Joannidis, Lui G Forni, Sebastian J Klein, Patrick M Honore, Kianoush Kashani, Marlies Ostermann, John Prowle, Sean M Bagshaw, Vincenzo Cantaluppi, Michael Darmon, et al. Lung–kidney interactions in critically ill patients: consensus report of the acute disease quality initiative (ADQI) 21 Workgroup. Intensive Care Medicine, 46(4):654–672, 2020.
Johnson et al. (2018) Kipp W Johnson, Benjamin S Glicksberg, Rachel A Hodos, Khader Shameer, and Joel T Dudley. Causal inference on electronic health records to assess blood pressure treatment targets: an application of the parametric g formula. In Pacific Symposium on Biocomputing 2018: Proceedings of the Pacific Symposium, pages 180–191. World Scientific, 2018.
Kennedy (2018) Edward H Kennedy. Nonparametric causal effects based on incremental propensity score interventions. Journal of the American Statistical Association, 2018.
Kennedy et al. (2017) Edward H Kennedy, Zongming Ma, Matthew D McHugh, and Dylan S Small. Non-parametric methods for doubly robust estimation of continuous treatment effects. Journal of the Royal Statistical Society Series B: Statistical Methodology, 79(4):1229–1245, 2017.
Kes and Jukić (2010) Petar Kes and Nikolina Bašić Jukić. Acute kidney injury in the intensive care unit. Bosnian Journal of Basic Medical Sciences, 10(Suppl 1):S8, 2010.
Khwaja (2012) Arif Khwaja. KDIGO clinical practice guidelines for acute kidney injury. Nephron Clinical Practice, 120(4):c179–c184, 2012.
Kyriacou and Lewis (2016) Demetrios N Kyriacou and Roger J Lewis. Confounding by indication in clinical research. Jama, 316(17):1818–1819, 2016.
Lee and Chang (2022) Jimin Lee and Sung Man Chang. Confounding by indication in studies of selective serotonin reuptake inhibitors. Psychiatry Investigation, 19(11):873, 2022.
Luedtke et al. (2017) Alexander R Luedtke, Oleg Sofrygin, Mark J van der Laan, and Marco Carone. Sequential double robustness in right-censored longitudinal models. arXiv preprint arXiv:1705.02459, 2017.
MacKinnon et al. (2000) David P MacKinnon, Jennifer L Krull, and Chondra M Lockwood. Equivalence of the mediation, confounding and suppression effect. Prevention science, 1:173–181, 2000.
Mathur and Burns (2019) Piyush Mathur and Michael L Burns. Artificial intelligence in critical care. International anesthesiology clinics, 57(2):89–102, 2019.
Miles et al. (2015) Caleb H Miles, Phyllis Kanki, Seema Meloni, and Eric J Tchetgen Tchetgen. On partial identification of the pure direct effect. arXiv preprint arXiv:1509.01652, 2015.
Nguyen et al. (2021) Trang Quynh Nguyen, Ian Schmid, and Elizabeth A Stuart. Clarifying causal mediation analysis for the applied researcher: Defining effects based on what we want to learn. Psychological Methods, 26(2):255, 2021.
Papoutsi et al. (2021) Eleni Papoutsi, Vassilis G Giannakoulis, Eleni Xourgia, Christina Routsi, Anastasia Kotanidou, and Ilias I Siempos. Effect of timing of intubation on clinical outcomes of critically ill patients with COVID-19: a systematic review and meta-analysis of non-randomized cohort studies. Critical Care, 25:1–9, 2021.
Pearl (2000) Judea Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, Cambridge, 2000.
Pearl (2001) Judea Pearl. Direct & indirect effects. In Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence, UAI ’01, pages 411–420, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc. ISBN 1-55860-800-1. URL http://dl.acm.org/citation.cfm?id=647235.720084.
Pickkers et al. (2022) Peter Pickkers, Patrick T Murray, and Marlies Ostermann. New drugs for acute kidney injury. Intensive Care Medicine, 48(12):1796–1798, 2022.
Richardson and Robins (2013) Thomas S Richardson and James M Robins. Single world intervention graphs (SWIGs): A unification of the counterfactual and graphical approaches to causality. Center for the Statistics and the Social Sciences, University of Washington Series. Working Paper, 128(30):2013, 2013.
Robins and Greenland (1992) James M Robins and Sander Greenland. Identifiability and exchangeability for direct and indirect effects. Epidemiology, 3(0):143–155, 1992.
Robins et al. (2004) James M Robins, Miguel A Hernán, and Uwe Siebert. Effects of multiple interventions. Comparative quantification of health risks: global and regional burden of disease attributable to selected major risk factors, 1:2191–2230, 2004.
Robins (2000) J.M. Robins. Robust estimation in sequentially ignorable missing data and causal inference models. In Proceedings of the American Statistical Association, 2000.
Rotnitzky et al. (2017) Andrea Rotnitzky, James Robins, and Lucia Babino. On the multiply robust estimation of the mean of the g-functional. arXiv preprint arXiv:1705.08582, 2017.
Rubin and van der Laan (2007) Daniel Rubin and Mark J van der Laan. A doubly robust censoring unbiased transformation. The international journal of biostatistics, 3(1), 2007.
Rudolph and Díaz (2022) Kara E Rudolph and Iván Díaz. When the ends do not justify the means: Learning who is predicted to have harmful indirect effects. Journal of the Royal Statistical Society Series A: Statistics in Society, 185(Supplement_2):S573–S589, 2022.
Schenck et al. (2021) Edward J Schenck, Katherine L Hoffman, Marika Cusick, Joseph Kabariti, Evan T Sholle, and Thomas R Campion Jr. Critical carE Database for Advanced Research (CEDAR): An automated method to support intensive care units with electronic health record data. Journal of Biomedical Informatics, 118:103789, 2021.
Shrestha et al. (2022) Dhan Bahadur Shrestha, Yub Raj Sedhai, Pravash Budhathoki, Ayush Adhikari, Nisheem Pokharel, Richa Dhakal, Satyasuna Kafle, Wasey Ali Yadullahi Mir, Roshan Acharya, Markos G Kashiouris, et al. Pulmonary barotrauma in COVID-19: A systematic review and meta-analysis. Annals of Medicine and Surgery, 73:103221, 2022.
Tai et al. (2022) An-Shun Tai, Sheng-Hsuan Lin, Yu-Cheng Chu, Tsung Yu, Milo A Puhan, and Tyler VanderWeele. Causal mediation analysis with multiple time-varying mediators. Epidemiology, 34(1):8–19, 2022.
Thomson and Calligaro (2021) David A Thomson and Gregory L Calligaro. Timing of intubation in COVID-19: Not just location, location, location? Critical Care, 25(1):1–2, 2021.
Tobin (2006) Martin J. Tobin. Principles and practice of mechanical ventilation, 2nd edition. Shock, 26(4):426, Oct 2006. doi: 10.1097/01.shk.0000245023.16612.dd.
Tobin (2020) Martin J Tobin. Basing respiratory management of COVID-19 on physiological principles. American Journal of Respiratory and Critical Care Medicine, 201(11):1319–1320, 2020. doi: 10.1164/rccm.202004-1076ED. URL https://doi.org/10.1164/rccm.202004-1076ED.
van der Laan and Gruber (2012) Mark J van der Laan and Susan Gruber. Targeted minimum loss based estimation of causal effects of multiple time point interventions. The international journal of biostatistics, 8(1), 2012.
van der Laan and Robins (2003) Mark J van der Laan and James M Robins. Unified Methods for Censored Longitudinal Data and Causality. Springer, New York, 2003.
van der Laan and Rose (2011) Mark J van der Laan and Sherri Rose. Targeted Learning: Causal Inference for Observational and Experimental Data. Springer, New York, 2011.
van der Laan and Rose (2018) Mark J van der Laan and Sherri Rose. Targeted Learning in Data Science: Causal Inference for Complex longitudinal Studies. Springer, New York, 2018.
van der Laan and Rubin (2006) Mark J van der Laan and Daniel Rubin. Targeted maximum likelihood learning. The International Journal of Biostatistics, 2(1), 2006.
Van der Laan et al. (2007) Mark J Van der Laan, Eric C Polley, and Alan E Hubbard. Super learner. Statistical Applications in Genetics and Molecular Biology, 6(1), 2007.
van der Vaart (1998) A. W. van der Vaart. Asymptotic Statistics. Cambridge University Press, 1998.
VanderWeele and Tchetgen (2017) Tyler J VanderWeele and Eric J Tchetgen Tchetgen. Mediation analysis with time varying exposures and mediators. Journal of the Royal Statistical Society. Series B, Statistical Methodology, 79(3):917, 2017.
VanderWeele et al. (2014) Tyler J VanderWeele, Stijn Vansteelandt, and James M Robins. Effect decomposition in the presence of an exposure-induced mediator-outcome confounder. Epidemiology (Cambridge, Mass.), 25(2):300, 2014.
Vemuri et al. (2022) Sneha V Vemuri, Mark L Rolfsen, Alexandra V Sykes, Puja G Takiar, Austin J Leonard, Atul Malhotra, Roger G Spragg, Etienne Macedo, and Mark L Hepokoski. Association Between Acute Kidney Injury During Invasive Mechanical Ventilation and ICU Outcomes and Respiratory System Mechanics. Critical Care Explorations, 4(7), 2022.
Wang et al. (2021) **g Gennie Wang, Bian Liu, Bethany Percha, Stephanie Pan, Neha Goel, Kusum S Mathews, Cynthia Gao, Pranai Tandon, Max Tomlinson, Edwin Yoo, et al. Cardiovascular disease and severe hypoxemia are associated with higher rates of noninvasive respiratory support failure in coronavirus disease 2019 pneumonia. Critical Care Explorations, 3(3), 2021.
Wicky et al. (2021) Paul-Henri Wicky, Michael S Niedermann, and Jean-François Timsit. Ventilator-associated pneumonia in the era of COVID-19 pandemic: How common and what is the impact? Critical Care, 25(1):1–3, 2021.
Williams and Díaz (2023) Nicholas Williams and Iván Díaz. lcmmtp: Nonparametric Longitudinal Causal Mediation Using Modified Treatment Policies, 2023. URL https://github.com/nt-williams/lcmmtp. R package version 0.1.0.
Wright (1921) Sewall Wright. Correlation and causation. Journal of Agricultural Research, 20(7):557–585, 1921.
Young et al. (2014) Jessica G Young, Miguel A Hernán, and James M Robins. Identification, estimation and approximation of risk under interventions that depend on the natural value of treatment using observational data. Epidemiologic Methods, 3(1):1–19, 2014.
Zheng and van der Laan (2017) Wen**g Zheng and Mark van der Laan. Longitudinal mediation analysis with time-varying mediators and exposures, with application to survival outcomes. Journal of Causal Inference, 5(2), 2017.
Zheng and van der Laan (2011) Wen**g Zheng and Mark J van der Laan. Cross-validated targeted minimum-loss-based estimation. In Targeted Learning, pages 459–474. Springer, 2011.
Zhu et al. (2021) Yaqian Zhu, Rebecca A Hubbard, Jessica Chubak, Jason Roy, and Nandita Mitra. Core concepts in pharmacoepidemiology: Violations of the positivity assumption in the causal analysis of observational data: Consequences and statistical approaches. Pharmacoepidemiology and Drug Safety, 30(11):1471–1485, 2021.