Search | arXiv e-print repository

Planning for Gold: Sample Splitting for Valid Powerful Design of Observational Studies

Authors: William Bekerman, Abhinandan Dalal, Carlo del Ninno, Dylan S. Small

Abstract: Observational studies are valuable tools for inferring causal effects in the absence of controlled experiments. However, these studies may be biased due to the presence of some relevant, unmeasured set of covariates. The design of an observational study has a prominent effect on its sensitivity to hidden biases, and the best design may not be apparent without examining the data. One approach to fa… ▽ More Observational studies are valuable tools for inferring causal effects in the absence of controlled experiments. However, these studies may be biased due to the presence of some relevant, unmeasured set of covariates. The design of an observational study has a prominent effect on its sensitivity to hidden biases, and the best design may not be apparent without examining the data. One approach to facilitate a data-inspired design is to split the sample into a planning sample for choosing the design and an analysis sample for making inferences. We devise a powerful and flexible method for selecting outcomes in the planning sample when an unknown number of outcomes are affected by the treatment. We investigate the theoretical properties of our method and conduct extensive simulations that demonstrate pronounced benefits, especially at higher levels of allowance for unmeasured confounding. Finally, we demonstrate our method in an observational study of the multi-dimensional impacts of a devastating flood in Bangladesh. △ Less

Submitted 2 June, 2024; originally announced June 2024.

arXiv:2405.16046 [pdf, ps, other]

Sensitivity Analysis for Attributable Effects in Case$^2$ Studies

Authors: Kan Chen, Ting Ye, Dylan S. Small

Abstract: The case$^2$ study, also referred to as the case-case study design, is a valuable approach for conducting inference for treatment effects. Unlike traditional case-control studies, the case$^2$ design compares treatment in two types of cases with the same disease. A key quantity of interest is the attributable effect, which is the number of cases of disease among treated units which are caused by t… ▽ More The case$^2$ study, also referred to as the case-case study design, is a valuable approach for conducting inference for treatment effects. Unlike traditional case-control studies, the case$^2$ design compares treatment in two types of cases with the same disease. A key quantity of interest is the attributable effect, which is the number of cases of disease among treated units which are caused by the treatment. Two key assumptions that are usually made for making inferences about the attributable effect in case$^2$ studies are 1.) treatment does not cause the second type of case, and 2.) the treatment does not alter an individual's case type. However, these assumptions are not realistic in many real-data applications. In this article, we present a sensitivity analysis framework to scrutinize the impact of deviations from these assumptions on obtained results. We also include sensitivity analyses related to the assumption of unmeasured confounding, recognizing the potential bias introduced by unobserved covariates. The proposed methodology is exemplified through an investigation into whether having violent behavior in the last year of life increases suicide risk via 1993 National Mortality Followback Survey dataset. △ Less

Submitted 25 May, 2024; originally announced May 2024.

Comments: 25 pages, 2 Figures, 4 Tables

arXiv:2403.19807 [pdf, other]

Protocols for Observational Studies: Methods and Open Problems

Authors: Dylan S. Small

Abstract: For learning about the causal effect of a treatment, a randomized controlled trial (RCT) is considered the gold standard. However, randomizing treatment is sometimes unethical or infeasible, and instead an observational study may be conducted. While some aspects of a well designed RCT cannot be replicated in an observational study, one aspect that can is to have a protocol with prespecified hypoth… ▽ More For learning about the causal effect of a treatment, a randomized controlled trial (RCT) is considered the gold standard. However, randomizing treatment is sometimes unethical or infeasible, and instead an observational study may be conducted. While some aspects of a well designed RCT cannot be replicated in an observational study, one aspect that can is to have a protocol with prespecified hypotheses about prespecified outcomes and a prespecified analysis. We illustrate the value of protocols for observational studies in three applications -- the effect of playing high school football on later life mental functioning, the effect of police seizing a gun when arresting a domestic violence suspect on future domestic violence and the effect of mountaintop mining on health. We then discuss methodologies for observational study protocols. We discuss considerations for protocols that are similar between observational studies and RCTs, and considerations that are different. The considerations that are different include (i) whether the protocol should be specified before treatment assignment is known or after; (ii) how multiple outcomes should be incorporated into the planned analysis and (iii) how subgroups should be incorporated into the planned analysis. We conclude with discussion of a few open problems in the methodology of observational study protocols. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2303.06332 [pdf, ps, other]

A Differential Effect Approach to Partial Identification of Treatment Effects

Authors: Kan Chen, Bingkai Wang, Dylan S. Small

Abstract: We consider identification and inference for the average treatment effect and heterogeneous treatment effect conditional on observable covariates in the presence of unmeasured confounding. Since point identification of these treatment effects is not achievable without strong assumptions, we obtain bounds on these treatment effects by leveraging differential effects, a tool that allows for using a… ▽ More We consider identification and inference for the average treatment effect and heterogeneous treatment effect conditional on observable covariates in the presence of unmeasured confounding. Since point identification of these treatment effects is not achievable without strong assumptions, we obtain bounds on these treatment effects by leveraging differential effects, a tool that allows for using a second treatment to learn the effect of the first treatment. The differential effect is the effect of using one treatment in lieu of the other. We provide conditions under which differential treatment effects can be used to point identify or partially identify treatment effects. Under these conditions, we develop a flexible and easy-to-implement semi-parametric framework to estimate bounds and leverage a two-stage approach to conduct statistical inference on effects of interest. The proposed method is examined through a simulation study and a case study that investigates the effect of smoking on the blood level of cadmium using the National Health and Nutrition Examination Survey. △ Less

Submitted 25 September, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

Comments: 51 pages, 5 figures, 11 tables

arXiv:2301.04412 [pdf, ps, other]

RobustIV and controlfunctionIV: Causal Inference for Linear and Nonlinear Models with Invalid Instrumental Variables

Authors: Taehyeon Koo, You** Lee, Dylan S. Small, Zijian Guo

Abstract: We present R software packages RobustIV and controlfunctionIV for causal inference with possibly invalid instrumental variables. RobustIV focuses on the linear outcome model. It implements the two-stage hard thresholding method to select valid instrumental variables from a set of candidate instrumental variables and make inferences for the causal effect in both low- and high-dimensional settings.… ▽ More We present R software packages RobustIV and controlfunctionIV for causal inference with possibly invalid instrumental variables. RobustIV focuses on the linear outcome model. It implements the two-stage hard thresholding method to select valid instrumental variables from a set of candidate instrumental variables and make inferences for the causal effect in both low- and high-dimensional settings. Furthermore, RobustIV implements the high-dimensional endogeneity test and the searching and sampling method, a uniformly valid inference method robust to errors in instrumental variable selection. controlfunctionIV considers the nonlinear outcome model and makes inferences about the causal effect based on the control function method. Our packages are demonstrated using two publicly available economic data sets together with applications to the Framingham Heart Study. △ Less

Submitted 20 June, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

arXiv:2210.07324 [pdf, ps, other]

doi 10.1080/01621459.2023.2289693

Model-robust and efficient covariate adjustment for cluster-randomized experiments

Authors: Bingkai Wang, Chan Park, Dylan S. Small, Fan Li

Abstract: Cluster-randomized experiments are increasingly used to evaluate interventions in routine practice conditions, and researchers often adopt model-based methods with covariate adjustment in the statistical analyses. However, the validity of model-based covariate adjustment is unclear when the working models are misspecified, leading to ambiguity of estimands and risk of bias. In this article, we fir… ▽ More Cluster-randomized experiments are increasingly used to evaluate interventions in routine practice conditions, and researchers often adopt model-based methods with covariate adjustment in the statistical analyses. However, the validity of model-based covariate adjustment is unclear when the working models are misspecified, leading to ambiguity of estimands and risk of bias. In this article, we first adapt two conventional model-based methods, generalized estimating equations and linear mixed models, with weighted g-computation to achieve robust inference for cluster-average and individual-average treatment effects. To further overcome the limitations of model-based covariate adjustment methods, we propose an efficient estimator for each estimand that allows for flexible covariate adjustment and additionally addresses cluster size variation dependent on treatment assignment and other cluster characteristics. Such cluster size variations often occur post-randomization and, if ignored, can lead to bias of model-based estimators. For our proposed efficient covariate-adjusted estimator, we prove that when the nuisance functions are consistently estimated by machine learning algorithms, the estimator is consistent, asymptotically normal, and efficient. When the nuisance functions are estimated via parametric working models, the estimator is triply-robust. Simulation studies and analyses of three real-world cluster-randomized experiments demonstrate that the proposed methods are superior to existing alternatives. △ Less

Submitted 18 July, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

arXiv:2210.05169 [pdf, other]

Protocol for an Observational Study on the Effects of Giving Births from Unintended Pregnancies on Later Life Physical and Mental Health

Authors: Samrat Roy, Marina Bogomolov, Ruth Heller, Amy M. Claridge, Tishra Beeson, Dylan S. Small

Abstract: There has been increasing interest in studying the effect of giving births to unintended pregnancies on later life physical and mental health. In this article, we provide the protocol for our planned observational study on the long-term mental and physical health consequences for mothers who bear children resulting from unintended pregnancies. We aim to use the data from the Wisconsin Longitudinal… ▽ More There has been increasing interest in studying the effect of giving births to unintended pregnancies on later life physical and mental health. In this article, we provide the protocol for our planned observational study on the long-term mental and physical health consequences for mothers who bear children resulting from unintended pregnancies. We aim to use the data from the Wisconsin Longitudinal Study (WLS) and examine the effect of births from unintended pregnancies on a broad range of outcomes, including mental depression, psychological well-being, physical health, alcohol usage, and economic well-being. To strengthen our causal findings, we plan to address our research questions on two subgroups, Catholics and non-Catholics, and discover the "replicable" outcomes for which the effect of unintended pregnancy is negative (or, positive) in both subgroups. Following the idea of non-random cross-screening, the data will be split according to whether the woman is Catholic or not, and then one part of the data will be used to select the hypotheses and design the corresponding tests for the second part of the data. In past use of cross-screening (automatic cross-screening) there was only one team of investigators that dealt with both parts of the data so that the investigators would need to decide on an analysis plan before looking at the data. In this protocol, we describe plans to carry out a novel flexible cross-screening in which there will be two teams of investigators with access only to one part of data and each team will use their part of the data to decide how to plan the analysis for the second team's data. In addition to the above replicability analysis, we also discuss the plan to test the global null hypothesis that is intended to identify the outcomes which are affected by unintended pregnancy for at least one of the two subgroups of Catholics and non-Catholics. △ Less

Submitted 30 April, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

Comments: 29 pages

MSC Class: 62P25; 62F03

arXiv:2209.10339 [pdf, other]

Structural mean models for instrumented difference-in-differences

Authors: Tat-Thang Vo, Ting Ye, Ashkan Ertefaie, Samrat Roy, James Flory, Sean Hennessy, Stijn Vansteelandt, Dylan S. Small

Abstract: In the standard difference-in-differences research design, the parallel trends assumption may be violated when the relationship between the exposure trend and the outcome trend is confounded by unmeasured confounders. Progress can be made if there is an exogenous variable that (i) does not directly influence the change in outcome means (i.e. the outcome trend) except through influencing the change… ▽ More In the standard difference-in-differences research design, the parallel trends assumption may be violated when the relationship between the exposure trend and the outcome trend is confounded by unmeasured confounders. Progress can be made if there is an exogenous variable that (i) does not directly influence the change in outcome means (i.e. the outcome trend) except through influencing the change in exposure means (i.e. the exposure trend), and (ii) is not related to the unmeasured exposure - outcome confounders on the trend scale. Such exogenous variable is called an instrument for difference-in-differences. For continuous outcomes that lend themselves to linear modelling, so-called instrumented difference-in-differences methods have been proposed. In this paper, we will suggest novel multiplicative structural mean models for instrumented difference-in-differences, which allow one to identify and estimate the average treatment effect on count and rare binary outcomes, in the whole population or among the treated, when a valid instrument for difference-in-differences is available. We discuss the identifiability of these models, then develop efficient semi-parametric estimation approaches that allow the use of flexible, data-adaptive or machine learning methods to estimate the nuisance parameters. We apply our proposal on health care data to investigate the risk of moderate to severe weight gain under sulfonylurea treatment compared to metformin treatment, among new users of antihyperglycemic drugs. △ Less

Submitted 21 September, 2022; originally announced September 2022.

arXiv:2209.00781 [pdf, other]

Using Case Description Information to Reduce Sensitivity to Bias for the Attributable Fraction Among the Exposed

Authors: Kan Chen, **g Cheng, M. Elizabeth Halloran, Dylan S. Small

Abstract: The attributable fraction among the exposed (\textbf{AF}$_e$), also known as the attributable risk or excess fraction among the exposed, is the proportion of disease cases among the exposed that could be avoided by eliminating the exposure. Understanding the \textbf{AF}$_e$ for different exposures helps guide public health interventions. The conventional approach to inference for the \textbf{AF}… ▽ More The attributable fraction among the exposed (\textbf{AF}$_e$), also known as the attributable risk or excess fraction among the exposed, is the proportion of disease cases among the exposed that could be avoided by eliminating the exposure. Understanding the \textbf{AF}$_e$ for different exposures helps guide public health interventions. The conventional approach to inference for the \textbf{AF}$_e$ assumes no unmeasured confounding and could be sensitive to hidden bias from unobserved covariates. In this paper, we propose a new approach to reduce sensitivity to hidden bias for conducting statistical inference on the \textbf{AF}$_e$ by leveraging case description information. Case description information is information that describes the case, e.g., the subtype of cancer. The exposure may have more of an effect on some types of cases than other types. We explore how leveraging case description information can reduce sensitivity to bias from unmeasured confounding through an asymptotic tool, design sensitivity, and simulation studies. We allow for the possibility that leveraging case definition information may introduce additional selection bias through an additional sensitivity parameter. The proposed methodology is illustrated by re-examining alcohol consumption and the risk of postmenopausal invasive breast cancer using case description information on the subtype of cancer (hormone-sensitive or insensitive) using data from the Women's Health Initiative (WHI) Observational Study (OS). △ Less

Submitted 14 March, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

Comments: 30 pages, 8 tables, 1 figure

arXiv:2202.03379 [pdf, other]

Randomization Inference for Cluster-Randomized Test-Negative Designs with Application to Dengue Studies: Unbiased estimation, Partial compliance, and Stepped-wedge design

Authors: Bingkai Wang, Suzanne M. Dufault, Dylan S. Small, Nicholas P. Jewell

Abstract: In 2019, the World Health Organization identified dengue as one of the top ten global health threats. For the control of dengue, the Applying Wolbachia to Eliminate Dengue (AWED) study group conducted a cluster-randomized trial in Yogyakarta, Indonesia, and used a novel design, called the cluster-randomized test-negative design (CR-TND). This design can yield valid statistical inference with data… ▽ More In 2019, the World Health Organization identified dengue as one of the top ten global health threats. For the control of dengue, the Applying Wolbachia to Eliminate Dengue (AWED) study group conducted a cluster-randomized trial in Yogyakarta, Indonesia, and used a novel design, called the cluster-randomized test-negative design (CR-TND). This design can yield valid statistical inference with data collected by a passive surveillance system and thus has the advantage of cost-efficiency compared to traditional cluster-randomized trials. We investigate the statistical assumptions and properties of CR-TND under a randomization inference framework, which is known to be robust and efficient for small-sample problems. We find that, when the differential healthcare-seeking behavior comparing intervention and control varies across clusters (in contrast to the setting of Dufault and Jewell, 2020 where the differential healthcare-seeking behavior is constant across clusters), current analysis methods for CR-TND can be biased and have inflated type I error. We propose the log-contrast estimator that can eliminate such bias and improve precision by adjusting for covariates. Furthermore, we extend our methods to handle partial intervention compliance and a stepped-wedge design, both of which appear frequently in cluster-randomized trials. Finally, we demonstrate our results by simulation studies and re-analysis of the AWED study. △ Less

Submitted 7 February, 2022; originally announced February 2022.

arXiv:2112.00832 [pdf, other]

On the mixed-model analysis of covariance in cluster-randomized trials

Authors: Bingkai Wang, Michael O. Harhay, Jiaqi Tong, Dylan S. Small, Tim P. Morris, Fan Li

Abstract: In the analyses of cluster-randomized trials, mixed-model analysis of covariance (ANCOVA) is a standard approach for covariate adjustment and handling within-cluster correlations. However, when the normality, linearity, or the random-intercept assumption is violated, the validity and efficiency of the mixed-model ANCOVA estimators for estimating the average treatment effect remain unclear. Under t… ▽ More In the analyses of cluster-randomized trials, mixed-model analysis of covariance (ANCOVA) is a standard approach for covariate adjustment and handling within-cluster correlations. However, when the normality, linearity, or the random-intercept assumption is violated, the validity and efficiency of the mixed-model ANCOVA estimators for estimating the average treatment effect remain unclear. Under the potential outcomes framework, we prove that the mixed-model ANCOVA estimators for the average treatment effect are consistent and asymptotically normal under arbitrary misspecification of its working model. If the probability of receiving treatment is 0.5 for each cluster, we further show that the model-based variance estimator under mixed-model ANCOVA1 (ANCOVA without treatment-covariate interactions) remains consistent, clarifying that the confidence interval given by standard software is asymptotically valid even under model misspecification. Beyond robustness, we discuss several insights on precision among classical methods for analyzing cluster-randomized trials, including the mixed-model ANCOVA, individual-level ANCOVA, and cluster-level ANCOVA estimators. These insights may inform the choice of methods in practice. Our analytical results and insights are illustrated via simulation studies and analyses of three cluster-randomized trials. △ Less

Submitted 8 October, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

arXiv:2105.01124 [pdf, other]

Combining Broad and Narrow Case Definitions in Matched Case-Control Studies: Firearms in the Home and Suicide Risk

Authors: Ting Ye, Kan Chen, Dylan S. Small

Abstract: Does having firearms in the home increase suicide risk? To test this hypothesis, a matched case-control study can be performed, in which suicide case subjects are compared to living controls who are similar in observed covariates in terms of their retrospective exposure to firearms at home. In this application, cases can be defined using a broad case definition (suicide) or a narrow case definitio… ▽ More Does having firearms in the home increase suicide risk? To test this hypothesis, a matched case-control study can be performed, in which suicide case subjects are compared to living controls who are similar in observed covariates in terms of their retrospective exposure to firearms at home. In this application, cases can be defined using a broad case definition (suicide) or a narrow case definition (suicide occurred at home). The broad case definition offers a larger number of cases but the narrow case definition may offer a larger effect size. Moreover, restricting to the narrow case definition may introduce selection bias (i.e., bias due to selecting samples based on characteristics affected by the treatment) because exposure to firearms in the home may affect the location of suicide and thus the type of a case a subject is. We propose a new sensitivity analysis framework for combining broad and narrow case definitions in matched case-control studies, that considers the unmeasured confounding bias and selection bias simultaneously. We develop a valid randomization-based testing procedure using only the narrow case matched sets when the effect of the unmeasured confounder on receiving treatment and the effect of the treatment on case definition among the always-cases are controlled by sensitivity parameters. We then use the Bonferroni method to combine the testing procedures using the broad and narrow case definitions. With the proposed methods, we find robust evidence that having firearms at home increases suicide risk. △ Less

Submitted 26 July, 2023; v1 submitted 3 May, 2021; originally announced May 2021.

arXiv:2012.00860 [pdf, other]

Relationship between changing malaria burden and low birth weight in sub-Saharan Africa: a difference-in-differences study via a pair-of-pairs approach

Authors: Siyu Heng, Wendy P. O'Meara, Ryan A. Simmons, Dylan S. Small

Abstract: Although interventional studies demonstrate that preventing malaria during pregnancy can reduce the low birth weight (i.e., child's birth weight $<$ 2,500 grams) rate, it remains unknown whether natural changes in parasite transmission and malaria burden can improve birth outcomes. We conduct an observational study of the effect of changing malaria burden on low birth weight using data from 18,112… ▽ More Although interventional studies demonstrate that preventing malaria during pregnancy can reduce the low birth weight (i.e., child's birth weight $<$ 2,500 grams) rate, it remains unknown whether natural changes in parasite transmission and malaria burden can improve birth outcomes. We conduct an observational study of the effect of changing malaria burden on low birth weight using data from 18,112 births in 19 countries in sub-Saharan African countries during the years 2000--2015. A malaria prevalence decline from a high rate (Plasmodium falciparum parasite rate in children aged 2-up-to-10 (i.e., $Pf\text{PR}_{2-10}$) $>$ 0.4) to a low rate ($Pf\text{PR}_{2-10}$ $<$ 0.2) is estimated to reduce the rate of low birth weight by 1.48 percentage points (95% confidence interval: 3.70 percentage points reduction, 0.74 percentage points increase), which is a 17% reduction in the low birth weight rate compared to the average (8.6%) in our study population with observed birth weight records (1.48/8.6 $\approx$ 17%). When focusing on first pregnancies, a decline in malaria prevalence from high to low is estimated to have a greater impact on the low birth weight rate than for all births: 3.73 percentage points (95% confidence interval: 9.11 percentage points reduction, 1.64 percentage points increase). Although the confidence intervals cannot rule out the possibility of no effect at the 95% confidence level, the concurrence between our primary analysis, secondary analyses, and sensitivity analyses, and the magnitude of the effect size, contribute to the weight of the evidence suggesting that declining malaria burden has an important effect on birth weight at the population level. The novel statistical methodology developed in this article, a pair-of-pairs approach to a difference-in-differences study, could be useful for many settings in which the units observed are different at different times. △ Less

Submitted 12 July, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

arXiv:2011.06917 [pdf, other]

Social Distancing and COVID-19: Randomization Inference for a Structured Dose-Response Relationship

Authors: Bo Zhang, Siyu Heng, Ting Ye, Dylan S. Small

Abstract: Social distancing is widely acknowledged as an effective public health policy combating the novel coronavirus. But extreme social distancing has costs and it is not clear how much social distancing is needed to achieve public health effects. In this article, we develop a design-based framework to make inference about the dose-response relationship between social distancing and COVID-19 related dea… ▽ More Social distancing is widely acknowledged as an effective public health policy combating the novel coronavirus. But extreme social distancing has costs and it is not clear how much social distancing is needed to achieve public health effects. In this article, we develop a design-based framework to make inference about the dose-response relationship between social distancing and COVID-19 related death toll and case numbers. We first discuss how to embed observational data with a time-independent, continuous treatment dose into an approximate randomized experiment, and develop a randomization-based procedure that tests if a structured dose-response relationship fits the data. We then generalize the design and testing procedure to accommodate a time-dependent, treatment dose trajectory, and generalize a dose-response relationship to a longitudinal setting. Finally, we apply the proposed design and testing procedures to investigate the effect of social distancing during the phased reopening in the United States on public health outcomes using data compiled from sources including Unacast, the United States Census Bureau, and the County Health Rankings and Roadmaps Program. We rejected a primary analysis null hypothesis that stated the social distancing from April 27, 2020, to June 28, 2020, had no effect on the COVID-19-related death toll from June 29, 2020, to August 2, 2020 (p-value < 0.001), and found that it took more reduction in mobility to prevent exponential growth in case numbers for non-rural counties compared to rural counties. △ Less

Submitted 9 August, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

arXiv:2011.03593 [pdf, other]

Instrumented Difference-in-Differences

Authors: Ting Ye, Ashkan Ertefaie, James Flory, Sean Hennessy, Dylan S. Small

Abstract: Unmeasured confounding is a key threat to reliable causal inference based on observational studies. Motivated from two powerful natural experiment devices, the instrumental variables and difference-in-differences, we propose a new method called instrumented difference-in-differences that explicitly leverages exogenous randomness in an exposure trend to estimate the average and conditional average… ▽ More Unmeasured confounding is a key threat to reliable causal inference based on observational studies. Motivated from two powerful natural experiment devices, the instrumental variables and difference-in-differences, we propose a new method called instrumented difference-in-differences that explicitly leverages exogenous randomness in an exposure trend to estimate the average and conditional average treatment effect in the presence of unmeasured confounding. We develop the identification assumptions using the potential outcomes framework. We propose a Wald estimator and a class of multiply robust and efficient semiparametric estimators, with provable consistency and asymptotic normality. In addition, we extend the instrumented difference-in-differences to a two-sample design to facilitate investigations of delayed treatment effect and provide a measure of weak identification. We demonstrate our results in simulated and real datasets. △ Less

Submitted 7 November, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

arXiv:2009.06935 [pdf, other]

Constructing a More Closely Matched Control Group in a Difference-in-Differences Analysis: Its Effect on History Interacting with Group Bias

Authors: Pallavi Basu, Dylan S. Small

Abstract: Difference-in-differences analysis with a control group that differs considerably from a treated group is vulnerable to bias from historical events that have different effects on the groups. Constructing a more closely matched control group by matching a subset of the overall control group to the treated group may result in less bias. We study this phenomenon in simulation studies. We study the ef… ▽ More Difference-in-differences analysis with a control group that differs considerably from a treated group is vulnerable to bias from historical events that have different effects on the groups. Constructing a more closely matched control group by matching a subset of the overall control group to the treated group may result in less bias. We study this phenomenon in simulation studies. We study the effect of mountaintop removal mining (MRM) on mortality using a difference-in-differences analysis that makes use of the increase in MRM following the 1990 Clean Air Act Amendments. For a difference-in-differences analysis of the effect of MRM on mortality, we constructed a more closely matched control group and found a 95\% confidence interval that contains substantial adverse effects along with no effect and small beneficial effects. △ Less

Submitted 15 September, 2020; originally announced September 2020.

Comments: Accepted to Observational Studies

arXiv:2009.04832 [pdf, other]

A note on post-treatment selection in studying racial discrimination in policing

Authors: Qingyuan Zhao, Luke J Keele, Dylan S Small, Marshall M Joffe

Abstract: We discuss some causal estimands used to study racial discrimination in policing. A central challenge is that not all police-civilian encounters are recorded in administrative datasets and available to researchers. One possible solution is to consider the average causal effect of race conditional on the civilian already being detained by the police. We find that such an estimand can be quite diffe… ▽ More We discuss some causal estimands used to study racial discrimination in policing. A central challenge is that not all police-civilian encounters are recorded in administrative datasets and available to researchers. One possible solution is to consider the average causal effect of race conditional on the civilian already being detained by the police. We find that such an estimand can be quite different from the more familiar ones in causal inference and needs to be interpreted with caution. We propose using an estimand new for this context -- the causal risk ratio, which has more transparent interpretation and requires weaker identification assumptions. We demonstrate this through a reanalysis of the NYPD Stop-and-Frisk dataset. Our reanalysis shows that the naive estimator that ignores the post-treatment selection in administrative records may severely underestimate the disparity in police violence between minorities and whites in these and similar data. △ Less

Submitted 14 June, 2021; v1 submitted 10 September, 2020; originally announced September 2020.

Comments: Accepted for publication in the American Political Science Review on 14th June, 2021

MSC Class: 62D20 (Primary) 62P25 (Secondary)

arXiv:2006.02423 [pdf, other]

A Negative Correlation Strategy for Bracketing in Difference-in-Differences

Authors: Ting Ye, Luke Keele, Raiden Hasegawa, Dylan S. Small

Abstract: The method of difference-in-differences (DID) is widely used to study the causal effect of policy interventions in observational studies. DID employs a before and after comparison of the treated and control units to remove bias due to time-invariant unmeasured confounders under the parallel trends assumption. Estimates from DID, however, will be biased if the outcomes for the treated and control u… ▽ More The method of difference-in-differences (DID) is widely used to study the causal effect of policy interventions in observational studies. DID employs a before and after comparison of the treated and control units to remove bias due to time-invariant unmeasured confounders under the parallel trends assumption. Estimates from DID, however, will be biased if the outcomes for the treated and control units evolve differently in the absence of treatment, namely if the parallel trends assumption is violated. We propose a general identification strategy that leverages two groups of control units whose outcomes relative to the treated units exhibit a negative correlation, and achieves partial identification of the average treatment effect for the treated. The identified set is of a union bounds form that involves the minimum and maximum operators, which makes the canonical bootstrap generally inconsistent and naive methods overly conservative. By utilizing the directional inconsistency of the bootstrap distribution, we develop a novel bootstrap method to construct uniformly valid confidence intervals for the identified set and parameter of interest when the identified set is of a union bounds form, and we establish the method's theoretical properties. We develop a simple falsification test and sensitivity analysis. We apply the proposed strategy for bracketing to study whether minimum wage laws affect employment levels. △ Less

Submitted 13 June, 2022; v1 submitted 3 June, 2020; originally announced June 2020.

arXiv:2006.01393 [pdf, other]

Two Robust Tools for Inference about Causal Effects with Invalid Instruments

Authors: Hyunseung Kang, You** Lee, T. Tony Cai, Dylan S. Small

Abstract: Instrumental variables have been widely used to estimate the causal effect of a treatment on an outcome. Existing confidence intervals for causal effects based on instrumental variables assume that all of the putative instrumental variables are valid; a valid instrumental variable is a variable that affects the outcome only by affecting the treatment and is not related to unmeasured confounders. H… ▽ More Instrumental variables have been widely used to estimate the causal effect of a treatment on an outcome. Existing confidence intervals for causal effects based on instrumental variables assume that all of the putative instrumental variables are valid; a valid instrumental variable is a variable that affects the outcome only by affecting the treatment and is not related to unmeasured confounders. However, in practice, some of the putative instrumental variables are likely to be invalid. This paper presents two tools to conduct valid inference and tests in the presence of invalid instruments. First, we propose a simple and general approach to construct confidence intervals based on taking unions of well-known confidence intervals. Second, we propose a novel test for the null causal effect based on a collider bias. Our two proposals, especially when fused together, outperform traditional instrumental variable confidence intervals when invalid instruments are present, and can also be used as a sensitivity analysis when there is concern that instrumental variables assumptions are violated. The new approach is applied to a Mendelian randomization study on the causal effect of low-density lipoprotein on the incidence of cardiovascular diseases. △ Less

Submitted 2 June, 2020; originally announced June 2020.

arXiv:2005.01873 [pdf]

Protocol for a Study of the Effect of Surface Mining in Central Appalachia on Adverse Birth Outcomes

Authors: Dylan S. Small, Dan Firth, Luke Keele, Matthew Huber, Molly Passarella, Scott Lorch, Heather Burris

Abstract: Surface mining has become a major method of coal mining in Central Appalachia alongside the traditional underground mining. Concerns have been raised about the health effects of this surface mining, particularly mountaintop removal mining where coal is mined upon steep mountaintops by removing the mountaintop through clearcutting forests and explosives. We have designed a matched observational stu… ▽ More Surface mining has become a major method of coal mining in Central Appalachia alongside the traditional underground mining. Concerns have been raised about the health effects of this surface mining, particularly mountaintop removal mining where coal is mined upon steep mountaintops by removing the mountaintop through clearcutting forests and explosives. We have designed a matched observational study to assess the effects of surface mining in Central Appalachia on adverse birth outcomes. This protocol describes for the study the background and motivation, the sample selection and the analysis plan. △ Less

Submitted 4 May, 2020; originally announced May 2020.

arXiv:2004.00766 [pdf, ps, other]

Sharpening the Rosenbaum Sensitivity Bounds to Address Concerns About Interactions Between Observed and Unobserved Covariates

Authors: Siyu Heng, Dylan S. Small

Abstract: In observational studies, it is typically unrealistic to assume that treatments are randomly assigned, even conditional on adjusting for all observed covariates. Therefore, a sensitivity analysis is often needed to examine how hidden biases due to unobserved covariates would affect inferences on treatment effects. In matched observational studies where each treated unit is matched to one or multip… ▽ More In observational studies, it is typically unrealistic to assume that treatments are randomly assigned, even conditional on adjusting for all observed covariates. Therefore, a sensitivity analysis is often needed to examine how hidden biases due to unobserved covariates would affect inferences on treatment effects. In matched observational studies where each treated unit is matched to one or multiple untreated controls for observed covariates, the Rosenbaum bounds sensitivity analysis is one of the most popular sensitivity analysis models. In this paper, we show that in the presence of interactions between observed and unobserved covariates, directly applying the Rosenbaum bounds will almost inevitably exaggerate the report of sensitivity of causal conclusions to hidden bias. We give sharper odds ratio bounds to fix this deficiency. We illustrate our new method through studying the effect of anger/hostility tendency on the risk of having heart problems. △ Less

Submitted 14 May, 2021; v1 submitted 1 April, 2020; originally announced April 2020.

Comments: 32 pages, 2 tables

arXiv:2002.10436 [pdf, other]

doi 10.1080/01621459.2020.1736083

Selecting and ranking individualized treatment rules with unmeasured confounding

Authors: Bo Zhang, Jordan Weiss, Dylan S Small, Qingyuan Zhao

Abstract: It is common to compare individualized treatment rules based on the value function, which is the expected potential outcome under the treatment rule. Although the value function is not point-identified when there is unmeasured confounding, it still defines a partial order among the treatment rules under Rosenbaum's sensitivity analysis model. We first consider how to compare two treatment rules wi… ▽ More It is common to compare individualized treatment rules based on the value function, which is the expected potential outcome under the treatment rule. Although the value function is not point-identified when there is unmeasured confounding, it still defines a partial order among the treatment rules under Rosenbaum's sensitivity analysis model. We first consider how to compare two treatment rules with unmeasured confounding in the single-decision setting and then use this pairwise test to rank multiple treatment rules. We consider how to, among many treatment rules, select the best rules and select the rules that are better than a control rule. The proposed methods are illustrated using two real examples, one about the benefit of malaria prevention programs to different age groups and another about the effect of late retirement on senior health in different gender and occupation groups. △ Less

Submitted 24 February, 2020; originally announced February 2020.

Comments: 33 pages, accepted manuscript (by Journal of the American Statistical Association)

arXiv:2002.08457 [pdf, other]

ivmodel: An R Package for Inference and Sensitivity Analysis of Instrumental Variables Models with One Endogenous Variable

Authors: Hyunseung Kang, Yang Jiang, Qingyuan Zhao, Dylan S. Small

Abstract: We present a comprehensive R software ivmodel for analyzing instrumental variables with one endogenous variable. The package implements a general class of estimators called k- class estimators and two confidence intervals that are fully robust to weak instruments. The package also provides power formulas for various test statistics in instrumental variables. Finally, the package contains methods f… ▽ More We present a comprehensive R software ivmodel for analyzing instrumental variables with one endogenous variable. The package implements a general class of estimators called k- class estimators and two confidence intervals that are fully robust to weak instruments. The package also provides power formulas for various test statistics in instrumental variables. Finally, the package contains methods for sensitivity analysis to examine the sensitivity of the inference to instrumental variables assumptions. We demonstrate the software on the data set from Card (1995), looking at the causal effect of levels of education on log earnings where the instrument is proximity to a four-year college. △ Less

Submitted 7 July, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

Comments: 24 pages, 2 figures, 3 tables

arXiv:1911.09171 [pdf, other]

Re-Evaluating Strengthened-IV Designs: Asymptotic Efficiency, Bias Formula, and the Validity and Power of Sensitivity Analyses

Authors: Siyu Heng, Bo Zhang, Xu Han, Scott A. Lorch, Dylan S. Small

Abstract: Instrumental variables (IVs) are extensively used to estimate treatment effects when the treatment and outcome are confounded by unmeasured confounders; however, weak IVs are often encountered in empirical studies and may cause problems. Many studies have considered building a stronger IV from the original, possibly weak, IV in the design stage of a matched study at the cost of not using some of t… ▽ More Instrumental variables (IVs) are extensively used to estimate treatment effects when the treatment and outcome are confounded by unmeasured confounders; however, weak IVs are often encountered in empirical studies and may cause problems. Many studies have considered building a stronger IV from the original, possibly weak, IV in the design stage of a matched study at the cost of not using some of the samples in the analysis. It is widely accepted that strengthening an IV tends to render nonparametric tests more powerful and will increase the power of sensitivity analyses in large samples. In this article, we re-evaluate this conventional wisdom to bring new insights into this topic. We consider matched observational studies from three perspectives. First, we evaluate the trade-off between IV strength and sample size on nonparametric tests assuming the IV is valid and exhibit conditions under which strengthening an IV increases power and conversely conditions under which it decreases power. Second, we derive a necessary condition for a valid sensitivity analysis model with continuous doses. We show that the $Γ$ sensitivity analysis model, which has been previously used to come to the conclusion that strengthening an IV increases the power of sensitivity analyses in large samples, does not apply to the continuous IV setting and thus this previously reached conclusion may be invalid. Third, we quantify the bias of the Wald estimator with a possibly invalid IV under an oracle and leverage it to develop a valid sensitivity analysis framework; under this framework, we show that strengthening an IV may amplify or mitigate the bias of the estimator, and may or may not increase the power of sensitivity analyses. We also discuss how to better adjust for the observed covariates when building an IV in matched studies. △ Less

Submitted 15 October, 2021; v1 submitted 20 November, 2019; originally announced November 2019.

Comments: 86 pages, 4 figures, 6 tables

arXiv:1909.04706 [pdf, other]

Regression to the Mean's Impact on the Synthetic Control Method: Bias and Sensitivity Analysis

Authors: Nicholas Illenberger, Dylan S. Small, Pamela A. Shaw

Abstract: To make informed policy recommendations from observational data, we must be able to discern true treatment effects from random noise and effects due to confounding. Difference-in-Difference techniques which match treated units to control units based on pre-treatment outcomes, such as the synthetic control approach have been presented as principled methods to account for confounding. However, we sh… ▽ More To make informed policy recommendations from observational data, we must be able to discern true treatment effects from random noise and effects due to confounding. Difference-in-Difference techniques which match treated units to control units based on pre-treatment outcomes, such as the synthetic control approach have been presented as principled methods to account for confounding. However, we show that use of synthetic controls or other matching procedures can introduce regression to the mean (RTM) bias into estimates of the average treatment effect on the treated. Through simulations, we show RTM bias can lead to inflated type I error rates as well as decreased power in typical policy evaluation settings. Further, we provide a novel correction for RTM bias which can reduce bias and attain appropriate type I error rates. This correction can be used to perform a sensitivity analysis which determines how results may be affected by RTM. We use our proposed correction and sensitivity analysis to reanalyze data concerning the effects of California's Proposition 99, a large-scale tobacco control program, on statewide smoking rates. △ Less

Submitted 10 September, 2019; originally announced September 2019.

Comments: 15 pages, 4 figures

MSC Class: 62K99

arXiv:1908.09425 [pdf, other]

Estimating Malaria Vaccine Efficacy in the Absence of a Gold Standard Case Definition: Mendelian Factorial Design

Authors: Raiden B. Hasegawa, Dylan S. Small

Abstract: Accurate estimates of malaria vaccine efficacy require a reliable definition of a malaria case. However, the symptoms of clinical malaria are unspecific, overlap** with other childhood illnesses. Additionally, children in endemic areas tolerate varying levels of parasitemia without symptoms. Together, this makes finding a gold-standard case definition challenging. We present a method to identify… ▽ More Accurate estimates of malaria vaccine efficacy require a reliable definition of a malaria case. However, the symptoms of clinical malaria are unspecific, overlap** with other childhood illnesses. Additionally, children in endemic areas tolerate varying levels of parasitemia without symptoms. Together, this makes finding a gold-standard case definition challenging. We present a method to identify and estimate malaria vaccine efficacy that does not require an observable gold-standard case definition. Instead, we leverage genetic traits that are protective against malaria but not against other illnesses, e.g., the sickle cell trait, to identify vaccine efficacy in a randomized trial. Inspired by Mendelian randomization, we introduce Mendelian factorial design, a method that augments a randomized trial with genetic variation to produce a natural factorial experiment, which identifies vaccine efficacy under realistic assumptions. A robust, covariance adjusted estimation procedure is developed for estimating vaccine efficacy on the risk ratio and incidence ratio scales. Simulations suggest that our estimator has good performance whereas standard methods are systematically biased. We demonstrate that a combined estimator using both our proposed estimator and the standard approach yields significant improvements when the Mendelian factor is only weakly protective. △ Less

Submitted 25 August, 2019; originally announced August 2019.

arXiv:1907.06770 [pdf, other]

Increasing Power for Observational Studies of Aberrant Response: An Adaptive Approach

Authors: Siyu Heng, Hyunseung Kang, Dylan S. Small, Colin B. Fogarty

Abstract: In many observational studies, the interest is in the effect of treatment on bad, aberrant outcomes rather than the average outcome. For such settings, the traditional approach is to define a dichotomous outcome indicating aberration from a continuous score and use the Mantel-Haenszel test with matched data. For example, studies of determinants of poor child growth use the World Health Organizatio… ▽ More In many observational studies, the interest is in the effect of treatment on bad, aberrant outcomes rather than the average outcome. For such settings, the traditional approach is to define a dichotomous outcome indicating aberration from a continuous score and use the Mantel-Haenszel test with matched data. For example, studies of determinants of poor child growth use the World Health Organization's definition of child stunting being height-for-age z-score $\leq -2$. The traditional approach may lose power because it discards potentially useful information about the severity of aberration. We develop an adaptive approach that makes use of this information and asymptotically dominates the traditional approach. We develop our approach in two parts. First, we develop an aberrant rank approach in matched observational studies and prove a novel design sensitivity formula enabling its asymptotic comparison with the Mantel-Haenszel test under various settings. Second, we develop a new, general adaptive approach, the two-stage programming method, and use it to adaptively combine the aberrant rank test and the Mantel-Haenszel test. We apply our approach to a study of the effect of teenage pregnancy on stunting. △ Less

Submitted 14 October, 2020; v1 submitted 15 July, 2019; originally announced July 2019.

Comments: 83 pages, 1 figure, 8 tables

arXiv:1905.10808 [pdf, other]

A Test for Differential Ascertainment in Case-Control Studies with Application to Child Maltreatment

Authors: Matteo Sordello, Dylan S. Small

Abstract: We propose a method to test for the presence of differential ascertainment in case-control studies, when data are collected by multiple sources. We show that, when differential ascertainment is present, the use of only the observed cases leads to severe bias in the computation of the odds ratio. We can alleviate the effect of such bias using the estimates that our method of testing for differentia… ▽ More We propose a method to test for the presence of differential ascertainment in case-control studies, when data are collected by multiple sources. We show that, when differential ascertainment is present, the use of only the observed cases leads to severe bias in the computation of the odds ratio. We can alleviate the effect of such bias using the estimates that our method of testing for differential ascertainment naturally provides. We apply it to a dataset obtained from the National Violent Death Reporting System, with the goal of checking for the presence of differential ascertainment by race in the count of deaths caused by child maltreatment. △ Less

Submitted 4 July, 2020; v1 submitted 26 May, 2019; originally announced May 2019.

Comments: 25 pages, 5 figures, 8 tables

MSC Class: 62F03; 62P25

arXiv:1904.12321 [pdf, other]

Nonparametric maximum likelihood estimation under a likelihood ratio order

Authors: Ted Westling, Kevin J. Downes, Dylan S. Small

Abstract: Comparison of two univariate distributions based on independent samples from them is a fundamental problem in statistics, with applications in a wide variety of scientific disciplines. In many situations, we might hypothesize that the two distributions are stochastically ordered, meaning intuitively that samples from one distribution tend to be larger than those from the other. One type of stochas… ▽ More Comparison of two univariate distributions based on independent samples from them is a fundamental problem in statistics, with applications in a wide variety of scientific disciplines. In many situations, we might hypothesize that the two distributions are stochastically ordered, meaning intuitively that samples from one distribution tend to be larger than those from the other. One type of stochastic order that arises in economics, biomedicine, and elsewhere is the likelihood ratio order, also known as the density ratio order, in which the ratio of the density functions of the two distributions is monotone non-decreasing. In this article, we derive and study the nonparametric maximum likelihood estimator of the individual distributions and the ratio of their densities under the likelihood ratio order. Our work applies to discrete distributions, continuous distributions, and mixed continuous-discrete distributions. We demonstrate convergence in distribution of the estimator in certain cases, and we illustrate our results using numerical experiments and an analysis of a biomarker for predicting bacterial infection in children with systemic inflammatory response syndrome. △ Less

Submitted 7 July, 2021; v1 submitted 28 April, 2019; originally announced April 2019.

Comments: Revised paper

arXiv:1904.11430 [pdf, other]

Bracketing in the Comparative Interrupted Time-Series Design to Address Concerns about History Interacting with Group: Evaluating Missouri Handgun Purchaser Law

Authors: Raiden B. Hasegawa, Dylan S. Small, Daniel W Webster

Abstract: In the comparative interrupted time series design (also called the method of difference-in-differences), the change in outcome in a group exposed to treatment in the periods before and after the exposure is compared to the change in outcome in a control group not exposed to treatment in either period. The standard difference-in-difference estimator for a comparative interrupted time series design… ▽ More In the comparative interrupted time series design (also called the method of difference-in-differences), the change in outcome in a group exposed to treatment in the periods before and after the exposure is compared to the change in outcome in a control group not exposed to treatment in either period. The standard difference-in-difference estimator for a comparative interrupted time series design will be biased for estimating the causal effect of the treatment if there is an interaction between history in the after period and the groups; for example, there is a historical event besides the start of the treatment in the after period that benefits the treated group more than the control group. We present a bracketing method for bounding the effect of an interaction between history and the groups that arises from a time-invariant unmeasured confounder having a different effect in the after period than the before period. The method is applied to a study of the effect of the repeal of Missouri's permit-to-purchase handgun law on its firearm homicide rate. We estimate that the effect of the permit-to-purchase repeal on Missouri's firearm homicide rate is bracketed between 0.9 and 1.3 homicides per 100,000 people, corresponding to a percentage increase of 17% to 27% (95% confidence interval: [0.6,1.7] or [11%,35%]). A placebo study provides additional support for the hypothesis that the repeal has a causal effect of increasing the rate of state-wide firearm homicides. △ Less

Submitted 25 April, 2019; originally announced April 2019.

Journal ref: Epidemiology, Volume 30, Issue 3, p.371-379, 2019

arXiv:1902.10106 [pdf, ps, other]

Protocol for an Observational Study of the Association of High School Football Participation on Health in Late Adulthood

Authors: Timothy G. Gaulton, Sameer K. Deshpande, Dylan S. Small, Mark D. Neuman

Abstract: American football is the most popular high school sport and is among the leading cause of injury among adolescents. While there has been considerable recent attention on the link between football and cognitive decline, there is also evidence of higher than expected rates of pain, obesity, and lower quality of life among former professional players, either as a result of repetitive head injury or t… ▽ More American football is the most popular high school sport and is among the leading cause of injury among adolescents. While there has been considerable recent attention on the link between football and cognitive decline, there is also evidence of higher than expected rates of pain, obesity, and lower quality of life among former professional players, either as a result of repetitive head injury or through different mechanisms. Previously hidden downstream effects of playing football may have far-reaching public health implications for participants in youth and high school football programs. Our proposed study is a retrospective observational study that compares 1,153 high school males who played varsity football with 2,751 male students who did not. 1,951 of the control subjects did not play any sport and the remaining 800 controls played a non-contact sport. Our primary outcome is self-rated health measured at age 65. To control for potential confounders, we adjust for pre-exposure covariates with matching and model-based covariance adjustment. We will conduct an ordered testing procedure designed to use the full pool of 2,751 controls while also controlling for possible unmeasured differences between students who played sports and those who did not. We will quantitatively assess the sensitivity of the results to potential unmeasured confounding. The study will also assess secondary outcomes of pain, difficulty with activities of daily living, and obesity, as these are both important to individual well-being and have public health relevance. △ Less

Submitted 26 February, 2019; originally announced February 2019.

arXiv:1901.01869 [pdf, other]

Patterns of Effects and Sensitivity Analysis for Differences-in-Differences

Authors: Luke J. Keele, Dylan S. Small, Jesse Y. Hsu, Colin B. Fogarty

Abstract: Applied analysts often use the differences-in-differences (DID) method to estimate the causal effect of policy interventions with observational data. The method is widely used, as the required before and after comparison of a treated and control group is commonly encountered in practice. DID removes bias from unobserved time-invariant confounders. While DID removes bias from time-invariant confoun… ▽ More Applied analysts often use the differences-in-differences (DID) method to estimate the causal effect of policy interventions with observational data. The method is widely used, as the required before and after comparison of a treated and control group is commonly encountered in practice. DID removes bias from unobserved time-invariant confounders. While DID removes bias from time-invariant confounders, bias from time-varying confounders may be present. Hence, like any observational comparison, DID studies remain susceptible to bias from hidden confounders. Here, we develop a method of sensitivity analysis that allows investigators to quantify the amount of bias necessary to change a study's conclusions. Our method operates within a matched design that removes bias from observed baseline covariates. We develop methods for both binary and continuous outcomes. We then apply our methods to two different empirical examples from the social sciences. In the first application, we study the effect of changes to disability payments in Germany. In the second, we re-examine whether election day registration increased turnout in Wisconsin. △ Less

Submitted 1 February, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

arXiv:1808.03934 [pdf, other]

Protocol for an observational study on the effects of playing football in adolescence on mental health in early adulthood

Authors: Sameer K. Deshpande, Raiden B. Hasegawa, Jordan Weiss, Dylan S. Small

Abstract: More than 1 million students play high school American football annually, but many health professionals have recently questioned its safety or called for its ban. These concerns have been partially driven by reports of chronic traumatic encephalopathy (CTE), increased risks of neurodegenerative disease, and associations between concussion history and later-life cognitive impairment and depression… ▽ More More than 1 million students play high school American football annually, but many health professionals have recently questioned its safety or called for its ban. These concerns have been partially driven by reports of chronic traumatic encephalopathy (CTE), increased risks of neurodegenerative disease, and associations between concussion history and later-life cognitive impairment and depression among retired professional football players. A recent observational study of a cohort of men who graduated from a Wisconsin high school in 1957 found no statistically significant harmful effects of playing high school football on a range of cognitive, psychological, and socio-economic outcomes measured at ages 35, 54, 65, and 72. Unfortunately, these findings may not generalize to younger populations, thanks to changes and improvements in football helmet technology and training techniques. In particular, these changes may have led to increased perceptions of safety but ultimately more dangerous styles of play, characterized by the frequent sub-concussive impacts thought to be associated with later-life neurological decline. In this work, we replicate the methodology of that earlier matched observational study using data from the National Longitudinal Study of Adolescent to Adult Health (Add Health). These include adolescent and family co-morbidities, academic experience, self-reported levels of general health and physical activity, and the score on the Add Health Picture Vocabulary Test. Our primary outcome is the CES-D score measured in 2008 when subjects were aged 24 -- 34 and settling into early adulthood. We also examine several secondary outcomes related to physical and psychological health, including suicidality. Our results can provide insight into the natural history of potential football-related decline and dysfunction. △ Less

Submitted 9 November, 2018; v1 submitted 12 August, 2018; originally announced August 2018.

Comments: Updated tables summarizing the matches constructed

arXiv:1807.10558 [pdf]

Protocol for an Observational Study on the Effects of Early-Life Participation in Contact Sports on Later-Life Cognition in a Sample of Monozygotic and Dizygotic Swedish Twins Reared Together and Twins Reared Apart

Authors: Jordan Weiss, Amanda R. Rabinowitz, Sameer K. Deshpande, Raiden B. Hasegawa, Dylan S. Small

Abstract: A large body of work links traumatic brain injury (TBI) in adulthood to the onset of Alzheimer's disease (AD). AD is the chief cause of dementia, leading to reduced cognitive capacity and autonomy and increased mortality risk. More recently, researchers have sought to investigate whether TBI experienced in early-life may influence trajectories of cognitive dysfunction in adulthood. It has been spe… ▽ More A large body of work links traumatic brain injury (TBI) in adulthood to the onset of Alzheimer's disease (AD). AD is the chief cause of dementia, leading to reduced cognitive capacity and autonomy and increased mortality risk. More recently, researchers have sought to investigate whether TBI experienced in early-life may influence trajectories of cognitive dysfunction in adulthood. It has been speculated that early-life participation in collision sports may lead to poor cognitive and mental health outcomes. However, to date, the few studies to investigate this relationship have produced mixed results. We propose to extend this literature by conducting a prospective study on the effects of early-life participation in collision sports on later-life cognitive health using the Swedish Adoption/Twin Study on Aging (SATSA). The SATSA is unique in its sampling of monozygotic and dizygotic twins reared together (respectively MZT, DZT) and twins reared apart (respectively MZA, DZA). The proposed analysis is a prospective study of 660 individuals comprised of 270 twin pairs and 120 singletons. Seventy-eight (11.8% individuals reported participation in collision sports. Our primary outcome will be an indicator of cognitive impairment determined by scores on the Mini-Mental State Examination (MMSE). We will also consider several secondary cognitive outcomes including verbal and spatial ability, memory, and processing speed. Our sample will be restricted to individuals with at least one MMSE score out of seven repeated assessments spaced approximately three years apart. We will adjust for age, sex, and education in each of our models. △ Less

Submitted 16 April, 2020; v1 submitted 27 July, 2018; originally announced July 2018.

Comments: Updated methodology and tables

arXiv:1804.07371 [pdf, other]

Powerful genome-wide design and robust statistical inference in two-sample summary-data Mendelian randomization

Authors: Qingyuan Zhao, Yang Chen, **gshu Wang, Dylan S. Small

Abstract: Two-sample summary-data Mendelian randomization (MR) has become a popular research design to estimate the causal effect of risk exposures. With the sample size of GWAS continuing to increase, it is now possible to utilize genetic instruments that are only weakly associated with the exposure. To maximize the statistical power of MR, we propose a genome-wide design where more than a thousand genetic… ▽ More Two-sample summary-data Mendelian randomization (MR) has become a popular research design to estimate the causal effect of risk exposures. With the sample size of GWAS continuing to increase, it is now possible to utilize genetic instruments that are only weakly associated with the exposure. To maximize the statistical power of MR, we propose a genome-wide design where more than a thousand genetic instruments are used. For the statistical analysis, we use an empirical partially Bayes approach where instruments are weighted according to their strength, thus weak instruments bring less variation to the estimator. The estimator is highly efficient with many weak genetic instruments and is robust to balanced and/or sparse pleiotropy. We apply our method to estimate the causal effect of body mass index (BMI) and major blood lipids on cardiovascular disease outcomes and obtain substantially shorter confidence intervals. Some new and statistically significant findings are: the estimated causal odds ratio of BMI on ischemic stroke is 1.19 (95% CI: 1.07--1.32, p-value < 0.001); the estimated causal odds ratio of high-density lipoprotein cholesterol (HDL-C) on coronary artery disease (CAD) is 0.78 (95% CI 0.73--0.84, p-value < 0.001). However, the estimated effect of HDL-C becomes substantially smaller and statistically non-significant when we only use the strong instruments. By employing a genome-wide design and robust statistical methods, the statistical power of MR studies can be greatly improved. Our empirical results suggest that, even though the relationship between HDL-C and CAD appears to be highly heterogeneous, it may be too soon to completely dismiss the HDL hypothesis. △ Less

Submitted 16 November, 2018; v1 submitted 19 April, 2018; originally announced April 2018.

Comments: 46 pages

MSC Class: 46N60

arXiv:1802.06711 [pdf, ps, other]

Sensitivity analyses for average treatment effects when outcome is censored by death in instrumental variable models

Authors: Kwonsang Lee, Scott A. Lorch, Dylan S. Small

Abstract: Two problems that arise in making causal inferences for non-mortality outcomes such as bronchopulmonary dysplasia (BPD) are unmeasured confounding and censoring by death, i.e., the outcome is only observed when subjects survive. In randomized experiments with noncompliance, instrumental variable methods can be used to control for the unmeasured confounding without censoring by death. But when ther… ▽ More Two problems that arise in making causal inferences for non-mortality outcomes such as bronchopulmonary dysplasia (BPD) are unmeasured confounding and censoring by death, i.e., the outcome is only observed when subjects survive. In randomized experiments with noncompliance, instrumental variable methods can be used to control for the unmeasured confounding without censoring by death. But when there is censoring by death, the average causal treatment effect cannot be identified under usual assumptions, but can be studied for a specific subpopulation by using sensitivity analysis with additional assumptions. However, in observational studies, evaluation of the local average treatment effect (LATE) in censoring by death problems with unmeasured confounding is not well studied. We develop a novel sensitivity analysis method based on instrumental variable models for studying the LATE. Specifically, we present the identification results under an additional assumption, and propose a three-step procedure for the LATE estimation. Also, we propose an improved two-step procedure by simultaneously estimating the instrument propensity score (i.e., the probability of instrument given covariates) and the parameters induced by the assumption. We have shown with simulation studies that the two-step procedure can be more robust and efficient than the three-step procedure. Finally, we apply our sensitivity analysis methods to a study of the effect of delivery at high-level neonatal intensive care units on the risk of BPD. △ Less

Submitted 19 February, 2018; originally announced February 2018.

arXiv:1802.06710 [pdf, ps, other]

Discovering Effect Modification and Randomization Inference in Air Pollution Studies

Authors: Kwonsang Lee, Dylan S. Small, Francesca Dominici

Abstract: Studies have shown that exposure to air pollution, even at low levels, significantly increases mortality. As regulatory actions are becoming prohibitively expensive, robust evidence to guide the development of targeted interventions to reduce air pollution exposure is needed. In this paper, we introduce a novel statistical method that splits the data into two subsamples: (a) Using the first subsam… ▽ More Studies have shown that exposure to air pollution, even at low levels, significantly increases mortality. As regulatory actions are becoming prohibitively expensive, robust evidence to guide the development of targeted interventions to reduce air pollution exposure is needed. In this paper, we introduce a novel statistical method that splits the data into two subsamples: (a) Using the first subsample, we consider a data-driven search for $\textit{de novo}$ discovery of subgroups that could have exposure effects that differ from the population mean; and then (b) using the second subsample, we quantify evidence of effect modification among the subgroups with nonparametric randomization-based tests. We also develop a sensitivity analysis method to assess the robustness of the conclusions to unmeasured confounding bias. Via simulation studies and theoretical arguments, we demonstrate that since we discover the subgroups in the first subsample, hypothesis testing on the second subsample can focus on theses subgroups only, thus substantially increasing the statistical power of the test. We apply our method to the data of 1,612,414 Medicare beneficiaries in New England region in the United States for the period 2000 to 2006. We find that seniors aged between 81-85 with low income and seniors aged above 85 have statistically significant higher causal effects of exposure to PM$_{2.5}$ on 5-year mortality rate compared to the population mean. △ Less

Submitted 19 February, 2018; originally announced February 2018.

arXiv:1801.09652 [pdf, other]

Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score

Authors: Qingyuan Zhao, **gshu Wang, Gibran Hemani, Jack Bowden, Dylan S. Small

Abstract: Mendelian randomization (MR) is a method of exploiting genetic variation to unbiasedly estimate a causal effect in presence of unmeasured confounding. MR is being widely used in epidemiology and other related areas of population science. In this paper, we study statistical inference in the increasingly popular two-sample summary-data MR design. We show a linear model for the observed associations… ▽ More Mendelian randomization (MR) is a method of exploiting genetic variation to unbiasedly estimate a causal effect in presence of unmeasured confounding. MR is being widely used in epidemiology and other related areas of population science. In this paper, we study statistical inference in the increasingly popular two-sample summary-data MR design. We show a linear model for the observed associations approximately holds in a wide variety of settings when all the genetic variants satisfy the exclusion restriction assumption, or in genetic terms, when there is no pleiotropy. In this scenario, we derive a maximum profile likelihood estimator with provable consistency and asymptotic normality. However, through analyzing real datasets, we find strong evidence of both systematic and idiosyncratic pleiotropy in MR, echoing the omnigenic model of complex traits that is recently proposed in genetics. We model the systematic pleiotropy by a random effects model, where no genetic variant satisfies the exclusion restriction condition exactly. In this case we propose a consistent and asymptotically normal estimator by adjusting the profile score. We then tackle the idiosyncratic pleiotropy by robustifying the adjusted profile score. We demonstrate the robustness and efficiency of the proposed methods using several simulated and real datasets. △ Less

Submitted 1 January, 2019; v1 submitted 29 January, 2018; originally announced January 2018.

Comments: 59 pages, 5 figures, 6 tables

MSC Class: 65J05; 46N60; 62F35

arXiv:1711.11286 [pdf, other]

Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap

Authors: Qingyuan Zhao, Dylan S. Small, Bhaswar B. Bhattacharya

Abstract: To identify the estimand in missing data problems and observational studies, it is common to base the statistical estimation on the "missing at random" and "no unmeasured confounder" assumptions. However, these assumptions are unverifiable using empirical data and pose serious threats to the validity of the qualitative conclusions of the statistical inference. A sensitivity analysis asks how the c… ▽ More To identify the estimand in missing data problems and observational studies, it is common to base the statistical estimation on the "missing at random" and "no unmeasured confounder" assumptions. However, these assumptions are unverifiable using empirical data and pose serious threats to the validity of the qualitative conclusions of the statistical inference. A sensitivity analysis asks how the conclusions may change if the unverifiable assumptions are violated to a certain degree. In this paper we consider a marginal sensitivity model which is a natural extension of Rosenbaum's sensitivity model that is widely used for matched observational studies. We aim to construct confidence intervals based on inverse probability weighting estimators, such that asymptotically the intervals have at least nominal coverage of the estimand whenever the data generating distribution is in the collection of marginal sensitivity models. We use a percentile bootstrap and a generalized minimax/maximin inequality to transform this intractable problem to a linear fractional programming problem, which can be solved very efficiently. We illustrate our method using a real dataset to estimate the causal effect of fish consumption on blood mercury level. △ Less

Submitted 8 October, 2018; v1 submitted 30 November, 2017; originally announced November 2017.

Comments: 32 pages, 1 figure

arXiv:1707.09549 [pdf, ps, other]

doi 10.1111/biom.12688

Sensitivity Analysis for matched pair analysis of binary data: From worst case to average case analysis

Authors: Raiden B. Hasegawa, Dylan S. Small

Abstract: In matched observational studies where treatment assignment is not randomized, sensitivity analysis helps investigators determine how sensitive their estimated treatment effect is to some unmeasured con- founder. The standard approach calibrates the sensitivity analysis according to the worst case bias in a pair. This approach will result in a conservative sensitivity analysis if the worst case bi… ▽ More In matched observational studies where treatment assignment is not randomized, sensitivity analysis helps investigators determine how sensitive their estimated treatment effect is to some unmeasured con- founder. The standard approach calibrates the sensitivity analysis according to the worst case bias in a pair. This approach will result in a conservative sensitivity analysis if the worst case bias does not hold in every pair. In this paper, we show that for binary data, the standard approach can be calibrated in terms of the average bias in a pair rather than worst case bias. When the worst case bias and average bias differ, the average bias interpretation results in a less conservative sensitivity analysis and more power. In many studies, the average case calibration may also carry a more natural interpretation than the worst case calibration and may also allow researchers to incorporate additional data to establish an empirical basis with which to calibrate a sensitivity analysis. We illustrate this with a study of the effects of cellphone use on the incidence of automobile accidents. Finally, we extend the average case calibration to the sensitivity analysis of confidence intervals for attributable effects. △ Less

Submitted 16 May, 2018; v1 submitted 29 July, 2017; originally announced July 2017.

Comments: minor corrections/clarifications made

Journal ref: Biometrics, Volume 73, Issue 4, p.1424-1432, 2017

arXiv:1705.08020 [pdf, other]

Selective inference for effect modification via the lasso

Authors: Qingyuan Zhao, Dylan S. Small, Ashkan Ertefaie

Abstract: Effect modification occurs when the effect of the treatment on an outcome varies according to the level of other covariates and often has important implications in decision making. When there are tens or hundreds of covariates, it becomes necessary to use the observed data to select a simpler model for effect modification and then make valid statistical inference. We propose a two stage procedure… ▽ More Effect modification occurs when the effect of the treatment on an outcome varies according to the level of other covariates and often has important implications in decision making. When there are tens or hundreds of covariates, it becomes necessary to use the observed data to select a simpler model for effect modification and then make valid statistical inference. We propose a two stage procedure to solve this problem. First, we use Robinson's transformation to decouple the nuisance parameters from the treatment effect of interest and use machine learning algorithms to estimate the nuisance parameters. Next, after plugging in the estimates of the nuisance parameters, we use the Lasso to choose a low-complexity model for effect modification. Compared to a full model consisting of all the covariates, the selected model is much more interpretable. Compared to the univariate subgroup analyses, the selected model greatly reduces the number of false discoveries. We show that the conditional selective inference for the selected model is asymptotically valid given the rate assumptions in classical semiparametric regression. Extensive simulation studies are conducted to verify the asymptotic results and an epidemiological application is used to demonstrate the method. △ Less

Submitted 19 November, 2021; v1 submitted 22 May, 2017; originally announced May 2017.

Comments: Accepted manuscript. To appear in the Journal of the Royal Statistical Society: Series B (Statistical Methodology)

arXiv:1705.03918 [pdf, other]

Causal Inference with Two Versions of Treatment

Authors: Raiden B. Hasegawa, Sameer K. Deshpande, Dylan S. Small, Paul R. Rosenbaum

Abstract: Causal effects are commonly defined as comparisons of the potential outcomes under treatment and control, but this definition is threatened by the possibility that the treatment or control condition is not well-defined, existing instead in more than one version. A simple, widely applicable analysis is proposed to address the possibility that the treatment or control condition exists in two version… ▽ More Causal effects are commonly defined as comparisons of the potential outcomes under treatment and control, but this definition is threatened by the possibility that the treatment or control condition is not well-defined, existing instead in more than one version. A simple, widely applicable analysis is proposed to address the possibility that the treatment or control condition exists in two versions with two different treatment effects. This analysis loses no power in the main comparison of treatment and control, provides additional information about version effects, and controls the family-wise error rate in several comparisons. The method is motivated and illustrated using an on-going study of the possibility that repeated head trauma in high school football causes an increase in risk of early on-set dementia. △ Less

Submitted 24 April, 2019; v1 submitted 10 May, 2017; originally announced May 2017.

arXiv:1705.00506 [pdf, ps, other]

Paradoxes in instrumental variable studies with missing data and one-sided noncompliance

Authors: Edward H. Kennedy, Dylan S. Small

Abstract: It is common in instrumental variable studies for instrument values to be missing, for example when the instrument is a genetic test in Mendelian randomization studies. In this paper we discuss two apparent paradoxes that arise in so-called single consent designs where there is one-sided noncompliance, i.e., where unencouraged units cannot access treatment. The first paradox is that, even under a… ▽ More It is common in instrumental variable studies for instrument values to be missing, for example when the instrument is a genetic test in Mendelian randomization studies. In this paper we discuss two apparent paradoxes that arise in so-called single consent designs where there is one-sided noncompliance, i.e., where unencouraged units cannot access treatment. The first paradox is that, even under a missing completely at random assumption, a complete-case analysis is biased when knowledge of one-sided noncompliance is taken into account; this is not the case when such information is disregarded. This occurs because incorporating information about one-sided noncompliance induces a dependence between the missingness and treatment. The second paradox is that, although incorporating such information does not lead to efficiency gains without missing data, the story is different when instrument values are missing: there, incorporating such information changes the efficiency bound, allowing possible efficiency gains. This is because some of the missing values can be filled in, based on the fact that anyone who received treatment must have been encouraged by the instrument (since the unencouraged cannot access treatment). △ Less

Submitted 17 April, 2018; v1 submitted 1 May, 2017; originally announced May 2017.

arXiv:1703.09787 [pdf, other]

Multiple testing when many $p$-values are uniformly conservative, with application to testing qualitative interaction in educational interventions

Authors: Qingyuan Zhao, Dylan S. Small, Weijie Su

Abstract: In the evaluation of treatment effects, it is of major policy interest to know if the treatment is beneficial for some and harmful for others, a phenomenon known as qualitative interaction. We formulate this question as a multiple testing problem with many conservative null $p$-values, in which the classical multiple testing methods may lose power substantially. We propose a simple technique---con… ▽ More In the evaluation of treatment effects, it is of major policy interest to know if the treatment is beneficial for some and harmful for others, a phenomenon known as qualitative interaction. We formulate this question as a multiple testing problem with many conservative null $p$-values, in which the classical multiple testing methods may lose power substantially. We propose a simple technique---conditioning---to improve the power. A crucial assumption we need is uniform conservativeness, meaning for any conservative $p$-value $p$, the conditional distribution $(p/τ)\,|\,p \le τ$ is stochastically larger than the uniform distribution on $(0,1)$ for any $τ$. We show this property holds for one-sided tests in a one-dimensional exponential family (e.g.\ testing for qualitative interaction) as well as testing $|μ|\leη$ using a statistic $X \sim \mathrm{N}(μ,1)$ (e.g.\ testing for practical importance with threshold $η$). We propose an adaptive method to select the threshold $τ$. Our theoretical and simulation results suggest the proposed tests gain significant power when many $p$-values are uniformly conservative and lose little power when no $p$-value is uniformly conservative. We apply our method to two educational intervention datasets. △ Less

Submitted 26 August, 2017; v1 submitted 28 March, 2017; originally announced March 2017.

Comments: 31 pages, 2 figure, 6 tables

arXiv:1703.02078 [pdf, ps, other]

Cross-screening in observational studies that test many hypotheses

Authors: Qingyuan Zhao, Dylan S. Small, Paul R. Rosenbaum

Abstract: We discuss observational studies that test many causal hypotheses, either hypotheses about many outcomes or many treatments. To be credible an observational study that tests many causal hypotheses must demonstrate that its conclusions are neither artifacts of multiple testing nor of small biases from nonrandom treatment assignment. In a sense that needs to be defined carefully, hidden within a sen… ▽ More We discuss observational studies that test many causal hypotheses, either hypotheses about many outcomes or many treatments. To be credible an observational study that tests many causal hypotheses must demonstrate that its conclusions are neither artifacts of multiple testing nor of small biases from nonrandom treatment assignment. In a sense that needs to be defined carefully, hidden within a sensitivity analysis for nonrandom assignment is an enormous correction for multiple testing: in the absence of bias, it is extremely improbable that multiple testing alone would create an association insensitive to moderate biases. We propose a new strategy called "cross-screening", different from but motivated by recent work of Bogomolov and Heller on replicability. Cross-screening splits the data in half at random, uses the first half to plan a study carried out on the second half, then uses the second half to plan a study carried out on the first half, and reports the more favorable conclusions of the two studies correcting using the Bonferroni inequality for having done two studies. If the two studies happen to concur, then they achieve Bogomolov-Heller replicability; however, importantly, replicability is not required for strong control of the family-wise error rate, and either study alone suffices for firm conclusions. In randomized studies with a few hypotheses, cross-split screening is not an attractive method when compared with conventional methods of multiplicity control, but it can become attractive when hundreds or thousands of hypotheses are subjected to sensitivity analyses in an observational study. We illustrate the technique by comparing 46 biomarkers in individuals who consume large quantities of fish versus little or no fish. △ Less

Submitted 6 March, 2017; originally announced March 2017.

Comments: 33 pages, 2 figures, 5 tables

arXiv:1702.00525 [pdf, ps, other]

A powerful approach to the study of moderate effect modification in observational studies

Authors: Kwonsang Lee, Dylan S. Small, Paul R. Rosenbaum

Abstract: Effect modification means the magnitude or stability of a treatment effect varies as a function of an observed covariate. Generally, larger and more stable treatment effects are insensitive to larger biases from unmeasured covariates, so a causal conclusion may be considerably firmer if this pattern is noted if it occurs. We propose a new strategy, called the submax-method, that combines explorato… ▽ More Effect modification means the magnitude or stability of a treatment effect varies as a function of an observed covariate. Generally, larger and more stable treatment effects are insensitive to larger biases from unmeasured covariates, so a causal conclusion may be considerably firmer if this pattern is noted if it occurs. We propose a new strategy, called the submax-method, that combines exploratory and confirmatory efforts to determine whether there is stronger evidence of causality - that is, greater insensitivity to unmeasured confounding - in some subgroups of individuals. It uses the joint distribution of test statistics that split the data in various ways based on certain observed covariates. For $L$ binary covariates, the method splits the population $L$ times into two subpopulations, perhaps first men and women, perhaps then smokers and nonsmokers, computing a test statistic from each subpopulation, and appends the test statistic for the whole population, making $2L+1$ test statistics in total. Although $L$ binary covariates define $2^{L}$ interaction groups, only $2L+1$ tests are performed, and at least $L+1$ of these tests use at least half of the data. The submax-method achieves the highest design sensitivity and the highest Bahadur efficiency of its component tests. Moreover, the form of the test is sufficiently tractable that its large sample power may be studied analytically. The simulation suggests that the submax method exhibits superior performance, in comparison with an approach using CART, when there is effect modification of moderate size. Using data from the NHANES I Epidemiologic Follow-Up Survey, an observational study of the effects of physical activity on survival is used to illustrate the method. The method is implemented in the $\texttt{R}$ package $\texttt{submax}$ which contains the NHANES example. △ Less

Submitted 9 March, 2018; v1 submitted 1 February, 2017; originally announced February 2017.

arXiv:1609.03686 [pdf, other]

New multivariate tests for assessing covariate balance in matched observational studies

Authors: Hao Chen, Dylan S. Small

Abstract: We propose new tests for assessing whether covariates in a treatment group and matched control group are balanced in observational studies. The tests exhibit high power under a wide range of multivariate alternatives, some of which existing tests have little power for. The asymptotic permutation null distributions of the proposed tests are studied and the p-values calculated through the asymptotic… ▽ More We propose new tests for assessing whether covariates in a treatment group and matched control group are balanced in observational studies. The tests exhibit high power under a wide range of multivariate alternatives, some of which existing tests have little power for. The asymptotic permutation null distributions of the proposed tests are studied and the p-values calculated through the asymptotic results work well in finite samples, facilitating the application of the test to large data sets. The tests are illustrated in a study of the effect of smoking on blood lead levels. The proposed tests are implemented in an R package BalanceCheck. △ Less

Submitted 27 February, 2019; v1 submitted 13 September, 2016; originally announced September 2016.

arXiv:1607.02566 [pdf, other]

Robust causal inference with continuous instruments using the local instrumental variable curve

Authors: Edward H. Kennedy, Scott A. Lorch, Dylan S. Small

Abstract: Instrumental variables are commonly used to estimate effects of a treatment afflicted by unmeasured confounding, and in practice instruments are often continuous (e.g., measures of distance, or treatment preference). However, available methods for continuous instruments have important limitations: they either require restrictive parametric assumptions for identification, or else rely on modeling b… ▽ More Instrumental variables are commonly used to estimate effects of a treatment afflicted by unmeasured confounding, and in practice instruments are often continuous (e.g., measures of distance, or treatment preference). However, available methods for continuous instruments have important limitations: they either require restrictive parametric assumptions for identification, or else rely on modeling both the outcome and treatment process well (and require modeling effect modification by all adjustment covariates). In this work we develop the first semiparametric doubly robust estimators of the local instrumental variable effect curve, i.e., the effect among those who would take treatment for instrument values above some threshold and not below. In addition to being robust to misspecification of either the instrument or treatment/outcome processes, our approach also incorporates information about the instrument mechanism and allows for flexible data-adaptive estimation of effect modification. We discuss asymptotic properties under weak conditions, and use the methods to study infant mortality effects of neonatal intensive care units with high versus low technical capacity, using travel time as an instrument. △ Less

Submitted 4 July, 2018; v1 submitted 9 July, 2016; originally announced July 2016.

arXiv:1607.01756 [pdf, other]

Protocol for an Observational Study on the Effects of Playing High School Football on Later Life Cognitive Functioning and Mental Health

Authors: Sameer K. Deshpande, Raiden B. Hasegawa, Amanda R. Rabinowitz, John Whyte, Carol L. Roan, Andrew Tabatabaei, Michael Baiocchi, Jason H. Karlawish, Christina L. Master, Dylan S. Small

Abstract: A potential causal relationship between head injuries sustained by NFL players and later-life neurological decline may have broad implications for participants in youth and high school football programs. However, brain trauma risk at the professional level may be different than that at the youth and high school levels and the long-term effects of participation at these levels is as-yet unclear. To… ▽ More A potential causal relationship between head injuries sustained by NFL players and later-life neurological decline may have broad implications for participants in youth and high school football programs. However, brain trauma risk at the professional level may be different than that at the youth and high school levels and the long-term effects of participation at these levels is as-yet unclear. To investigate the effect of playing high school football on later life depression and cognitive functioning, we propose a retrospective observational study using data from the Wisconsin Longitudinal Study (WLS) of graduates from Wisconsin high schools in 1957. We compare 1,153 high school males who played varsity football to 2,751 male students who did not. 1,951 of the control subjects did not play any sport and the remaining 800 controls played a non-contact sport. We focus on two primary outcomes measured at age 65: a composite cognitive outcome measuring verbal fluency and memory and the modified CES-D depression score. To control for potential confounders we adjust for pre-exposure covariates such as IQ with matching and model-based covariate adjustment. We will conduct an ordered testing procedure that uses all 2,751 controls while controlling for possible unmeasured differences between students who played sports and those who did not. We will quantitatively assess the sensitivity of the results to potential unmeasured confounding. The study will also consider several secondary outcomes of clinical interest such as aggression and heavy drinking. The rich set of pre-exposure variables, relatively unbiased sampling, and longitudinal nature of the WLS dataset make the proposed analysis unique among related studies that rely primarily on convenience samples of football players with reported neurological symptoms. △ Less

Submitted 6 July, 2016; originally announced July 2016.

Comments: Prior to performing the proposed analysis, we will register this pre-analysis plan on clincialtrials.gov

arXiv:1605.07663 [pdf, ps, other]

Estimating the Malaria Attributable Fever Fraction Accounting for Parasites Being Killed by Fever and Measurement Error

Authors: Kwonsang Lee, Dylan S. Small

Abstract: Malaria is a parasitic disease that is a major health problem in many tropical regions. The most characteristic symptom of malaria is fever. The fraction of fevers that are attributable to malaria, the malaria attributable fever fraction (MAFF), is an important public health measure for assessing the effect of malaria control programs and other purposes. Estimating the MAFF is not straightforward… ▽ More Malaria is a parasitic disease that is a major health problem in many tropical regions. The most characteristic symptom of malaria is fever. The fraction of fevers that are attributable to malaria, the malaria attributable fever fraction (MAFF), is an important public health measure for assessing the effect of malaria control programs and other purposes. Estimating the MAFF is not straightforward because there is no gold standard diagnosis of a malaria attributable fever; an individual can have malaria parasites in her blood and a fever, but the individual may have developed partial immunity that allows her to tolerate the parasites and the fever is being caused by another infection. We define the MAFF using the potential outcome framework for causal inference and show what assumptions underlie current estimation methods. Current estimation methods rely on an assumption that the parasite density is correctly measured. However, this assumption does not generally hold because (i) fever kills some parasites and (ii) the measurement of parasite density has measurement error. In the presence of these problems, we show current estimation methods do not perform well. We propose a novel maximum likelihood estimation method based on exponential family g-modeling. Under the assumption that the measurement error mechanism and the magnitude of the fever killing effect are known, we show that our proposed method provides approximately unbiased estimates of the MAFF in simulation studies. A sensitivity analysis can be used to assess the impact of different magnitudes of fever killing and different measurement error mechanisms. We apply our proposed method to estimate the MAFF in Kilombero, Tanzania. △ Less

Submitted 24 May, 2016; originally announced May 2016.

Comments: 39 pages, 5 figures

Showing 1–50 of 73 results for author: Small, D S