-
Doubly robust estimation and inference for a log-concave counterfactual density
Authors:
Daeyoung Ham,
Ted Westling,
Charles R. Doss
Abstract:
We consider the problem of causal inference based on observational data (or the related missing data problem) with a binary or discrete treatment variable. In that context we study counterfactual density estimation, which provides more nuanced information than counterfactual mean estimation (i.e., the average treatment effect). We impose the shape-constraint of log-concavity (a unimodality constra…
▽ More
We consider the problem of causal inference based on observational data (or the related missing data problem) with a binary or discrete treatment variable. In that context we study counterfactual density estimation, which provides more nuanced information than counterfactual mean estimation (i.e., the average treatment effect). We impose the shape-constraint of log-concavity (a unimodality constraint) on the counterfactual densities, and then develop doubly robust estimators of the log-concave counterfactual density (based on an augmented inverse-probability weighted pseudo-outcome), and show the consistency in various global metrics of that estimator. Based on that estimator we also develop asymptotically valid pointwise confidence intervals for the counterfactual density.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
A nonparametric doubly robust test for a continuous treatment effect
Authors:
Charles R. Doss,
Guangwei Weng,
Lan Wang,
Ira Moscovice,
Tongtan Chantarat
Abstract:
The vast majority of literature on evaluating the significance of a treatment effect based on observational data has been confined to discrete treatments. These methods are not applicable to drawing inference for a continuous treatment, which arises in many important applications. To adjust for confounders when evaluating a continuous treatment, existing inference methods often rely on discretizin…
▽ More
The vast majority of literature on evaluating the significance of a treatment effect based on observational data has been confined to discrete treatments. These methods are not applicable to drawing inference for a continuous treatment, which arises in many important applications. To adjust for confounders when evaluating a continuous treatment, existing inference methods often rely on discretizing the treatment or using (possibly misspecified) parametric models for the effect curve. Recently, Kennedy et al. (2017) proposed nonparametric doubly robust estimation for a continuous treatment effect in observational studies. However, inference for the continuous treatment effect is a harder problem. To the best of our knowledge, a completely nonparametric doubly robust approach for inference in this setting is not yet available. We develop such a nonparametric doubly robust procedure in this paper for making inference on the continuous treatment effect curve. Using empirical process techniques for local U- and V-processes, we establish the test statistic's asymptotic distribution. Furthermore, we propose a wild bootstrap procedure for implementing the test in practice. We illustrate the new method via simulations and a study of a constructed dataset relating the effect of nurse staffing hours on hospital performance. We implement our doubly robust dose response test in the R package DRDRtest on CRAN.
△ Less
Submitted 22 May, 2023; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Unlinked monotone regression
Authors:
Fadoua Balabdaoui,
Charles R. Doss,
Cécile Durot
Abstract:
We consider so-called univariate unlinked (sometimes ``decoupled,'' or ``shuffled'') regression when the unknown regression curve is monotone. In standard monotone regression, one observes a pair $(X,Y)$ where a response $Y$ is linked to a covariate $X$ through the model $Y= m_0(X) + ε$, with $m_0$ the (unknown) monotone regression function and $ε$ the unobserved error (assumed to be independent o…
▽ More
We consider so-called univariate unlinked (sometimes ``decoupled,'' or ``shuffled'') regression when the unknown regression curve is monotone. In standard monotone regression, one observes a pair $(X,Y)$ where a response $Y$ is linked to a covariate $X$ through the model $Y= m_0(X) + ε$, with $m_0$ the (unknown) monotone regression function and $ε$ the unobserved error (assumed to be independent of $X$). In the unlinked regression setting one gets only to observe a vector of realizations from both the response $Y$ and from the covariate $X$ where now $Y \stackrel{d}{=} m_0(X) + ε$. There is no (observed) pairing of $X$ and $Y$. Despite this, it is actually still possible to derive a consistent non-parametric estimator of $m_0$ under the assumption of monotonicity of $m_0$ and knowledge of the distribution of the noise $ε$. In this paper, we establish an upper bound on the rate of convergence of such an estimator under minimal assumption on the distribution of the covariate $X$. We discuss extensions to the case in which the distribution of the noise is unknown. We develop a second order algorithm for its computation, and we demonstrate its use on synthetic data. Finally, we apply our method (in a fully data driven way, without knowledge of the error distribution) on longitudinal data from the US Consumer Expenditure Survey.
△ Less
Submitted 28 July, 2021; v1 submitted 1 July, 2020;
originally announced July 2020.
-
An explicit mean-covariance parameterization for multivariate response linear regression
Authors:
Aaron J. Molstad,
Guangwei Weng,
Charles R. Doss,
Adam J. Rothman
Abstract:
We develop a new method to fit the multivariate response linear regression model that exploits a parametric link between the regression coefficient matrix and the error covariance matrix. Specifically, we assume that the correlations between entries in the multivariate error random vector are proportional to the cosines of the angles between their corresponding regression coefficient matrix column…
▽ More
We develop a new method to fit the multivariate response linear regression model that exploits a parametric link between the regression coefficient matrix and the error covariance matrix. Specifically, we assume that the correlations between entries in the multivariate error random vector are proportional to the cosines of the angles between their corresponding regression coefficient matrix columns, so as the angle between two regression coefficient matrix columns decreases, the correlation between the corresponding errors increases. We highlight two models under which this parameterization arises: the latent variable reduced-rank regression model and the errors-in-variables regression model. We propose a novel non-convex weighted residual sum of squares criterion which exploits this parameterization and admits a new class of penalized estimators. The optimization is solved with an accelerated proximal gradient descent algorithm. Our method is used to study the association between microRNA expression and cancer drug activity measured on the NCI-60 cell lines. An R package implementing our method, MCMVR, is available at github.com/ajmolstad/MCMVR.
△ Less
Submitted 8 December, 2021; v1 submitted 30 August, 2018;
originally announced August 2018.
-
Bandwidth selection for kernel density estimators of multivariate level sets and highest density regions
Authors:
Charles R. Doss,
Guangwei Weng
Abstract:
We consider bandwidth matrix selection for kernel density estimators (KDEs) of density level sets in $\mathbb{R}^d$, $d \ge 2$. We also consider estimation of highest density regions, which differs from estimating level sets in that one specifies the probability content of the set rather than specifying the level directly. This complicates the problem. Bandwidth selection for KDEs is well studied,…
▽ More
We consider bandwidth matrix selection for kernel density estimators (KDEs) of density level sets in $\mathbb{R}^d$, $d \ge 2$. We also consider estimation of highest density regions, which differs from estimating level sets in that one specifies the probability content of the set rather than specifying the level directly. This complicates the problem. Bandwidth selection for KDEs is well studied, but the goal of most methods is to minimize a global loss function for the density or its derivatives. The loss we consider here is instead the measure of the symmetric difference of the true set and estimated set. We derive an asymptotic approximation to the corresponding risk. The approximation depends on unknown quantities which can be estimated, and the approximation can then be minimized to yield a choice of bandwidth, which we show in simulations performs well. We provide an R package lsbs for implementing our procedure.
△ Less
Submitted 24 October, 2018; v1 submitted 2 June, 2018;
originally announced June 2018.
-
Concave regression: value-constrained estimation and likelihood ratio-based inference
Authors:
Charles R. Doss
Abstract:
We propose a likelihood ratio statistic for forming hypothesis tests and confidence intervals for a nonparametrically estimated univariate regression function, based on the shape restriction of concavity (alternatively, convexity). Dealing with the likelihood ratio statistic requires studying an estimator satisfying a null hypothesis, that is, studying a concave least-squares estimator satisfying…
▽ More
We propose a likelihood ratio statistic for forming hypothesis tests and confidence intervals for a nonparametrically estimated univariate regression function, based on the shape restriction of concavity (alternatively, convexity). Dealing with the likelihood ratio statistic requires studying an estimator satisfying a null hypothesis, that is, studying a concave least-squares estimator satisfying a further equality constraint. We study this null hypothesis least-squares estimator (NLSE) here, and use it to study our likelihood ratio statistic. The NLSE is the solution to a convex program, and we find a set of inequality and equality constraints that characterize the solution. We also study a corresponding limiting version of the convex program based on observing a Brownian motion with drift. The solution to the limit problem is a stochastic process. We study the optimality conditions for the solution to the limit problem and find that they match those we derived for the solution to the finite sample problem. This allows us to show the limit stochastic process yields the limit distribution of the (finite sample) NLSE. We conjecture that the likelihood ratio statistic is asymptotically pivotal, meaning that it has a limit distribution with no nuisance parameters to be estimated, which makes it a very effective tool for this difficult inference problem. We provide a partial proof of this conjecture, and we also provide simulation evidence strongly supporting this conjecture.
△ Less
Submitted 10 September, 2018; v1 submitted 24 May, 2018;
originally announced May 2018.
-
Inference for the mode of a log-concave density
Authors:
Charles R. Doss,
Jon A. Wellner
Abstract:
We study a likelihood ratio test for the location of the mode of a log-concave density. Our test is based on comparison of the log-likelihoods corresponding to the unconstrained maximum likelihood estimator of a log-concave density and the constrained maximum likelihood estimator where the constraint is that the mode of the density is fixed, say at $m$. The constrained estimation problem is studie…
▽ More
We study a likelihood ratio test for the location of the mode of a log-concave density. Our test is based on comparison of the log-likelihoods corresponding to the unconstrained maximum likelihood estimator of a log-concave density and the constrained maximum likelihood estimator where the constraint is that the mode of the density is fixed, say at $m$. The constrained estimation problem is studied in detail in Doss and Wellner [2018]. Here the results of that paper are used to show that, under the null hypothesis (and strict curvature of $-\log f$ at the mode), the likelihood ratio statistic is asymptotically pivotal: that is, it converges in distribution to a limiting distribution which is free of nuisance parameters, thus playing the role of the $χ_1^2$ distribution in classical parametric statistical problems. By inverting this family of tests we obtain new (likelihood ratio based) confidence intervals for the mode of a log-concave density $f$. These new intervals do not depend on any smoothing parameters. We study the new confidence intervals via Monte Carlo methods and illustrate them with two real data sets. The new intervals seem to have several advantages over existing procedures. Software implementing the test and confidence intervals is available in the R package \verb+logcondens.mode+.
△ Less
Submitted 4 June, 2018; v1 submitted 30 November, 2016;
originally announced November 2016.
-
Univariate log-concave density estimation with symmetry or modal constraints
Authors:
Charles R. Doss,
Jon A. Wellner
Abstract:
We study nonparametric maximum likelihood estimation of a log-concave density function $f_0$ which is known to satisfy further constraints, where either (a) the mode $m$ of $f_0$ is known, or (b) $f_0$ is known to be symmetric about a fixed point $m$. We develop asymptotic theory for both constrained log-concave maximum likelihood estimators (MLE's), including consistency, global rates of converge…
▽ More
We study nonparametric maximum likelihood estimation of a log-concave density function $f_0$ which is known to satisfy further constraints, where either (a) the mode $m$ of $f_0$ is known, or (b) $f_0$ is known to be symmetric about a fixed point $m$. We develop asymptotic theory for both constrained log-concave maximum likelihood estimators (MLE's), including consistency, global rates of convergence, and local limit distribution theory. In both cases, we find the MLE's pointwise limit distribution at $m$ (either the known mode or the known center of symmetry) and at a point $x_0 \ne m$. Software to compute the constrained estimators is available in the R package \verb+logcondens.mode+.
The symmetry-constrained MLE is particularly useful in contexts of location estimation. The mode-constrained MLE is useful for mode-regression. The mode-constrained MLE can also be used to form a likelihood ratio test for the location of the mode of $f_0$. These problems are studied in separate papers. In particular, in a separate paper we show that, under a curvature assumption, the likelihood ratio statistic for the location of the mode can be used for hypothesis tests or confidence intervals that do not depend on either tuning parameters or nuisance parameters.
△ Less
Submitted 13 May, 2019; v1 submitted 30 November, 2016;
originally announced November 2016.
-
Bracketing numbers of convex and $m$-monotone functions on polytopes
Authors:
Charles R. Doss
Abstract:
We study bracketing covering numbers for spaces of bounded convex functions in the $L_p$ norms. Bracketing numbers are crucial quantities for understanding asymptotic behavior for many statistical nonparametric estimators. Bracketing number upper bounds in the supremum distance are known for bounded classes that also have a fixed Lipschitz constraint. However, in most settings of interest, the cla…
▽ More
We study bracketing covering numbers for spaces of bounded convex functions in the $L_p$ norms. Bracketing numbers are crucial quantities for understanding asymptotic behavior for many statistical nonparametric estimators. Bracketing number upper bounds in the supremum distance are known for bounded classes that also have a fixed Lipschitz constraint. However, in most settings of interest, the classes that arise do not include Lipschitz constraints, and so standard techniques based on known bracketing numbers cannot be used. In this paper, we find upper bounds for bracketing numbers of classes of convex functions without Lipschitz constraints on arbitrary polytopes. Our results are of particular interest in many multidimensional estimation problems based on convexity shape constraints.
Additionally, we show other applications of our proof methods; in particular we define a new class of multivariate functions, the so-called $m$-monotone functions. Such functions have been considered mathematically and statistically in the univariate case but never in the multivariate case. We show how our proof for convex bracketing upper bounds also applies to the $m$-monotone case.
△ Less
Submitted 14 April, 2020; v1 submitted 29 May, 2015;
originally announced June 2015.
-
Inference for a Two-Component Mixture of Symmetric Distributions under Log-Concavity
Authors:
Fadoua Balabdaoui,
Charles R. Doss
Abstract:
In this article, we revisit the problem of estimating the unknown zero-symmetric distribution in a two-component location mixture model, considered in previous works, now under the assumption that the zero-symmetric distribution has a log-concave density. When consistent estimators for the shift locations and mixing probability are used, we show that the nonparametric log-concave Maximum Likelihoo…
▽ More
In this article, we revisit the problem of estimating the unknown zero-symmetric distribution in a two-component location mixture model, considered in previous works, now under the assumption that the zero-symmetric distribution has a log-concave density. When consistent estimators for the shift locations and mixing probability are used, we show that the nonparametric log-concave Maximum Likelihood estimator (MLE) of both the mixed density and that of the unknown zero-symmetric component are consistent in the Hellinger distance. In case the estimators for the shift locations and mixing probability are $\sqrt n$-consistent, we establish that these MLE's converge to the truth at the rate $n^{-2/5}$ in the $L_1$ distance. To estimate the shift locations and mixing probability, we use the estimators proposed by \cite{hunteretal2007}. The unknown zero-symmetric density is efficiently computed using the \proglang{R} package \pkg{logcondens.mode}.
△ Less
Submitted 5 May, 2016; v1 submitted 17 November, 2014;
originally announced November 2014.
-
Global Rates of Convergence of the MLEs of Log-concave and s-concave Densities
Authors:
Charles R. Doss,
Jon A. Wellner
Abstract:
We establish global rates of convergence for the Maximum Likelihood Estimators (MLEs) of log-concave and $s$-concave densities on $\mathbb{R}$. The main finding is that the rate of convergence of the MLE in the Hellinger metric is no worse than $n^{-2/5}$ when $-1 < s < \infty$ where $s=0$ corresponds to the log-concave case. We also show that the MLE does not exist for the classes of $s$-concave…
▽ More
We establish global rates of convergence for the Maximum Likelihood Estimators (MLEs) of log-concave and $s$-concave densities on $\mathbb{R}$. The main finding is that the rate of convergence of the MLE in the Hellinger metric is no worse than $n^{-2/5}$ when $-1 < s < \infty$ where $s=0$ corresponds to the log-concave case. We also show that the MLE does not exist for the classes of $s$-concave densities with $s < - 1$.
△ Less
Submitted 15 September, 2015; v1 submitted 6 June, 2013;
originally announced June 2013.
-
Fitting birth-death processes to panel data with applications to bacterial DNA fingerprinting
Authors:
Charles R. Doss,
Marc A. Suchard,
Ian Holmes,
Midori Kato-Maeda,
Vladimir N. Minin
Abstract:
Continuous-time linear birth-death-immigration (BDI) processes are frequently used in ecology and epidemiology to model stochastic dynamics of the population of interest. In clinical settings, multiple birth-death processes can describe disease trajectories of individual patients, allowing for estimation of the effects of individual covariates on the birth and death rates of the process. Such esti…
▽ More
Continuous-time linear birth-death-immigration (BDI) processes are frequently used in ecology and epidemiology to model stochastic dynamics of the population of interest. In clinical settings, multiple birth-death processes can describe disease trajectories of individual patients, allowing for estimation of the effects of individual covariates on the birth and death rates of the process. Such estimation is usually accomplished by analyzing patient data collected at unevenly spaced time points, referred to as panel data in the biostatistics literature. Fitting linear BDI processes to panel data is a nontrivial optimization problem because birth and death rates can be functions of many parameters related to the covariates of interest. We propose a novel expectation--maximization (EM) algorithm for fitting linear BDI models with covariates to panel data. We derive a closed-form expression for the joint generating function of some of the BDI process statistics and use this generating function to reduce the E-step of the EM algorithm, as well as calculation of the Fisher information, to one-dimensional integration. This analytical technique yields a computationally efficient and robust optimization algorithm that we implemented in an open-source R package. We apply our method to DNA fingerprinting of Mycobacterium tuberculosis, the causative agent of tuberculosis, to study intrapatient time evolution of IS6110 copy number, a genetic marker frequently used during estimation of epidemiological clusters of Mycobacterium tuberculosis infections. Our analysis reveals previously undocumented differences in IS6110 birth-death rates among three major lineages of Mycobacterium tuberculosis, which has important implications for epidemiologists that use IS6110 for DNA fingerprinting of Mycobacterium tuberculosis.
△ Less
Submitted 10 January, 2014; v1 submitted 5 September, 2010;
originally announced September 2010.