Showing 1–14 of 14 results for author: Carroll, R J

  1. arXiv:2201.10208  [pdf, other]

    stat.ME math.ST stat.ML

    Semi-Supervised Quantile Estimation: Robust and Efficient Inference in High Dimensional Settings

    Authors: Abhishek Chakrabortty, Guorong Dai, Raymond J. Carroll

    Abstract: We consider quantile estimation in a semi-supervised setting, characterized by two available data sets: (i) a small or moderate sized labeled data set containing observations for a response and a set of possibly high dimensional covariates, and (ii) a much larger unlabeled data set where only the covariates are observed. We propose a family of semi-supervised estimators for the response quantile(s…

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 31 pages, 6 tables. arXiv admin note: text overlap with arXiv:2201.00468

  2. arXiv:2107.07257  [pdf, other]

    stat.ME math.ST

    Nonparametric, tuning-free estimation of S-shaped functions

    Authors: Oliver Y. Feng, Yining Chen, Qiyang Han, Raymond J. Carroll, Richard J. Samworth

    Abstract: We consider the nonparametric estimation of an S-shaped regression function. The least squares estimator provides a very natural, tuning-free approach, but results in a non-convex optimisation problem, since the inflection point is unknown. We show that the estimator may nevertheless be regarded as a projection onto a finite union of convex cones, which allows us to propose a mixed primal-dual bas…

    Submitted 15 July, 2021; originally announced July 2021.

    Comments: 79 pages, 10 figures

  3. arXiv:2103.12846  [pdf, ps, other]

    math.ST

    On the global identifiability of logistic regression models with misclassified outcomes

    Authors: Rui Duan, Yang Ning, Jiasheng Shi, Raymond J Carroll, Tianxi Cai, Yong Chen

    Abstract: In the last decade, the secondary use of large data from health systems, such as electronic health records, has demonstrated great promise in advancing biomedical discoveries and improving clinical decision making. However, there is an increasing concern about biases in association studies caused by misclassification in the binary outcomes derived from electronic health records. We revisit the cla…

    Submitted 23 March, 2021; originally announced March 2021.

  4. arXiv:1910.06235  [pdf, other]

    math.ST

    Gaussian Processes with Errors in Variables: Theory and Computation

    Authors: Shuang Zhou, Debdeep Pati, Tianying Wang, Yun Yang, Raymond J. Carroll

    Abstract: Covariate measurement error in nonparametric regression is a common problem in nutritional epidemiology and geostatistics, and other fields. Over the last two decades, this problem has received substantial attention in the frequentist literature. Bayesian approaches for handling measurement error have only been explored recently and are surprisingly successful, although the lack of a proper theore…

    Submitted 26 January, 2023; v1 submitted 14 October, 2019; originally announced October 2019.

  5. arXiv:1804.00793  [pdf, other]

    math.ST

    A spline-assisted semiparametric approach to non-parametric measurement error models

    Authors: Fei Jiang, Yanyuan Ma, Raymond J. Carroll

    Abstract: It is well known that the minimax rates of convergence of nonparametric density and regression function estimation of a random variable measured with error is much slower than the rate in the error free case. Surprisingly, we show that if one is willing to impose a relatively mild assumption in requiring that the error-prone variable has a compact support, then the results can be greatly improved.…

    Submitted 19 August, 2019; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: 30 pages

  6. arXiv:1610.00667  [pdf, ps, other]

    stat.ML math.ST

    Data Integration with High Dimensionality

    Authors: Xin Gao, Raymond J. Carroll

    Abstract: We consider a problem of data integration. Consider determining which genes affect a disease. The genes, which we call predictor objects, can be measured in different experiments on the same individual. We address the question of finding which genes are predictors of disease by any of the experiments. Our formulation is more general. In a given data set, there are a fixed number of responses for e…

    Submitted 3 October, 2016; originally announced October 2016.

  7. Estimation and inference in generalized additive coefficient models for nonlinear interactions with high-dimensional covariates

    Authors: Shujie Ma, Raymond J. Carroll, Hua Liang, Shizhong Xu

    Abstract: In the low-dimensional case, the generalized additive coefficient model (GACM) proposed by Xue and Yang [Statist. Sinica 16 (2006) 1423-1446] has been demonstrated to be a powerful tool for studying nonlinear interaction effects of variables. In this paper, we propose estimation and inference procedures for the GACM when the dimension of the variables is high. Specifically, we propose a groupwise…

    Submitted 14 October, 2015; originally announced October 2015.

    Comments: Published at http://dx.doi.org/10.1214/15-AOS1344 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1344

    Journal ref: Annals of Statistics 2015, Vol. 43, No. 5, 2102-2131

  8. Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier

    Authors: Raymond J. Carroll, Aurore Delaigle, Peter Hall

    Abstract: The data functions that are studied in the course of functional data analysis are assembled from discrete data, and the level of smoothing that is used is generally that which is appropriate for accurate approximation of the conceptually smooth functions that were not actually observed. Existing literature shows that this approach is effective, and even optimal, when using functional data methods…

    Submitted 18 December, 2013; originally announced December 2013.

    Comments: Published at http://dx.doi.org/10.1214/13-AOS1158 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS1158

    Journal ref: Annals of Statistics 2013, Vol. 41, No. 6, 2739-2767

  9. arXiv:1308.5427  [pdf, ps, other]

    math.ST

    Adaptive Posterior Convergence Rates in Bayesian Density Deconvolution with Supersmooth Errors

    Authors: Abhra Sarkar, Debdeep Pati, Bani K. Mallick, Raymond J. Carroll

    Abstract: Bayesian density deconvolution using nonparametric prior distributions is a useful alternative to the frequentist kernel based deconvolution estimators due to its potentially wide range of applicability, straightforward uncertainty quantification and generalizability to more sophisticated models. This article is the first substantive effort to theoretically quantify the behavior of the posterior i…

    Submitted 9 September, 2013; v1 submitted 25 August, 2013; originally announced August 2013.

  10. Estimation and variable selection for generalized additive partial linear models

    Authors: Li Wang, Xiang Liu, Hua Liang, Raymond J. Carroll

    Abstract: We study generalized additive partial linear models, proposing the use of polynomial spline smoothing for estimation of nonparametric functions, and deriving quasi-likelihood based estimators for the linear parameters. We establish asymptotic normality for the estimators of the parametric components. The procedure avoids solving large systems of equations as in kernel-based procedures and thus res…

    Submitted 12 December, 2011; originally announced December 2011.

    Comments: Published at http://dx.doi.org/10.1214/11-AOS885 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS885

    Journal ref: Annals of Statistics 2011, Vol. 39, No. 4, 1827-1851

  11. A Bayesian Approach to Detection of Small Low Emission Sources

    Authors: Xiaolei Xun, Bani Mallick, Raymond J. Carroll, Peter Kuchment

    Abstract: The article addresses the problem of detecting presence and location of a small low emission source inside of an object, when the background noise dominates. This problem arises, for instance, in some homeland security applications. The goal is to reach the signal-to-noise ratio (SNR) levels on the order of $10^{-3}$. A Bayesian approach to this problem is implemented in 2D. The method allows infe…

    Submitted 14 July, 2011; originally announced July 2011.

    MSC Class: 65C60; 82Dxx

    Journal ref: Inverse Problems 27 (2011), 115009 (11pp)

  12. Estimation of population-level summaries in general semiparametric repeated measures regression models

    Authors: Arnab Maity, Tatiyana V. Apanasovich, Raymond J. Carroll

    Abstract: This paper considers a wide family of semiparametric repeated measures regression models, in which the main interest is on estimating population-level quantities such as mean, variance, probabilities etc. Examples of our framework include generalized linear models for clustered/longitudinal data, among many others. We derive plug-in kernel-based estimators of the population level quantities and…

    Submitted 15 May, 2008; originally announced May 2008.

    Comments: Published at http://dx.doi.org/10.1214/193940307000000095 in the IMS Collections (http://www.imstat.org/publications/imscollections.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-COLL1-IMSCOLL110

    MSC Class: 62G08; 62J02 (Primary) 62J12 (Secondary)

    Journal ref: IMS Collections 2008, Vol. 1, 123-137

  13. Nonparametric estimation of correlation functions in longitudinal and spatial data, with application to colon carcinogenesis experiments

    Authors: Yehua Li, Naisyin Wang, Meeyoung Hong, Nancy D. Turner, Joanne R. Lupton, Raymond J. Carroll

    Abstract: In longitudinal and spatial studies, observations often demonstrate strong correlations that are stationary in time or distance lags, and the times or locations of these data being sampled may not be homogeneous. We propose a nonparametric estimator of the correlation function in such data, using kernel methods. We develop a pointwise asymptotic normal distribution for the proposed estimator, wh…

    Submitted 19 October, 2007; originally announced October 2007.

    Comments: Published at http://dx.doi.org/10.1214/009053607000000082 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0240

    MSC Class: 62M10; 91B72; 62G08 (Primary)

    Journal ref: Annals of Statistics 2007, Vol. 35, No. 4, 1608-1643

  14. Discussion: Conditional growth charts

    Authors: Raymond J. Carroll, David Ruppert

    Abstract: Discussion of Conditional growth charts [math.ST/0702634]

    Submitted 22 February, 2007; originally announced February 2007.

    Comments: Published at http://dx.doi.org/10.1214/009053606000000641 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0102B

    Journal ref: Annals of Statistics 2006, Vol. 34, No. 5, 2098-2104