Skip to main content

Showing 1–15 of 15 results for author: Chan, K C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.08873  [pdf, ps, other

    stat.ME

    Balancing Method for Non-monotone Missing Data

    Authors: Jianing Dong, Raymond K. W. Wong, Kwun Chuen Gary Chan

    Abstract: Covariate balancing methods have been widely applied to single or monotone missing patterns and have certain advantages over likelihood-based methods and inverse probability weighting approaches based on standard logistic regression. In this paper, we consider non-monotone missing data under the complete-case missing variable condition (CCMV), which is a case of missing not at random (MNAR). Using… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  2. arXiv:2311.04871  [pdf, other

    stat.ME

    Integration of Summary Information from External Studies for Semiparametric Models

    Authors: Jianxuan Zang, K. C. G. Chan, Fei Gao

    Abstract: With the development of biomedical science, researchers have increasing access to an abundance of studies focusing on similar research questions. There is a growing interest in the integration of summary information from those studies to enhance the efficiency of estimation in their own internal studies. In this work, we present a comprehensive framework on integration of summary information from… ▽ More

    Submitted 9 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

  3. arXiv:2309.08039  [pdf, other

    stat.ME math.ST

    Flexible Functional Treatment Effect Estimation

    Authors: Jiayi Wang, Raymond K. W. Wong, Xiaoke Zhang, Kwun Chuen Gary Chan

    Abstract: We study treatment effect estimation with functional treatments where the average potential outcome functional is a function of functions, in contrast to continuous treatment effect estimation where the target is a function of real numbers. By considering a flexible scalar-on-function marginal structural model, a weight-modified kernel ridge regression (WMKRR) is adopted for estimation. The weight… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  4. arXiv:2303.11388  [pdf, other

    stat.ME

    An Effective Multivariate Normality Test via Hessians of Empirical Cumulant Generating Functions

    Authors: Kwun Chuen Gary Chan, Hok Kan Ling, Chuan-Fa Tang, Sheung Chi Phillip Yam

    Abstract: In this article, we propose a new class of consistent tests for $p$-variate normality. These tests are based on the characterization of the standard multivariate normal distribution, that the Hessian of the corresponding cumulant generating function is identical to the $p\times p$ identity matrix and the idea of decomposing the information from the joint distribution into the dependence copula and… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

  5. arXiv:2110.06077  [pdf, other

    stat.ME stat.AP

    Data Harmonization Via Regularized Nonparametric Mixing Distribution Estimation

    Authors: Steven Wilkins-Reeves, Yen-Chi Chen, Kwun Chuen Gary Chan

    Abstract: Data harmonization is the process by which an equivalence is developed between two variables measuring a common trait. Our problem is motivated by dementia research in which multiple tests are used in practice to measure the same underlying cognitive ability such as language or memory. We connect this statistical problem to mixing distribution estimation. We introduce and study a non-parametric la… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: 46 pages, 15 figures

    MSC Class: 62G05

  6. arXiv:2106.05850  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Matrix Completion with Model-free Weighting

    Authors: Jiayi Wang, Raymond K. W. Wong, Xiaojun Mao, Kwun Chuen Gary Chan

    Abstract: In this paper, we propose a novel method for matrix completion under general non-uniform missing structures. By controlling an upper bound of a novel balancing error, we construct weights that can actively adjust for the non-uniformity in the empirical risk without explicitly modeling the observation probabilities, and can be computed efficiently via convex optimization. The recovered matrix based… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

    Comments: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021

  7. arXiv:2103.03437  [pdf, other

    stat.ME

    Estimation of Partially Conditional Average Treatment Effect by Hybrid Kernel-covariate Balancing

    Authors: Jiayi Wang, Raymond K. W. Wong, Shu Yang, Kwun Chuen Gary Chan

    Abstract: We study nonparametric estimation for the partially conditional average treatment effect, defined as the treatment effect function over an interested subset of confounders. We propose a hybrid kernel weighting estimator where the weights aim to control the balancing error of any function of the confounders from a reproducing kernel Hilbert space after kernel smoothing over the subset of interested… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 19 pages, 2 figures

  8. arXiv:2010.00061  [pdf, other

    stat.ME

    Defining and Estimating Subgroup Mediation Effects with Semi-Competing Risks Data

    Authors: Fei Gao, Fan Xia, Kwun Chuen Gary Chan

    Abstract: In many medical studies, an ultimate failure event such as death is likely to be affected by the occurrence and timing of other intermediate clinical events. Both event times are subject to censoring by loss-to-follow-up but the nonterminal event may further be censored by the occurrence of the primary outcome, but not vice versa. To study the effect of an intervention on both events, the intermed… ▽ More

    Submitted 15 January, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

  9. arXiv:2006.11408  [pdf, other

    cs.CV cs.LG stat.ML

    Quasi-conformal Geometry based Local Deformation Analysis of Lateral Cephalogram for Childhood OSA Classification

    Authors: Hei-Long Chan, Hoi-Man Yuen, Chun-Ting Au, Kate Ching-Ching Chan, Albert Martin Li, Lok-Ming Lui

    Abstract: Craniofacial profile is one of the anatomical causes of obstructive sleep apnea(OSA). By medical research, cephalometry provides information on patients' skeletal structures and soft tissues. In this work, a novel approach to cephalometric analysis using quasi-conformal geometry based local deformation information was proposed for OSA classification. Our study was a retrospective analysis based on… ▽ More

    Submitted 31 May, 2020; originally announced June 2020.

  10. arXiv:1807.00931  [pdf, other

    stat.ME

    Controlling the False Discovery Rate for Binary Feature Selection via Knockoff

    Authors: Yuxiang Xie, Kwun Chuen Gary Chan

    Abstract: Variable selection has been widely used in data analysis for the past decades, and it becomes increasingly important in the Big Data era as there are usually hundreds of variables available in a dataset. To enhance interpretability of a model, identifying potentially relevant features is often a step before fitting all the features into a regression model. A good variable selection method should e… ▽ More

    Submitted 13 August, 2020; v1 submitted 2 July, 2018; originally announced July 2018.

    MSC Class: 62

  11. arXiv:1601.03501  [pdf, ps, other

    stat.ME

    Efficient nonparametric estimation of causal mediation effects

    Authors: K. C. G. Chan, K. Imai, S. C. P. Yam, Z. Zhang

    Abstract: An essential goal of program evaluation and scientific research is the investigation of causal mechanisms. Over the past several decades, causal mediation analysis has been used in medical and social sciences to decompose the treatment effect into the natural direct and indirect effects. However, all of the existing mediation analysis methods rely on parametric modeling assumptions in one way or a… ▽ More

    Submitted 14 January, 2016; originally announced January 2016.

    Comments: Nonparametric Estimation, Natural direct effects, Natural indirect effects, Treatment effects, Semiparametric efficiency

    MSC Class: 62G05

  12. Oracle, Multiple Robust and Multipurpose Calibration in a Missing Response Problem

    Authors: Kwun Chuen Gary Chan, Sheung Chi Phillip Yam

    Abstract: In the presence of a missing response, reweighting the complete case subsample by the inverse of nonmissing probability is both intuitive and easy to implement. When the population totals of some auxiliary variables are known and when the inclusion probabilities are known by design, survey statisticians have developed calibration methods for improving efficiencies of the inverse probability weight… ▽ More

    Submitted 15 October, 2014; originally announced October 2014.

    Comments: Published in at http://dx.doi.org/10.1214/13-STS461 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS461

    Journal ref: Statistical Science 2014, Vol. 29, No. 3, 380-396

  13. arXiv:1403.7812  [pdf, ps, other

    stat.ME

    Marginalizable conditional model for clustered ordinal data

    Authors: Rui Zhang, Kwun Chuen Gary Chan

    Abstract: We introduce a flexible parametric mixed effects model for correlated binary data, with parameters that can be directly interpreted as marginal odds ratios. This leads to a robust estimation equation with an optimal weighting matrix being the inverse of a genuine model-based covariance matrix. Flexible correlation structures can be imposed by correlated random effects, and correlation parameters c… ▽ More

    Submitted 30 March, 2014; originally announced March 2014.

    Comments: 23 pages

  14. arXiv:1403.6744  [pdf, ps, other

    stat.ME

    A marginalizable frailty model for correlated right-censored data

    Authors: Rui Zhang Kwun Chuen Gary Chan

    Abstract: We introduce a flexible individual frailty model for clustered right-censored data, in which covariate effects can be marginally interpreted as log failure odds ratios. Flexible correlation structures can be imposed by introducing multivariate exponential distributed frailties, constructed from a set of multivariate Gaussian random variables. Finite and infinite dimensional parameters are consiste… ▽ More

    Submitted 26 March, 2014; originally announced March 2014.

    Comments: 46 pages, 3 tables

  15. Backward estimation of stochastic processes with failure events as time origins

    Authors: Kwun Chuen Gary Chan, Mei-Cheng Wang

    Abstract: Stochastic processes often exhibit sudden systematic changes in pattern a short time before certain failure events. Examples include increase in medical costs before death and decrease in CD4 counts before AIDS diagnosis. To study such terminal behavior of stochastic processes, a natural and direct way is to align the processes using failure events as time origins. This paper studies backward stoc… ▽ More

    Submitted 16 November, 2010; originally announced November 2010.

    Comments: Published in at http://dx.doi.org/10.1214/09-AOAS319 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS319

    Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 3, 1602-1620