
Showing 1–50 of 145 results for author: Zhu, Z

Searching in archive stat.
  1. arXiv:2405.17591  [pdf, other

    stat.ME

    Individualized Dynamic Mediation Analysis Using Latent Factor Models

    Authors: Yijiao Zhang, Yubai Yuan, Yuexia Zhang, Zhongyi Zhu, Annie Qu

    Abstract: Mediation analysis plays a crucial role in causal inference as it can investigate the pathways through which treatment influences outcome. Most existing mediation analysis assumes that mediation effects are static and homogeneous within populations. However, mediation effects usually change over time and exhibit significant heterogeneity in many real-world applications. Additionally, the presence… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 25 pages, 3 figures, 3 tables

  2. arXiv:2405.15325  [pdf, other

    cs.LG stat.ML

    On the Identification of Temporally Causal Representation with Instantaneous Dependence

    Authors: Zijian Li, Yifan Shen, Kaitao Zheng, Ruichu Cai, Xiangchen Song, Mingming Gong, Zhengmao Zhu, Guangyi Chen, Kun Zhang

    Abstract: Temporally causal representation learning aims to identify the latent causal process from time series observations, but most methods require the assumption that the latent causal processes do not have instantaneous relations. Although some recent methods achieve identifiability in the instantaneous causality case, they require either interventions on the latent variables or grouping of the observa… ▽ More

    Submitted 7 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2405.03042  [pdf, other

    stat.ME stat.AP stat.CO

    Functional Post-Clustering Selective Inference with Applications to EHR Data Analysis

    Authors: Zihan Zhu, Xin Gai, Anru R. Zhang

    Abstract: In electronic health records (EHR) analysis, clustering patients according to patterns in their data is crucial for uncovering new subtypes of diseases. Existing medical literature often relies on classical hypothesis testing methods to test for differences in means between these clusters. Due to selection bias induced by clustering algorithms, the implementation of these classical methods on post… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  4. arXiv:2405.02539  [pdf, ps, other

    stat.ME

    Distributed Iterative Hard Thresholding for Variable Selection in Tobit Models

    Authors: Changxin Yang, Zhongyi Zhu, Heng Lian

    Abstract: While extensive research has been conducted on high-dimensional data and on regression with left-censored responses, simultaneously addressing these complexities remains challenging, with only a few proposed methods available. In this paper, we utilize the Iterative Hard Thresholding (IHT) algorithm on the Tobit model in such a setting. Theoretical analysis demonstrates that our estimator converge… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  5. arXiv:2405.02372  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Triadic-OCD: Asynchronous Online Change Detection with Provable Robustness, Optimality, and Convergence

    Authors: Yancheng Huang, Kai Yang, Zelin Zhu, Leian Chen

    Abstract: The primary goal of online change detection (OCD) is to promptly identify changes in the data stream. OCD problems find a wide variety of applications in diverse areas, e.g., security detection in smart grids and intrusion detection in communication networks. Prior research usually assumes precise knowledge of the system parameters. Nevertheless, this presumption often proves unattainable in practi… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML2024

  6. arXiv:2404.11579  [pdf, other

    stat.ME

    Spatial Heterogeneous Additive Partial Linear Model: A Joint Approach of Bivariate Spline and Forest Lasso

    Authors: Xin Zhang, Shan Yu, Zhengyuan Zhu, Xin Wang

    Abstract: Identifying spatial heterogeneous patterns has attracted a surge of research interest in recent years, due to its important applications in various scientific and engineering fields. In practice the spatially heterogeneous components are often mixed with components which are spatially smooth, making the task of identifying the heterogeneous regions more challenging. In this paper, we develop an ef… ▽ More

    Submitted 3 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2404.04317  [pdf, other

    stat.ML cs.LG q-bio.QM

    DeepLINK-T: deep learning inference for time series data using knockoffs and LSTM

    Authors: Wenxuan Zuo, Zifan Zhu, Yuxuan Du, Yi-Chun Yeh, Jed A. Fuhrman, Jinchi Lv, Yingying Fan, Fengzhu Sun

    Abstract: High-dimensional longitudinal time series data is prevalent across various real-world applications. Many such applications can be modeled as regression problems with high-dimensional time series covariates. Deep learning has been a popular and powerful tool for fitting these regression models. Yet, the development of interpretable and reproducible deep-learning models is challenging and remains un… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  8. arXiv:2404.03764  [pdf, other

    cs.LG stat.ME stat.ML

    CONCERT: Covariate-Elaborated Robust Local Information Transfer with Conditional Spike-and-Slab Prior

    Authors: Ruqian Zhang, Yijiao Zhang, Annie Qu, Zhongyi Zhu, Juan Shen

    Abstract: The popularity of transfer learning stems from the fact that it can borrow information from useful auxiliary datasets. Existing statistical transfer learning methods usually adopt a global similarity measure between the source data and the target data, which may lead to inefficiency when only local information is shared. In this paper, we propose a novel Bayesian transfer learning method named "CO… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 31 pages, 22 figures

  9. arXiv:2401.15811  [pdf, other

    stat.ME cs.IR

    Seller-Side Experiments under Interference Induced by Feedback Loops in Two-Sided Platforms

    Authors: Zhihua Zhu, Zheng Cai, Liang Zheng, Nian Si

    Abstract: Two-sided platforms are central to modern commerce and content sharing and often utilize A/B testing for developing new features. While user-side experiments are common, seller-side experiments become crucial for specific interventions and metrics. This paper investigates the effects of interference caused by feedback loops on seller-side experiments in two-sided platforms, with a particular focus… ▽ More

    Submitted 9 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  10. arXiv:2401.02592  [pdf, other

    stat.ML cs.LG eess.SP math.OC

    Guaranteed Nonconvex Factorization Approach for Tensor Train Recovery

    Authors: Zhen Qin, Michael B. Wakin, Zhihui Zhu

    Abstract: In this paper, we provide the first convergence guarantee for the factorization approach. Specifically, to avoid the scaling ambiguity and to facilitate theoretical analysis, we optimize over the so-called left-orthogonal TT format which enforces orthonormality among most of the factors. To ensure the orthonormal structure, we utilize the Riemannian gradient descent (RGD) for optimizing those fact… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  11. arXiv:2312.14534  [pdf, other

    stat.ME stat.AP

    Global Rank Sum Test: An Efficient Rank-Based Nonparametric Test for Large Scale Online Experiment

    Authors: Zheng Cai, Bo Hu, Zhihua Zhu

    Abstract: Online experiments are widely used for improving online services. In online experiments, the Student's t-test is the most widely used hypothesis testing technique. In practice, however, the normality assumption on which the t-test depends may fail, resulting in untrustworthy results. In this paper, we first discuss the question of when the t-test fails, and thus introduce the rank-… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 9 pages, 3 figures

  12. arXiv:2310.20579  [pdf, other

    stat.ML cs.CR cs.LG

    Initialization Matters: Privacy-Utility Analysis of Overparameterized Neural Networks

    Authors: Jiayuan Ye, Zhenyu Zhu, Fanghui Liu, Reza Shokri, Volkan Cevher

    Abstract: We analytically investigate how over-parameterization of models in randomized machine learning algorithms impacts the information leakage about their training data. Specifically, we prove a privacy bound for the KL divergence between model distributions on worst-case neighboring datasets, and explore its dependence on the initialization, width, and depth of fully connected neural networks. We find… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  13. arXiv:2310.18123  [pdf, ps, other

    cs.LG stat.ML

    Sample Complexity Bounds for Score-Matching: Causal Discovery and Generative Modeling

    Authors: Zhenyu Zhu, Francesco Locatello, Volkan Cevher

    Abstract: This paper provides statistical sample complexity bounds for score-matching and its applications in causal discovery. We demonstrate that accurate estimation of the score function is achievable by training a standard deep ReLU neural network using stochastic gradient descent. We establish bounds on the error rate of recovering causal relationships using the score-matching-based causal discovery me… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted in NeurIPS 2023

  14. arXiv:2310.09426  [pdf, other

    cs.LG stat.ML

    Offline Reinforcement Learning for Optimizing Production Bidding Policies

    Authors: Dmytro Korenkevych, Frank Cheng, Artsiom Balakir, Alex Nikulkov, Lingnan Gao, Zhihao Cen, Zuobing Xu, Zheqing Zhu

    Abstract: The online advertising market, with its thousands of auctions run per second, presents a daunting challenge for advertisers who wish to optimize their spend under a budget constraint. Thus, advertising platforms typically provide automated agents to their customers, which act on their behalf to bid for impression opportunities in real time at scale. Because these proxy agents are owned by the plat… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  15. arXiv:2309.02530  [pdf, other

    cs.LG stat.ML

    Diffusion on the Probability Simplex

    Authors: Griffin Floto, Thorsteinn Jonsson, Mihai Nica, Scott Sanner, Eric Zhengyu Zhu

    Abstract: Diffusion models learn to reverse the progressive noising of a data distribution to create a generative model. However, the desired continuous nature of the noising process can be at odds with discrete data. To deal with this tension between continuous and discrete objects, we propose a method of performing diffusion on the probability simplex. Using the probability simplex naturally creates an in… ▽ More

    Submitted 11 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

  16. arXiv:2307.07918  [pdf, ps, other

    stat.ME

    A Data Fusion Method for Quantile Treatment Effects

    Authors: Yijiao Zhang, Zhongyi Zhu

    Abstract: With the increasing availability of datasets, developing data fusion methods to leverage the strengths of different datasets to draw causal effects is of great practical importance to many scientific fields. In this paper, we consider estimating the quantile treatment effects using small validation data with fully-observed confounders and large auxiliary data with unmeasured confounders. We propos… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: 42 pages, 1 figure

  17. arXiv:2302.01075  [pdf, other

    stat.ML cs.LG

    MonoFlow: Rethinking Divergence GANs via the Perspective of Wasserstein Gradient Flows

    Authors: Mingxuan Yi, Zhanxing Zhu, Song Liu

    Abstract: The conventional understanding of adversarial training in generative adversarial networks (GANs) is that the discriminator is trained to estimate a divergence, and the generator learns to minimize this divergence. We argue that despite the fact that many variants of GANs were developed following this paradigm, the current theoretical understanding of GANs and their practical algorithms are inconsi… ▽ More

    Submitted 8 August, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  18. arXiv:2301.11697  [pdf, other

    stat.ML cs.LG

    Big portfolio selection by graph-based conditional moments method

    Authors: Zhoufan Zhu, Ningning Zhang, Ke Zhu

    Abstract: How to do big portfolio selection is very important but challenging for both researchers and practitioners. In this paper, we propose a new graph-based conditional moments (GRACE) method to do portfolio selection based on thousands of stocks or more. The GRACE method first learns the conditional quantiles and mean of stock returns via a factor-augmented temporal graph convolutional network, which… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: 35 pages

    MSC Class: 62M10; 68T07; 91G10

  19. arXiv:2301.06259  [pdf, other

    math.ST stat.ML

    Understanding Best Subset Selection: A Tale of Two C(omplex)ities

    Authors: Saptarshi Roy, Ambuj Tewari, Ziwei Zhu

    Abstract: For decades, best subset selection (BSS) has eluded statisticians mainly due to its computational bottleneck. However, until recently, modern computational breakthroughs have rekindled theoretical interest in BSS and have led to new findings. Recently, \cite{guo2020best} showed that the model selection performance of BSS is governed by a margin quantity that is robust to the design dependence, unl… ▽ More

    Submitted 17 July, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: 46 pages, 2 Figures

  20. arXiv:2301.01091  [pdf, other

    econ.EM stat.ME

    Fitting mixed logit random regret minimization models using maximum simulated likelihood

    Authors: Ziyue Zhu, Álvaro A. Gutiérrez-Vargas, Martina Vandebroek

    Abstract: This article describes the mixrandregret command, which extends the randregret command introduced in Gutiérrez-Vargas et al. (2021, The Stata Journal 21: 626-658) incorporating random coefficients for Random Regret Minimization models. The newly developed command mixrandregret allows the inclusion of random coefficients in the regret function of the classical RRM model introduced in Chorus (2010,… ▽ More

    Submitted 3 January, 2023; originally announced January 2023.

  21. arXiv:2212.12206  [pdf, other

    cs.LG cs.AI cs.CV eess.IV stat.ML

    Principled and Efficient Transfer Learning of Deep Models via Neural Collapse

    Authors: Xiao Li, Sheng Liu, Jinxin Zhou, Xinyu Lu, Carlos Fernandez-Granda, Zhihui Zhu, Qing Qu

    Abstract: As model size continues to grow and access to labeled training data remains limited, transfer learning has become a popular approach in many scientific and engineering fields. This study explores the phenomenon of neural collapse (NC) in transfer learning for classification problems, which is characterized by the last-layer features and classifiers of deep networks having zero within-class variabi… ▽ More

    Submitted 26 February, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: First two authors contributed equally, 29 pages, 14 figures, and 7 tables

  22. arXiv:2212.00428  [pdf, other

    stat.ME

    Transfer Learning for High-dimensional Quantile Regression via Convolution Smoothing

    Authors: Yijiao Zhang, Zhongyi Zhu

    Abstract: This paper studies the high-dimensional quantile regression problem under the transfer learning framework, where possibly related source datasets are available to make improvements on the estimation or prediction based solely on the target data. In the oracle case with known transferable sources, a smoothed two-step transfer learning algorithm based on convolution smoothing is proposed and the L1/… ▽ More

    Submitted 1 May, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 27 pages, 6 figures

  23. arXiv:2210.02192  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    Are All Losses Created Equal: A Neural Collapse Perspective

    Authors: Jinxin Zhou, Chong You, Xiao Li, Kangning Liu, Sheng Liu, Qing Qu, Zhihui Zhu

    Abstract: While cross entropy (CE) is the most commonly used loss to train deep neural networks for classification tasks, many alternative losses have been developed to obtain better empirical performance. Among them, which one is the best to use is still a mystery, because there seem to be multiple factors affecting the answer, such as properties of the dataset, the choice of network architecture, and so o… ▽ More

    Submitted 8 October, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 32 pages, 10 figures, NeurIPS 2022

  24. arXiv:2209.10675  [pdf, other

    math.OC cs.LG eess.IV stat.ML

    A Validation Approach to Over-parameterized Matrix and Image Recovery

    Authors: Lijun Ding, Zhen Qin, Liwei Jiang, Jinxin Zhou, Zhihui Zhu

    Abstract: In this paper, we study the problem of recovering a low-rank matrix from a number of noisy random linear measurements. We consider the setting where the rank of the ground-truth matrix is unknown a priori and use an overspecified factored representation of the matrix variable, where the global optimal solutions overfit and do not correspond to the underlying ground-truth. We then solve the associat… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: 29 pages and 9 figures

  25. arXiv:2209.09211  [pdf, other

    cs.LG cs.CV cs.IT eess.SP stat.ML

    Neural Collapse with Normalized Features: A Geometric Analysis over the Riemannian Manifold

    Authors: Can Yaras, Peng Wang, Zhihui Zhu, Laura Balzano, Qing Qu

    Abstract: When training overparameterized deep networks for classification tasks, it has been widely observed that the learned features exhibit a so-called "neural collapse" phenomenon. More specifically, for the output features of the penultimate layer, for each class the within-class features converge to their means, and the means of different classes exhibit a certain tight frame structure, which is also… ▽ More

    Submitted 7 March, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: The first two authors contributed to this work equally; 38 pages, 13 figures. Accepted at NeurIPS'22

  26. arXiv:2207.09098  [pdf, other

    stat.ME math.ST

    ReBoot: Distributed statistical learning via refitting bootstrap samples

    Authors: Yumeng Wang, Ziwei Zhu, Xuming He

    Abstract: In this paper, we propose a one-shot distributed learning algorithm via refitting bootstrap samples, which we refer to as ReBoot. ReBoot refits a new model to mini-batches of bootstrap samples that are continuously drawn from each of the locally fitted models. It requires only one round of communication of model parameters without much memory. Theoretically, we analyze the statistical error rate o… ▽ More

    Submitted 7 May, 2024; v1 submitted 19 July, 2022; originally announced July 2022.

  27. arXiv:2206.06463  [pdf, other

    physics.med-ph stat.AP

    A statistical reconstruction algorithm for positronium lifetime imaging using time-of-flight positron emission tomography

    Authors: Hsin-Hsiung Huang, Zheyuan Zhu, Slun Booppasiri, Zhuo Chen, Shuo Pang, Chien-Min Kao

    Abstract: Positron emission tomography (PET) is an important modality for diagnosing diseases such as cancer and Alzheimer's disease, capable of revealing the uptake of radiolabeled molecules that target specific pathological markers of the diseases. Recently, positronium lifetime imaging (PLI) that adds to traditional PET the ability to explore properties of the tissue microenvironment beyond tracer uptake… ▽ More

    Submitted 9 May, 2024; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: Submitted to IEEE-TPRMS

  28. arXiv:2206.01474  [pdf, other

    cs.LG stat.ML

    Offline Reinforcement Learning with Causal Structured World Models

    Authors: Zheng-Mao Zhu, Xiong-Hui Chen, Hong-Long Tian, Kun Zhang, Yang Yu

    Abstract: Model-based methods have recently shown promising for offline reinforcement learning (RL), aiming to learn good policies from historical data without interacting with the environment. Previous model-based offline RL methods learn fully connected nets as world-models that map the states and actions to the next-step states. However, it is sensible that a world-model should adhere to the underlying c… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  29. Modeling Ride-Sourcing Matching and Pickup Processes based on Additive Gaussian Process Models

    Authors: Zheng Zhu, Meng Xu, Yining Di, Xiqun Chen, Jingru Yu

    Abstract: Matching and pickup processes are core features of ride-sourcing services. Previous studies have adopted abundant analytical models to depict the two processes and obtain operational insights; while the goodness of fit between models and data was dismissed. To simultaneously consider the fitness between models and data and analytically tractable formations, we propose a data-driven approach based… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 30 pages, 8 figures, 4 tables. Submitted and under review in Transportmetrica B: Transport Dynamics

  30. arXiv:2203.04883  [pdf, other

    stat.ME

    A-Optimal Split Questionnaire Designs for Multivariate Continuous Variables

    Authors: Dae-Gyu Jang, Zhengyuan Zhu, Cindy Yu

    Abstract: A split questionnaire design (SQD), an alternative to full questionnaires, can reduce the response burden and improve survey quality. One can design a split questionnaire to reduce the information loss from missing data induced by the split questionnaire. This study develops a methodology for finding optimal SQD (OSQD) for multivariate continuous variables, applying a probabilistic design and opti… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

  31. arXiv:2203.01238  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    On the Optimization Landscape of Neural Collapse under MSE Loss: Global Optimality with Unconstrained Features

    Authors: Jinxin Zhou, Xiao Li, Tianyu Ding, Chong You, Qing Qu, Zhihui Zhu

    Abstract: When training deep neural networks for classification tasks, an intriguing empirical phenomenon has been widely observed in the last-layer classifiers and features, where (i) the class means and the last-layer classifiers all collapse to the vertices of a Simplex Equiangular Tight Frame (ETF) up to scaling, and (ii) cross-example within-class variability of last-layer activations collapses to zero… ▽ More

    Submitted 12 March, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  32. arXiv:2202.14026  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Robust Training under Label Noise by Over-parameterization

    Authors: Sheng Liu, Zhihui Zhu, Qing Qu, Chong You

    Abstract: Recently, over-parameterized deep networks, with increasingly more network parameters than training samples, have dominated the performances of modern machine learning. However, when the training data is corrupted, it has been well-known that over-parameterized networks tend to overfit and do not generalize. In this work, we propose a principled approach for robust training of over-parameterized d… ▽ More

    Submitted 2 August, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: 25 pages, 4 figures and 6 tables. Code is available at https://github.com/shengliu66/SOP

  33. arXiv:2201.01036  [pdf, other

    stat.ML cs.LG math.ST

    Supervised Homogeneity Fusion: a Combinatorial Approach

    Authors: Wen Wang, Shihao Wu, Ziwei Zhu, Ling Zhou, Peter X. -K. Song

    Abstract: Fusing regression coefficients into homogenous groups can unveil those coefficients that share a common value within each group. Such groupwise homogeneity reduces the intrinsic dimension of the parameter space and unleashes sharper statistical accuracy. We propose and investigate a new combinatorial grouping approach called $L_0$-Fusion that is amenable to mixed integer optimization (MIO). On the… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

  34. arXiv:2110.12088  [pdf, other

    cs.LG stat.ML

    Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations

    Authors: Jiaheng Wei, Zhaowei Zhu, Hao Cheng, Tongliang Liu, Gang Niu, Yang Liu

    Abstract: Existing research on learning with noisy labels mainly focuses on synthetic label noise. Synthetic noise, though has clean structures which greatly enabled statistical analyses, often fails to model real-world noise patterns. The recent literature has observed several efforts to offer real-world noisy datasets, yet the existing efforts suffer from two caveats: (1) The lack of ground-truth verifica… ▽ More

    Submitted 27 March, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICLR 2022

  35. arXiv:2110.06282  [pdf, other

    cs.LG stat.ML

    The Rich Get Richer: Disparate Impact of Semi-Supervised Learning

    Authors: Zhaowei Zhu, Tianyi Luo, Yang Liu

    Abstract: Semi-supervised learning (SSL) has demonstrated its potential to improve the model accuracy for a variety of learning tasks when the high-quality supervised data is severely limited. Although it is often established that the average accuracy for the entire population of data is improved, it is unclear how SSL fares with different sub-populations. Understanding the above question has substantial fa… ▽ More

    Submitted 31 August, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICLR 2022. Revised constants Theorems 1,2, and Lemma 3 (consider the union bound). Add acknowledgments to Nautilus

  36. Empirical likelihood inference for longitudinal data with covariate measurement errors: An application to the LEAN study

    Authors: Yuexia Zhang, Guoyou Qin, Zhongyi Zhu, Jiajia Zhang

    Abstract: Measurement errors usually arise during the longitudinal data collection process. Ignoring the effects of measurement errors will lead to invalid estimates. The Lifestyle Education for Activity and Nutrition (LEAN) study was designed to assess the effectiveness of intervention for enhancing weight loss over nine months. The covariates systolic blood pressure (SBP) and diastolic blood pressure (DBP… ▽ More

    Submitted 2 July, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

  37. arXiv:2110.01189  [pdf, other

    math.ST stat.ME

    Volatility prediction comparison via robust volatility proxies: An empirical deviation perspective

    Authors: Weichen Wang, Ran An, Ziwei Zhu

    Abstract: Volatility forecasting is crucial to risk management and portfolio construction. One particular challenge of assessing volatility forecasts is how to construct a robust proxy for the unknown true volatility. In this work, we show that the empirical loss comparison between two volatility predictors hinges on the deviation of the volatility proxy from the true volatility. We then establish non-asymp… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: 48 pages

    MSC Class: 62F35

  38. arXiv:2109.11154  [pdf, other

    math.OC cs.LG stat.ML

    Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery

    Authors: Lijun Ding, Liwei Jiang, Yudong Chen, Qing Qu, Zhihui Zhu

    Abstract: We study the robust recovery of a low-rank matrix from sparsely and grossly corrupted Gaussian measurements, with no prior knowledge on the intrinsic rank. We consider the robust matrix factorization approach. We employ a robust $\ell_1$ loss function and deal with the challenge of the unknown rank by using an overspecified factored representation of the matrix variable. We then solve the associat… ▽ More

    Submitted 26 October, 2021; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: 75 pages, 3 figures

  39. arXiv:2108.07940  [pdf, ps, other

    stat.ME

    Weak signal identification and inference in penalized likelihood models for categorical responses

    Authors: Yuexia Zhang, Peibei Shi, Zhongyi Zhu, Linbo Wang, Annie Qu

    Abstract: Penalized likelihood models are widely used to simultaneously select variables and estimate model parameters. However, the existence of weak signals can lead to inaccurate variable selection, biased parameter estimation, and invalid inference. Thus, identifying weak signals accurately and making valid inferences are crucial in penalized likelihood models. We develop a unified approach to identify… ▽ More

    Submitted 11 December, 2022; v1 submitted 17 August, 2021; originally announced August 2021.

    MSC Class: 62F99 ACM Class: G.3

  40. arXiv:2107.06939  [pdf, other

    math.ST stat.ME

    On sure early selection of the best subset

    Authors: Ziwei Zhu, Shihao Wu

    Abstract: The early solution path, which tracks the first few variables that enter the model of a selection procedure, is of profound importance to scientific discoveries. In practice, it is often statistically hopeless to identify all the important features with no false discovery, let alone the intimidating expense of experiments to test their significance. Such realistic limitation calls for statistical… ▽ More

    Submitted 17 November, 2022; v1 submitted 14 July, 2021; originally announced July 2021.

  41. arXiv:2105.02375  [pdf, other

    cs.LG cs.AI cs.IT math.OC stat.ML

    A Geometric Analysis of Neural Collapse with Unconstrained Features

    Authors: Zhihui Zhu, Tianyu Ding, Jinxin Zhou, Xiao Li, Chong You, Jeremias Sulam, Qing Qu

    Abstract: We provide the first global optimization landscape analysis of $Neural\;Collapse$ -- an intriguing empirical phenomenon that arises in the last-layer classifiers and features of neural networks during the terminal phase of training. As recently reported by Papyan et al., this phenomenon implies that ($i$) the class means and the last-layer classifiers all collapse to the vertices of a Simplex Equi… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 42 pages, 8 figures, 1 table; the first two authors contributed to this work equally

  42. arXiv:2102.05291  [pdf, other

    cs.LG cs.AI stat.ML

    Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

    Authors: Zhaowei Zhu, Yiwen Song, Yang Liu

    Abstract: The label noise transition matrix, characterizing the probabilities of a training instance being wrongly annotated, is crucial to designing popular solutions to learning with noisy labels. Existing works heavily rely on finding "anchor points" or their approximates, defined as instances belonging to a particular class almost surely. Nonetheless, finding anchor points remains a non-trivial task, an… ▽ More

    Submitted 13 July, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

    Comments: ICML 2021

  43. arXiv:2101.09418  [pdf, other

    stat.AP

    A Geospatial Functional Model For OCO-2 Data with Application on Imputation and Land Fraction Estimation

    Authors: Xinyue Chang, Zhengyuan Zhu, Xiongtao Dai, Jonathan Hobbs

    Abstract: Data from NASA's Orbiting Carbon Observatory-2 (OCO-2) satellite is essential to many carbon management strategies. A retrieval algorithm is used to estimate CO2 concentration using the radiance data measured by OCO-2. However, due to factors such as cloud cover and cosmic rays, the spatial coverage of the retrieval algorithm is limited in some areas of critical importance for carbon cycle science… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

  44. arXiv:2012.10030  [pdf, other]

    stat.ME

    Regularized Estimation in High-Dimensional Vector Auto-Regressive Models using Spatio-Temporal Information

    Authors: Zhenzhong Wang, Abolfazl Safikhani, Zhengyuan Zhu, David S. Matteson

    Abstract: A Vector Auto-Regressive (VAR) model is commonly used to model multivariate time series, and there are many penalized methods to handle high dimensionality. However in terms of spatio-temporal data, most methods do not take the spatial and temporal structure of the data into consideration, which may lead to unreliable network detection and inaccurate forecasts. This paper proposes a data-driven we…

    Submitted 17 December, 2020; originally announced December 2020.

  45. arXiv:2012.10009  [pdf, other]

    stat.ME

    Nonparametric Estimation of Repeated Densities with Heterogeneous Sample Sizes

    Authors: Jiaming Qiu, Xiongtao Dai, Zhengyuan Zhu

    Abstract: We consider the estimation of densities in multiple subpopulations, where the available sample size in each subpopulation greatly varies. This problem occurs in epidemiology, for example, where different diseases may share similar pathogenic mechanism but differ in their prevalence. Without specifying a parametric form, our proposed method pools information from the population and estimate the den…

    Submitted 13 September, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Add additional consistency results; rearrange some figures and tables for better presentation (results not changed); correct typos

  46. arXiv:2010.10090  [pdf, other]

    cs.LG cs.AI stat.ML

    Knowledge Distillation in Wide Neural Networks: Risk Bound, Data Efficiency and Imperfect Teacher

    Authors: Guangda Ji, Zhanxing Zhu

    Abstract: Knowledge distillation is a strategy of training a student network with guide of the soft output from a teacher network. It has been a successful method of model compression and knowledge transfer. However, currently knowledge distillation lacks a convincing theoretical understanding. On the other hand, recent finding on neural tangent kernel enables us to approximate a wide neural network with a…

    Submitted 20 October, 2020; originally announced October 2020.

  47. arXiv:2010.10079  [pdf, other]

    stat.ML cs.AI cs.LG stat.AP

    Neural Approximate Sufficient Statistics for Implicit Models

    Authors: Yanzhi Chen, Dinghuai Zhang, Michael Gutmann, Aaron Courville, Zhanxing Zhu

    Abstract: We consider the fundamental problem of how to automatically construct summary statistics for implicit generative models where the evaluation of the likelihood function is intractable, but sampling data from the model is possible. The idea is to frame the task of constructing sufficient statistics as learning mutual information maximizing representations of the data with the help of deep neural net…

    Submitted 30 March, 2021; v1 submitted 20 October, 2020; originally announced October 2020.

    Comments: ICLR 2021 spotlight

  48. arXiv:2010.02347  [pdf, other]

    cs.LG stat.ML

    Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

    Authors: Hao Cheng, Zhaowei Zhu, Xingyu Li, Yifei Gong, Xing Sun, Yang Liu

    Abstract: Human-annotated labels are often prone to noise, and the presence of such noise will degrade the performance of the resulting deep neural network (DNN) models. Much of the literature (with several recent exceptions) of learning with noisy labels focuses on the case when the label noise is independent of features. Practically, annotations errors tend to be instance-dependent and often depend on the…

    Submitted 22 March, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: ICLR 2021

  49. arXiv:2010.01748  [pdf, other]

    cs.LG cs.AI stat.ML

    Policy Learning Using Weak Supervision

    Authors: Jingkang Wang, Hongyi Guo, Zhaowei Zhu, Yang Liu

    Abstract: Most existing policy learning solutions require the learning agents to receive high-quality supervision signals such as well-designed rewards in reinforcement learning (RL) or high-quality expert demonstrations in behavioral cloning (BC). These quality supervisions are usually infeasible or prohibitively expensive to obtain in practice. We aim for a unified framework that leverages the available c…

    Submitted 2 November, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2021

  50. arXiv:2009.07888  [pdf, other]

    cs.LG cs.AI stat.ML

    Transfer Learning in Deep Reinforcement Learning: A Survey

    Authors: Zhuangdi Zhu, Kaixiang Lin, Anil K. Jain, Jiayu Zhou

    Abstract: Reinforcement learning is a learning paradigm for solving sequential decision-making problems. Recent years have witnessed remarkable progress in reinforcement learning upon the fast development of deep neural networks. Along with the promising prospects of reinforcement learning in numerous domains such as robotics and game-playing, transfer learning has arisen to tackle various challenges faced…

    Submitted 4 July, 2023; v1 submitted 16 September, 2020; originally announced September 2020.