Showing 1–50 of 136 results for author: Zhou, D

  1. arXiv:2405.06415  [pdf, other]

    stat.ML cs.LG

    Generalization analysis with deep ReLU networks for metric and similarity learning

    Authors: Junyu Zhou, Puyu Wang, Ding-Xuan Zhou

    Abstract: While considerable theoretical progress has been devoted to the study of metric and similarity learning, the generalization mystery is still missing. In this paper, we study the generalization performance of metric and similarity learning by leveraging the specific structure of the true metric (the target function). Specifically, by deriving the explicit form of the true metric for metric and simi…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages, 1 figure

  2. arXiv:2403.16459  [pdf, other]

    cs.LG math.ST stat.ML

    On the rates of convergence for learning with convolutional neural networks

    Authors: Yunfei Yang, Han Feng, Ding-Xuan Zhou

    Abstract: We study approximation and learning capacities of convolutional neural networks (CNNs) with one-side zero-padding and multiple channels. Our first result proves a new approximation bound for CNNs with certain constraint on the weights. Our second result gives new analysis on the covering number of feed-forward neural networks with CNNs as special cases. The analysis carefully takes into account th…

    Submitted 8 April, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  3. arXiv:2403.14926  [pdf, other]

    stat.ML cs.LG

    Contrastive Learning on Multimodal Analysis of Electronic Health Records

    Authors: Tianxi Cai, Feiqing Huang, Ryumei Nakada, Linjun Zhang, Doudou Zhou

    Abstract: Electronic health record (EHR) systems contain a wealth of multimodal clinical data including structured data like clinical codes and unstructured data such as clinical notes. However, many existing EHR-focused studies has traditionally either concentrated on an individual modality or merged different modalities in a rather rudimentary fashion. This approach often results in the perception of stru…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 34 pages

  4. arXiv:2403.12284  [pdf, other]

    math.ST q-bio.QM stat.AP stat.ME

    The Wreaths of KHAN: Uniform Graph Feature Selection with False Discovery Rate Control

    Authors: Jiajun Liang, Yue Liu, Doudou Zhou, Sinian Zhang, Junwei Lu

    Abstract: Graphical models find numerous applications in biology, chemistry, sociology, neuroscience, etc. While substantial progress has been made in graph estimation, it remains largely unexplored how to select significant graph signals with uncertainty assessment, especially those graph features related to topological structures including cycles (i.e., wreaths), cliques, hubs, etc. These features play a…

    Submitted 18 March, 2024; originally announced March 2024.

  5. arXiv:2403.11960  [pdf, other]

    cs.LG stat.ML

    CASPER: Causality-Aware Spatiotemporal Graph Neural Networks for Spatiotemporal Time Series Imputation

    Authors: Baoyu Jing, Dawei Zhou, Kan Ren, Carl Yang

    Abstract: Spatiotemporal time series is the foundation of understanding human activities and their impacts, which is usually collected via monitoring sensors placed at different locations. The collected data usually contains missing values due to various failures, which have significant impact on data analysis. To impute the missing values, a lot of methods have been introduced. When recovering a specific d…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Preprint. Work in progress

  6. arXiv:2402.12875  [pdf, other]

    cs.LG cs.CC stat.ML

    Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

    Authors: Zhiyuan Li, Hong Liu, Denny Zhou, Tengyu Ma

    Abstract: Instructing the model to generate a sequence of intermediate steps, a.k.a., a chain of thought (CoT), is a highly effective method to improve the accuracy of large language models (LLMs) on arithmetics and symbolic reasoning tasks. However, the mechanism behind CoT remains unclear. This work provides a theoretical understanding of the power of CoT for decoder-only transformers through the lens of…

    Submitted 23 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 38 pages, 10 figures. Accepted by ICLR 2024

  7. arXiv:2402.08998  [pdf, other]

    cs.LG stat.ML

    Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path

    Authors: Qiwei Di, Jiafan He, Dongruo Zhou, Quanquan Gu

    Abstract: We study the Stochastic Shortest Path (SSP) problem with a linear mixture transition kernel, where an agent repeatedly interacts with a stochastic environment and seeks to reach certain goal state while minimizing the cumulative cost. Existing works often assume a strictly positive lower bound of the cost function or an upper bound of the expected length for the optimal policy. In this paper, we p…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 28 pages, 1 figure, In ICML 2023

  8. arXiv:2401.02890  [pdf, other]

    stat.ML cs.LG

    Nonlinear functional regression by functional deep neural network with kernel embedding

    Authors: Zhongjie Shi, Jun Fan, Linhao Song, Ding-Xuan Zhou, Johan A. K. Suykens

    Abstract: With the rapid development of deep learning in various fields of science and technology, such as speech recognition, image classification, and natural language processing, recently it is also widely applied in the functional data analysis (FDA) with some empirical success. However, due to the infinite dimensional input, we need a powerful dimension reduction method for functional learning tasks, e…

    Submitted 5 January, 2024; originally announced January 2024.

  9. arXiv:2312.15611  [pdf, other]

    stat.ME stat.ML

    Inference of Dependency Knowledge Graph for Electronic Health Records

    Authors: Zhiwei Xu, Ziming Gan, Doudou Zhou, Shuting Shen, Junwei Lu, Tianxi Cai

    Abstract: The effective analysis of high-dimensional Electronic Health Record (EHR) data, with substantial potential for healthcare research, presents notable methodological challenges. Employing predictive modeling guided by a knowledge graph (KG), which enables efficient feature selection, can enhance both statistical efficiency and interpretability. While various methods have emerged for constructing KGs…

    Submitted 24 December, 2023; originally announced December 2023.

  10. arXiv:2311.14222  [pdf, other]

    cs.LG math.OC stat.ML

    Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

    Authors: Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu

    Abstract: Accelerated stochastic gradient descent (ASGD) is a workhorse in deep learning and often achieves better generalization performance than SGD. However, existing optimization theory can only explain the faster convergence of ASGD, but cannot explain its better generalization. In this paper, we study the generalization of ASGD for overparameterized linear regression, which is possibly the simplest se…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 85 pages, 5 figures

  11. arXiv:2311.11563  [pdf]

    stat.ME stat.AP

    Time-varying effect in the competing risks based on restricted mean time lost

    Authors: Zhiyin Yu, Zhao** Li, Chengfeng Zhang, Yawen Hou, Derun Zhou, Zheng Chen

    Abstract: Patients with breast cancer tend to die from other diseases, so for studies that focus on breast cancer, a competing risks model is more appropriate. Considering subdistribution hazard ratio, which is used often, limited to model assumptions and clinical interpretation, we aimed to quantify the effects of prognostic factors by an absolute indicator, the difference in restricted mean time lost (RMT…

    Submitted 20 November, 2023; originally announced November 2023.

  12. arXiv:2309.04236  [pdf, other]

    cs.LG stat.ML

    Adaptive Distributed Kernel Ridge Regression: A Feasible Distributed Learning Scheme for Data Silos

    Authors: Di Wang, Xiaotong Liu, Shao-Bo Lin, Ding-Xuan Zhou

    Abstract: Data silos, mainly caused by privacy and interoperability, significantly constrain collaborations among different organizations with similar data for the same purpose. Distributed learning based on divide-and-conquer provides a promising way to settle the data silos, but it suffers from several challenges, including autonomy, privacy guarantees, and the necessity of collaborations. This paper focu…

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 46 pages, 13 figures

  13. arXiv:2308.09605  [pdf, other]

    math.NA cs.LG math.ST stat.ML

    Solving PDEs on Spheres with Physics-Informed Convolutional Neural Networks

    Authors: Guanhang Lei, Zhen Lei, Lei Shi, Chenyu Zeng, Ding-Xuan Zhou

    Abstract: Physics-informed neural networks (PINNs) have been demonstrated to be efficient in solving partial differential equations (PDEs) from a variety of experimental perspectives. Some recent studies have also proposed PINN algorithms for PDEs on surfaces, including spheres. However, theoretical understanding of the numerical performance of PINNs, especially PINNs on surfaces or manifolds, is still lack…

    Submitted 18 August, 2023; originally announced August 2023.

  14. arXiv:2308.08562  [pdf, other]

    stat.AP q-bio.CB

    Bayesian Inference of Phenotypic Plasticity of Cancer Cells Based on Dynamic Model for Temporal Cell Proportion Data

    Authors: Shuli Chen, Yuman Wang, Da Zhou, Jie Hu

    Abstract: Mounting evidence underscores the prevalent hierarchical organization of cancer tissues. At the foundation of this hierarchy reside cancer stem cells, a subset of cells endowed with the pivotal role of engendering the entire cancer tissue through cell differentiation. In recent times, substantial attention has been directed towards the phenomenon of cancer cell plasticity, where the dynamic interc…

    Submitted 14 August, 2023; originally announced August 2023.

  15. arXiv:2308.04158  [pdf, other]

    stat.ME

    A Dual Cox Model Theory And Its Applications In Oncology

    Authors: Powei Chen, Siying Hu, Dr. Hao** Zhou

    Abstract: Given the prominence of targeted therapy and immunotherapy in cancer treatment, it becomes imperative to consider heterogeneity in patients' responses to treatments, which contributes greatly to the widely used proportional hazard assumption invalidated as in several clinical trials. To address the challenge, we develop a Dual Cox model theory including a Dual Cox model and a fitting algorithm.…

    Submitted 8 August, 2023; originally announced August 2023.

  16. arXiv:2307.16792  [pdf, ps, other]

    stat.ML cs.LG

    Classification with Deep Neural Networks and Logistic Loss

    Authors: Zihan Zhang, Lei Shi, Ding-Xuan Zhou

    Abstract: Deep neural networks (DNNs) trained with the logistic loss (i.e., the cross entropy loss) have made impressive advancements in various binary classification tasks. However, generalization analysis for binary classification with DNNs and logistic loss remains scarce. The unboundedness of the target function for the logistic loss is the main obstacle to deriving satisfactory generalization bounds. I…

    Submitted 21 April, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

  17. arXiv:2307.16502  [pdf, ps, other]

    math.CO stat.ME

    Percolated stochastic block model via EM algorithm and belief propagation with non-backtracking spectra

    Authors: Marianna Bolla, Daniel Zhou

    Abstract: Whereas Laplacian and modularity based spectral clustering is apt to dense graphs, recent results show that for sparse ones, the non-backtracking spectrum is the best candidate to find assortative clusters of nodes. Here belief propagation in the sparse stochastic block model is derived with arbitrary given model parameters that results in a non-linear system of equations; with linear approximatio…

    Submitted 26 December, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 30 pages, 16 figures

    MSC Class: 05C50; 05C80; 62H30

  18. arXiv:2307.12461  [pdf, ps, other]

    cs.LG stat.ML

    Rates of Approximation by ReLU Shallow Neural Networks

    Authors: Tong Mao, Ding-Xuan Zhou

    Abstract: Neural networks activated by the rectified linear unit (ReLU) play a central role in the recent development of deep learning. The topic of approximating functions from Hölder spaces by these networks is crucial for understanding the efficiency of the induced learning algorithms. Although the topic has been well investigated in the setting of deep neural networks with many layers of hidden neurons,…

    Submitted 23 July, 2023; originally announced July 2023.

  19. arXiv:2307.03487  [pdf, ps, other]

    stat.ML cs.LG

    Learning Theory of Distribution Regression with Neural Networks

    Authors: Zhongjie Shi, Zhan Yu, Ding-Xuan Zhou

    Abstract: In this paper, we aim at establishing an approximation theory and a learning theory of distribution regression via a fully connected neural network (FNN). In contrast to the classical regression methods, the input variables of distribution regression are probability measures. Then we often need to perform a second-stage sampling process to approximate the actual information of the distribution. On…

    Submitted 7 July, 2023; originally announced July 2023.

  20. arXiv:2306.08321  [pdf, other]

    stat.ML cs.LG math.ST

    Nonparametric regression using over-parameterized shallow ReLU neural networks

    Authors: Yunfei Yang, Ding-Xuan Zhou

    Abstract: It is shown that over-parameterized neural networks can achieve minimax optimal rates of convergence (up to logarithmic factors) for learning functions from certain smooth function classes, if the weights are suitably constrained or regularized. Specifically, we consider the nonparametric regression of estimating an unknown $d$-variate function by using shallow ReLU neural networks. It is assumed…

    Submitted 15 May, 2024; v1 submitted 14 June, 2023; originally announced June 2023.

    Journal ref: Journal of Machine Learning Research, 25(165):1-35, 2024

  21. arXiv:2305.19640  [pdf, other]

    stat.ML cs.LG

    Fine-grained analysis of non-parametric estimation for pairwise learning

    Authors: Junyu Zhou, Shuo Huang, Han Feng, Puyu Wang, Ding-Xuan Zhou

    Abstract: In this paper, we are concerned with the generalization performance of non-parametric estimation for pairwise learning. Most of the existing work requires the hypothesis space to be convex or a VC-class, and the loss to be convex. However, these restrictive assumptions limit the applicability of the results in studying many popular methods, especially kernel methods and neural networks. We signifi…

    Submitted 21 June, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 30 pages, 1 figure

  22. arXiv:2305.17126  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Large Language Models as Tool Makers

    Authors: Tianle Cai, Xuezhi Wang, Tengyu Ma, Xinyun Chen, Denny Zhou

    Abstract: Recent research has highlighted the potential of large language models (LLMs) to improve their problem-solving capabilities with the aid of suitable external tools. In our work, we further advance this concept by introducing a closed-loop framework, referred to as LLMs As Tool Makers (LATM), where LLMs create their own reusable tools for problem-solving. Our approach consists of two phases: 1) to…

    Submitted 10 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Code available at https://github.com/ctlllll/LLM-ToolMaker

  23. arXiv:2305.16891  [pdf, other]

    cs.LG stat.ML

    Generalization Guarantees of Gradient Descent for Multi-Layer Neural Networks

    Authors: Puyu Wang, Yunwen Lei, Di Wang, Yiming Ying, Ding-Xuan Zhou

    Abstract: Recently, significant progress has been made in understanding the generalization of neural networks (NNs) trained by gradient descent (GD) using the algorithmic stability approach. However, most of the existing research has focused on one-hidden-layer NNs and has not addressed the impact of different network scaling parameters. In this paper, we greatly extend the previous work \cite{lei2022stabil…

    Submitted 29 September, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 38 pages, 2 figures

  24. arXiv:2305.11965  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization

    Authors: Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang

    Abstract: In this paper, we aim to optimize a contrastive loss with individualized temperatures in a principled and systematic manner for self-supervised learning. The common practice of using a global temperature parameter $τ$ ignores the fact that ``not all semantics are created equal", meaning that different anchor data may have different numbers of samples with similar semantics, especially when data ex…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 33 pages, 11 figures, accepted by ICML2023

  25. arXiv:2305.07408  [pdf, ps, other]

    stat.ML cs.LG

    Distributed Gradient Descent for Functional Learning

    Authors: Zhan Yu, Jun Fan, Ding-Xuan Zhou

    Abstract: In recent years, different types of distributed learning schemes have received increasing attention for their strong advantages in handling large-scale data information. In the information era, to face the big data challenges which stem from functional data analysis very recently, we propose a novel distributed gradient descent functional learning (DGDFL) algorithm to tackle functional data across…

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 35 pages

  26. arXiv:2304.04443  [pdf, other]

    stat.ML cs.LG

    Approximation of Nonlinear Functionals Using Deep ReLU Networks

    Authors: Linhao Song, Jun Fan, Di-Rong Chen, Ding-Xuan Zhou

    Abstract: In recent years, functional neural networks have been proposed and studied in order to approximate nonlinear continuous functionals defined on $L^p([-1, 1]^s)$ for integers $s\ge1$ and $1\le p<\infty$. However, their theoretical properties are largely unknown beyond universality of approximation or the existing analysis does not apply to the rectified linear unit (ReLU) activation function. To fil…

    Submitted 10 April, 2023; originally announced April 2023.

  27. Optimal rates of approximation by shallow ReLU$^k$ neural networks and applications to nonparametric regression

    Authors: Yunfei Yang, Ding-Xuan Zhou

    Abstract: We study the approximation capacity of some variation spaces corresponding to shallow ReLU$^k$ neural networks. It is shown that sufficiently smooth functions are contained in these spaces with finite variation norms. For functions with less smoothness, the approximation rates in terms of the variation norm are established. Using these results, we are able to prove the optimal approximation rates…

    Submitted 8 January, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Version 3 improves some approximation bounds by using recent results from arXiv:2307.15285

  28. arXiv:2302.10371  [pdf, other]

    cs.LG math.OC stat.ML

    Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency

    Authors: Heyang Zhao, Jiafan He, Dongruo Zhou, Tong Zhang, Quanquan Gu

    Abstract: Recently, several studies (Zhou et al., 2021a; Zhang et al., 2021b; Kim et al., 2021; Zhou and Gu, 2022) have provided variance-dependent regret bounds for linear contextual bandits, which interpolates the regret for the worst-case regime and the deterministic reward regime. However, these algorithms are either computationally intractable or unable to handle unknown variance of the noise. In this…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 43 pages, 2 tables

  29. arXiv:2212.06132  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

    Authors: Jiafan He, Heyang Zhao, Dongruo Zhou, Quanquan Gu

    Abstract: We study reinforcement learning (RL) with linear function approximation. For episodic time-inhomogeneous linear Markov decision processes (linear MDPs) whose transition probability can be parameterized as a linear function of a given feature mapping, we propose the first computationally efficient algorithm that achieves the nearly minimax optimal regret $\tilde O(d\sqrt{H^3K})$, where $d$ is the d…

    Submitted 3 November, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 33 pages, 1 table. In ICML 2023

  30. arXiv:2210.10643  [pdf, other]

    cs.LG cs.AI stat.ML

    Towards Accurate Subgraph Similarity Computation via Neural Graph Pruning

    Authors: Linfeng Liu, Xu Han, Dawei Zhou, Li-Ping Liu

    Abstract: Subgraph similarity search, one of the core problems in graph search, concerns whether a target graph approximately contains a query graph. The problem is recently touched by neural methods. However, current neural methods do not consider pruning the target graph, though pruning is critically important in traditional calculations of subgraph similarities. One obstacle to applying pruning in neural…

    Submitted 19 October, 2022; originally announced October 2022.

    Journal ref: Transactions on Machine Learning Research (TMLR) October 2022

  31. arXiv:2209.13762  [pdf, other]

    stat.ML cs.LG

    Consensus Knowledge Graph Learning via Multi-view Sparse Low Rank Block Model

    Authors: Tianxi Cai, Dong Xia, Luwan Zhang, Doudou Zhou

    Abstract: Network analysis has been a powerful tool to unveil relationships and interactions among a large number of objects. Yet its effectiveness in accurately identifying important node-node interactions is challenged by the rapidly growing network size, with data being collected at an unprecedented granularity and scale. Common wisdom to overcome such high dimensionality is collapsing nodes into smaller…

    Submitted 27 September, 2022; originally announced September 2022.

  32. arXiv:2209.08005  [pdf, ps, other]

    stat.ML cs.LG

    Stability and Generalization for Markov Chain Stochastic Gradient Methods

    Authors: Puyu Wang, Yunwen Lei, Yiming Ying, Ding-Xuan Zhou

    Abstract: Recently there is a large amount of work devoted to the study of Markov chain stochastic gradient methods (MC-SGMs) which mainly focus on their convergence analysis for solving minimization problems. In this paper, we provide a comprehensive generalization analysis of MC-SGMs for both minimization and minimax problems through the lens of algorithmic stability in the framework of statistical learni…

    Submitted 16 September, 2022; originally announced September 2022.

  33. arXiv:2209.04188  [pdf, ps, other]

    stat.ML cs.CR cs.LG

    Differentially Private Stochastic Gradient Descent with Low-Noise

    Authors: Puyu Wang, Yunwen Lei, Yiming Ying, Ding-Xuan Zhou

    Abstract: Modern machine learning algorithms aim to extract fine-grained information from data to provide accurate predictions, which often conflicts with the goal of privacy protection. This paper addresses the practical and theoretical importance of developing privacy-preserving machine learning algorithms that ensure good performance while preserving privacy. In this paper, we focus on the privacy and ut…

    Submitted 14 July, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

  34. arXiv:2208.06972  [pdf, other]

    stat.AP econ.GN

    Is the NFL's franchise tag fair to players?

    Authors: Darwin Zhou

    Abstract: There has been a consistent criticism over the past decade of the NFL franchise tag's monetary limitations due to its biased institutions in favor of the team rather than the player. But the question whether the NFL's franchise tag is fair or unfair to players has never been systematically studied. In this paper, I investigate the effects of NFL players' contract extensions when on a franchise tag…

    Submitted 15 August, 2022; v1 submitted 14 August, 2022; originally announced August 2022.

  35. arXiv:2208.05363  [pdf, ps, other]

    cs.LG cs.AI cs.GT math.OC stat.ML

    Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

    Authors: Chris Junchi Li, Dongruo Zhou, Quanquan Gu, Michael I. Jordan

    Abstract: We consider learning Nash equilibria in two-player zero-sum Markov Games with nonlinear function approximation, where the action-value function is approximated by a function in a Reproducing Kernel Hilbert Space (RKHS). The key challenge is how to do exploration in the high-dimensional function space. We propose a novel online learning algorithm to find a Nash equilibrium by minimizing the duality…

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: 42 pages

  36. arXiv:2208.05134  [pdf, other]

    stat.ME

    Doubly Robust Augmented Model Accuracy Transfer Inference with High Dimensional Features

    Authors: Doudou Zhou, Molei Liu, Mengyan Li, Tianxi Cai

    Abstract: Due to label scarcity and covariate shift happening frequently in real-world studies, transfer learning has become an essential technique to train models generalizable to some target populations using existing labeled source data. Most existing transfer learning research has been focused on model estimation, while there is a paucity of literature on transfer inference for model accuracy despite it…

    Submitted 8 November, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

  37. arXiv:2206.05581  [pdf, other]

    stat.ML cs.LG stat.ME

    Federated Offline Reinforcement Learning

    Authors: Doudou Zhou, Yufeng Zhang, Aaron Sonabend-W, Zhaoran Wang, Junwei Lu, Tianxi Cai

    Abstract: Evidence-based or data-driven dynamic treatment regimes are essential for personalized medicine, which can benefit from offline reinforcement learning (RL). Although massive healthcare data are available across medical institutions, they are prohibited from sharing due to privacy constraints. Besides, heterogeneity exists in different sites. As a result, federated offline RL algorithms are necessa…

    Submitted 27 January, 2024; v1 submitted 11 June, 2022; originally announced June 2022.

  38. arXiv:2206.03038  [pdf, other]

    stat.ME

    Asymptotic Distribution-free Change-point Detection for Modern Data Based on a New Ranking Scheme

    Authors: Doudou Zhou, Hao Chen

    Abstract: Change-point detection (CPD) involves identifying distributional changes in a sequence of independent observations. Among nonparametric methods, rank-based methods are attractive due to their robustness and effectiveness and have been extensively studied for univariate data. However, they are not well explored for high-dimensional or non-Euclidean data. This paper proposes a new method, Rank INduc…

    Submitted 27 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  39. arXiv:2205.11507  [pdf, other]

    cs.LG math.OC stat.ML

    Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs

    Authors: Dongruo Zhou, Quanquan Gu

    Abstract: Recent studies have shown that episodic reinforcement learning (RL) is not more difficult than contextual bandits, even with a long planning horizon and unknown state transitions. However, these results are limited to either tabular Markov decision processes (MDPs) or computationally inefficient algorithms for linear mixture MDPs. In this paper, we propose the first computationally efficient horiz…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 33 pages, 1 table

  40. arXiv:2205.06811  [pdf, other]

    cs.LG stat.ML

    Nearly Optimal Algorithms for Linear Contextual Bandits with Adversarial Corruptions

    Authors: Jiafan He, Dongruo Zhou, Tong Zhang, Quanquan Gu

    Abstract: We study the linear contextual bandit problem in the presence of adversarial corruption, where the reward at each round is corrupted by an adversary, and the corruption level (i.e., the sum of corruption magnitudes over the horizon) is $C\geq 0$. The best-known algorithms in this setting are limited in that they either are computationally inefficient or require a strong assumption on the corruptio…

    Submitted 9 July, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: 25 pages, 1 table. This version simplifies the proof of the regret upper bound in Version 1, and provides a stronger result for the lower bound

  41. arXiv:2202.13603  [pdf, other]

    cs.LG math.OC stat.ML

    Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits

    Authors: Heyang Zhao, Dongruo Zhou, Jiafan He, Quanquan Gu

    Abstract: We study the problem of online generalized linear regression in the stochastic setting, where the label is generated from a generalized linear model with possibly unbounded additive noise. We provide a sharp analysis of the classical follow-the-regularized-leader (FTRL) algorithm to cope with the label noise. More specifically, for $σ$-sub-Gaussian label noise, our analysis provides a regret upper…

    Submitted 27 March, 2023; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: 27 pages, 3 figures. In this updated version, we have changed the paper title, added new theoretical results on the FTRL algorithm and mainly focused on stochastic online regression. Refer to arXiv:2202.13603v1 for the previous version, which contains more results on heteroscedastic nonlinear bandits

  42. arXiv:2202.12387  [pdf, other]

    cs.LG cs.CV math.OC stat.ML

    Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance

    Authors: Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny Zhou, Tianbao Yang

    Abstract: In this paper, we study contrastive learning from an optimization perspective, aiming to analyze and address a fundamental issue of existing contrastive learning methods that either rely on a large batch size or a large dictionary of feature vectors. We consider a global objective for contrastive learning, which contrasts each positive pair with all negative pairs for an anchor point. From the opt…

    Submitted 20 September, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted by ICML2022

  43. arXiv:2112.12948  [pdf, other]

    stat.ME math.ST

    A new ranking scheme for modern data and its application to two-sample hypothesis testing

    Authors: Doudou Zhou, Hao Chen

    Abstract: Rank-based approaches are among the most popular nonparametric methods for univariate data in tackling statistical problems such as hypothesis testing due to their robustness and effectiveness. However, they are unsatisfactory for more complex data. In the era of big data, high-dimensional and non-Euclidean data, such as networks and images, are ubiquitous and pose challenges for statistical analy…

    Submitted 1 July, 2023; v1 submitted 24 December, 2021; originally announced December 2021.

  44. arXiv:2110.13144  [pdf, other]

    math.OC cs.LG stat.ML

    Faster Perturbed Stochastic Gradient Methods for Finding Local Minima

    Authors: Zixiang Chen, Dongruo Zhou, Quanquan Gu

    Abstract: Escaping from saddle points and finding local minima is a central problem in nonconvex optimization. Perturbed gradient methods are perhaps the simplest approach for this problem. However, to find $(ε, \sqrt{ε})$-approximate local minima, the existing best stochastic gradient complexity for this type of algorithm is $\tilde O(ε^{-3.5})$, which is not optimal. In this paper, we propose LENA (Last s…

    Submitted 20 April, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: 29 pages, 1 figure, 1 table. In ALT 2022

  45. arXiv:2110.12615  [pdf, other]

    cs.LG stat.ML

    Linear Contextual Bandits with Adversarial Corruptions

    Authors: Heyang Zhao, Dongruo Zhou, Quanquan Gu

    Abstract: We study the linear contextual bandit problem in the presence of adversarial corruption, where the interaction between the player and a possibly infinite decision set is contaminated by an adversary that can corrupt the reward up to a corruption level $C$ measured by the sum of the largest alteration on rewards in each round. We present a variance-aware algorithm that is adaptive to the level of a…

    Submitted 24 October, 2021; originally announced October 2021.

    Comments: 27 pages, 1 figure

  46. arXiv:2110.09704  [pdf, other]

    stat.ME eess.SY

    Hybrid variable monitoring: An unsupervised process monitoring framework with binary and continuous variables

    Authors: Min Wang, Donghua Zhou, Maoyin Chen

    Abstract: Traditional process monitoring methods, such as PCA, PLS, ICA, and MD, are strongly dependent on continuous variables because most of them inevitably involve Euclidean or Mahalanobis distance. With industrial processes becoming more and more complex and integrated, binary variables also appear in monitoring variables besides continuous variables, which makes process monitoring more challenging.…

    Submitted 10 March, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: This paper has been submitted to Automatica for potential publication

  47. arXiv:2110.06394  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation

    Authors: Weitong Zhang, Dongruo Zhou, Quanquan Gu

    Abstract: We study the model-based reward-free reinforcement learning with linear function approximation for episodic Markov decision processes (MDPs). In this setting, the agent works in two phases. In the exploration phase, the agent interacts with the environment and collects samples without the reward. In the planning phase, the agent is given a specific reward function and uses samples collected from t…

    Submitted 31 December, 2021; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: 29 pages, 1 figure, 1 table. In NeurIPS 2021

  48. arXiv:2106.12498  [pdf, other]

    cs.LG cs.IT stat.ML

    Universal Consistency of Deep Convolutional Neural Networks

    Authors: Shao-Bo Lin, Kaidong Wang, Yao Wang, Ding-Xuan Zhou

    Abstract: Compared with avid research activities of deep convolutional neural networks (DCNNs) in practice, the study of theoretical behaviors of DCNNs lags heavily behind. In particular, the universal consistency of DCNNs remains open. In this paper, we prove that implementing empirical risk minimization on DCNNs with expansive convolution (with zero-padding) is strongly universally consistent. Motivated b…

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: 9 pages, 4 figures

  49. arXiv:2106.12034  [pdf, other]

    stat.ML cs.LG

    Pure Exploration in Kernel and Neural Bandits

    Authors: Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang, Quanquan Gu, Rebecca Willett, Robert Nowak

    Abstract: We study pure exploration in bandits, where the dimension of the feature representation can be much larger than the number of arms. To overcome the curse of dimensionality, we propose to adaptively embed the feature representation of each arm into a lower-dimensional space and carefully deal with the induced model misspecification. Our approach is conceptually very different from existing works th…

    Submitted 17 March, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

  50. arXiv:2106.11960  [pdf, other]

    cs.LG math.OC stat.ML

    Variance-Aware Off-Policy Evaluation with Linear Function Approximation

    Authors: Yifei Min, Tianhao Wang, Dongruo Zhou, Quanquan Gu

    Abstract: We study the off-policy evaluation (OPE) problem in reinforcement learning with linear function approximation, which aims to estimate the value function of a target policy based on the offline data collected by a behavior policy. We propose to incorporate the variance information of the value function to improve the sample efficiency of OPE. More specifically, for time-inhomogeneous episodic linea…

    Submitted 3 January, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: 59 pages, 4 figures. In NeurIPS 2021