Skip to main content

Showing 1–50 of 86 results for author: Zhao, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.17079  [pdf, other

    stat.ML cs.LG

    Learning with User-Level Local Differential Privacy

    Authors: Puning Zhao, Li Shen, Rongfei Fan, Qingming Li, Huiwen Wu, Jiafei Wu, Zhe Liu

    Abstract: User-level privacy is important in distributed systems. Previous research primarily focuses on the central model, while the local models have received much less attention. Under the central model, user-level DP is strictly stronger than the item-level one. However, under the local model, the relationship between user-level and item-level LDP becomes more complex, thus the analysis is crucially dif… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2404.06984  [pdf, other

    stat.ME

    Adaptive Strategy of Testing Alphas in High Dimensional Linear Factor Pricing Models

    Authors: Chenxi Zhao, ** Zhao, Long Feng, Zhaojun Wang

    Abstract: In recent years, there has been considerable research on testing alphas in high-dimensional linear factor pricing models. In our study, we introduce a novel max-type test procedure that performs well under sparse alternatives. Furthermore, we demonstrate that this new max-type test procedure is asymptotically independent from the sum-type test procedure proposed by Pesaran and Yamagata (2017). Bui… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  3. arXiv:2403.04568  [pdf, other

    cs.LG stat.ML

    Improved Algorithm for Adversarial Linear Mixture MDPs with Bandit Feedback and Unknown Transition

    Authors: Long-Fei Li, Peng Zhao, Zhi-Hua Zhou

    Abstract: We study reinforcement learning with linear function approximation, unknown transition, and adversarial losses in the bandit feedback setting. Specifically, we focus on linear mixture MDPs whose transition kernel is a linear mixture model. We propose a new algorithm that attains an $\widetilde{O}(d\sqrt{HS^3K} + \sqrt{HSAK})$ regret with high probability, where $d$ is the dimension of feature mapp… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: AISTATS 2024

  4. arXiv:2310.09545  [pdf, other

    stat.ME econ.EM math.ST stat.ML

    A Semiparametric Instrumented Difference-in-Differences Approach to Policy Learning

    Authors: Pan Zhao, Yifan Cui

    Abstract: Recently, there has been a surge in methodological development for the difference-in-differences (DiD) approach to evaluate causal effects. Standard methods in the literature rely on the parallel trends assumption to identify the average treatment effect on the treated. However, the parallel trends assumption may be violated in the presence of unmeasured confounding, and the average treatment effe… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  5. arXiv:2310.06969  [pdf, other

    stat.ME cs.LG stat.ML

    Positivity-free Policy Learning with Observational Data

    Authors: Pan Zhao, Antoine Chambaz, Julie Josse, Shu Yang

    Abstract: Policy learning utilizing observational data is pivotal across various domains, with the objective of learning the optimal treatment assignment policy while adhering to specific constraints such as fairness, budget, and simplicity. This study introduces a novel positivity-free (stochastic) policy learning framework designed to address the challenges posed by the impracticality of the positivity as… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  6. arXiv:2309.08911  [pdf, other

    cs.LG stat.ML

    Efficient Methods for Non-stationary Online Learning

    Authors: Peng Zhao, Yan-Feng Xie, Lijun Zhang, Zhi-Hua Zhou

    Abstract: Non-stationary online learning has drawn much attention in recent years. In particular, dynamic regret and adaptive regret are proposed as two principled performance measures for online convex optimization in non-stationary environments. To optimize them, a two-layer online ensemble is usually deployed due to the inherent uncertainty of the non-stationarity, in which a group of base-learners are m… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: preliminary conference version appeared at NeurIPS 2022; this extended version improves the paper presentation, further investigates the interval dynamic regret, and adds two applications (online non-stochastic control and online PCA)

  7. arXiv:2309.01691  [pdf, other

    stat.ME

    Frequentist Model Averaging for Global Fréchet Regression

    Authors: Xingyu Yan, Xinyu Zhang, Peng Zhao

    Abstract: To consider model uncertainty in global Fréchet regression and improve density response prediction, we propose a frequentist model averaging method. The weights are chosen by minimizing a cross-validation criterion based on Wasserstein distance. In the cases where all candidate models are misspecified, we prove that the corresponding model averaging estimator has asymptotic optimality, achieving t… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  8. arXiv:2308.01490  [pdf, ps, other

    cs.LG stat.ML

    Minimax Optimal Q Learning with Nearest Neighbors

    Authors: Puning Zhao, Lifeng Lai

    Abstract: Analyzing the Markov decision process (MDP) with continuous state spaces is generally challenging. A recent interesting work \cite{shah2018q} solves MDP with bounded continuous state space by a nearest neighbor $Q$ learning approach, which has a sample complexity of $\tilde{O}(\frac{1}{ε^{d+3}(1-γ)^{d+7}})$ for $ε$-accurate $Q$ function estimation with discount factor $γ$. In this paper, we propos… ▽ More

    Submitted 17 June, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  9. arXiv:2307.08360  [pdf, other

    cs.LG math.OC stat.ML

    Universal Online Learning with Gradient Variations: A Multi-layer Online Ensemble Approach

    Authors: Yu-Hu Yan, Peng Zhao, Zhi-Hua Zhou

    Abstract: In this paper, we propose an online convex optimization approach with two different levels of adaptivity. On a higher level, our approach is agnostic to the unknown types and curvatures of the online functions, while at a lower level, it can exploit the unknown niceness of the environments and attain problem-dependent guarantees. Specifically, we obtain $\mathcal{O}(\log V_T)$,… ▽ More

    Submitted 15 April, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  10. arXiv:2303.08979  [pdf, other

    stat.ME stat.CO

    An Approximate Bayesian Approach to Covariate-dependent Graphical Modeling

    Authors: Sutanoy Dasgupta, Peng Zhao, Jacob Helwig, Prasenjit Ghosh, Debdeep Pati, Bani K. Mallick

    Abstract: Gaussian graphical models typically assume a homogeneous structure across all subjects, which is often restrictive in applications. In this article, we propose a weighted pseudo-likelihood approach for graphical modeling which allows different subjects to have different graphical structures depending on extraneous covariates. The pseudo-likelihood approach replaces the joint distribution by a prod… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  11. arXiv:2303.02691  [pdf, other

    cs.LG stat.ML

    Revisiting Weighted Strategy for Non-stationary Parametric Bandits

    Authors: **g Wang, Peng Zhao, Zhi-Hua Zhou

    Abstract: Non-stationary parametric bandits have attracted much attention recently. There are three principled ways to deal with non-stationarity, including sliding-window, weighted, and restart strategies. As many non-stationary environments exhibit gradual drifting patterns, the weighted strategy is commonly adopted in real-world applications. However, previous theoretical studies show that its analysis i… ▽ More

    Submitted 7 June, 2023; v1 submitted 5 March, 2023; originally announced March 2023.

    Comments: AISTATS 2023

  12. arXiv:2302.08076  [pdf, ps, other

    stat.ME

    Augmented two-step estimating equations with nuisance functionals and complex survey data

    Authors: Puying Zhao, Changbao Wu

    Abstract: Statistical inference in the presence of nuisance functionals with complex survey data is an important topic in social and economic studies. The Gini index, Lorenz curves and quantile shares are among the commonly encountered examples. The nuisance functionals are usually handled by a plug-in nonparametric estimator and the main inferential procedure can be carried out through a two-step generaliz… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 43 pages

  13. arXiv:2302.04552  [pdf, ps, other

    cs.LG stat.ML

    Optimistic Online Mirror Descent for Bridging Stochastic and Adversarial Online Convex Optimization

    Authors: Sijia Chen, Yu-Jie Zhang, Wei-Wei Tu, Peng Zhao, Lijun Zhang

    Abstract: Stochastically Extended Adversarial (SEA) model is introduced by Sachs et al. [2022] as an interpolation between stochastic and adversarial online convex optimization. Under the smoothness condition, they demonstrate that the expected regret of optimistic follow-the-regularized-leader (FTRL) depends on the cumulative stochastic variance $σ_{1:T}^2$ and the cumulative adversarial variation… ▽ More

    Submitted 16 March, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: v3 substantially improves the presentation and has a few improvements, including the regret bound for strongly convex functions; v2 is an extended version that enriches the content with improved regret bounds for strongly convex functions, discussions on the optimism design for dynamic regret minimization, and extensions to non-smooth scenarios; v1 is the ICML 2023 conference version

  14. arXiv:2302.02552  [pdf, other

    cs.LG stat.ML

    Adapting to Continuous Covariate Shift via Online Density Ratio Estimation

    Authors: Yu-Jie Zhang, Zhen-Yu Zhang, Peng Zhao, Masashi Sugiyama

    Abstract: Dealing with distribution shifts is one of the central challenges for modern machine learning. One fundamental situation is the covariate shift, where the input distributions of data change from training to testing stages while the input-conditional output distribution remains unchanged. In this paper, we initiate the study of a more challenging scenario -- continuous covariate shift -- in which t… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023

  15. arXiv:2301.05491  [pdf, other

    stat.ME stat.ML

    Efficient and robust transfer learning of optimal individualized treatment regimes with right-censored survival data

    Authors: Pan Zhao, Julie Josse, Shu Yang

    Abstract: An individualized treatment regime (ITR) is a decision rule that assigns treatments based on patients' characteristics. The value function of an ITR is the expected outcome in a counterfactual world had this ITR been implemented. Recently, there has been increasing interest in combining heterogeneous data sources, such as leveraging the complementary features of randomized controlled trial (RCT) d… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

  16. arXiv:2210.11050  [pdf, other

    cs.LG stat.ML

    Vertical Federated Linear Contextual Bandits

    Authors: Zeyu Cao, Zhipeng Liang, Shu Zhang, Hangyu Li, Ouyang Wen, Yu Rong, Peilin Zhao, Bingzhe Wu

    Abstract: In this paper, we investigate a novel problem of building contextual bandits in the vertical federated setting, i.e., contextual information is vertically distributed over different departments. This problem remains largely unexplored in the research community. To this end, we carefully design a customized encryption scheme named orthogonal matrix-based mask mechanism(O3M) for encrypting local con… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  17. arXiv:2210.00091  [pdf, other

    stat.ME stat.ML

    Factorized Fusion Shrinkage for Dynamic Relational Data

    Authors: Peng Zhao, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

    Abstract: Modern data science applications often involve complex relational data with dynamic structures. An abrupt change in such dynamic relational data is typically observed in systems that undergo regime changes due to interventions. In such a case, we consider a factorized fusion shrinkage model in which all decomposed factors are dynamically shrunk towards group-wise fusion structures, where the shrin… ▽ More

    Submitted 18 April, 2023; v1 submitted 30 September, 2022; originally announced October 2022.

  18. arXiv:2209.15117  [pdf, other

    stat.ML math.ST stat.CO

    Structured Optimal Variational Inference for Dynamic Latent Space Models

    Authors: Peng Zhao, Anirban Bhattacharya, Debdeep Pati, Bani K. Mallick

    Abstract: We consider a latent space model for dynamic networks, where our objective is to estimate the pairwise inner products of the latent positions. To balance posterior inference and computational scalability, we present a structured mean-field variational inference framework, where the time-dependent properties of the dynamic networks are exploited to facilitate computation and inference. Additionally… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  19. arXiv:2208.12483  [pdf, other

    cs.LG stat.ML

    Dynamic Regret of Online Markov Decision Processes

    Authors: Peng Zhao, Long-Fei Li, Zhi-Hua Zhou

    Abstract: We investigate online Markov Decision Processes (MDPs) with adversarially changing loss functions and known transitions. We choose dynamic regret as the performance measure, defined as the performance difference between the learner and any sequence of feasible changing policies. The measure is strictly stronger than the standard static regret that benchmarks the learner's performance with a fixed… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

  20. arXiv:2208.08693  [pdf, other

    stat.ME econ.EM

    Matrix Quantile Factor Model

    Authors: Xin-Bing Kong, Yong-Xin Liu, Long Yu, Peng Zhao

    Abstract: This paper introduces a matrix quantile factor model for matrix-valued data with a low-rank structure. We estimate the row and column factor spaces via minimizing the empirical check loss function over all panels. We show the estimates converge at rate $1/\min\{\sqrt{p_1p_2}, \sqrt{p_2T},$ $\sqrt{p_1T}\}$ in average Frobenius norm, where $p_1$, $p_2$ and $T$ are the row dimensionality, column dime… ▽ More

    Submitted 26 May, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

  21. arXiv:2207.02121  [pdf, other

    cs.LG stat.ML

    Adapting to Online Label Shift with Provable Guarantees

    Authors: Yong Bai, Yu-Jie Zhang, Peng Zhao, Masashi Sugiyama, Zhi-Hua Zhou

    Abstract: The standard supervised learning paradigm works effectively when training data shares the same distribution as the upcoming testing samples. However, this stationary assumption is often violated in real-world applications, especially when testing data appear in an online fashion. In this paper, we formulate and investigate the problem of \emph{online label shift} (OLaS): the learner trains an init… ▽ More

    Submitted 14 January, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2022; the first two authors contributed equally

  22. arXiv:2206.07766  [pdf, other

    cs.LG stat.ML

    Pareto Invariant Risk Minimization: Towards Mitigating the Optimization Dilemma in Out-of-Distribution Generalization

    Authors: Yongqiang Chen, Kaiwen Zhou, Yatao Bian, Binghui Xie, Bingzhe Wu, Yonggang Zhang, Kaili Ma, Han Yang, Peilin Zhao, Bo Han, James Cheng

    Abstract: Recently, there has been a growing surge of interest in enabling machine learning systems to generalize well to Out-of-Distribution (OOD) data. Most efforts are devoted to advancing optimization objectives that regularize models to capture the underlying invariance; however, there often are compromises in the optimization process of these OOD objectives: i) Many OOD objectives have to be relaxed a… ▽ More

    Submitted 2 March, 2023; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: ICLR 2023, 50 pages, 58 figures

  23. arXiv:2204.07742  [pdf, other

    cs.LG cs.DC stat.ML

    DRFLM: Distributionally Robust Federated Learning with Inter-client Noise via Local Mixup

    Authors: Bingzhe Wu, Zhipeng Liang, Yuxuan Han, Yatao Bian, Peilin Zhao, Junzhou Huang

    Abstract: Recently, federated learning has emerged as a promising approach for training a global model using data from multiple organizations without leaking their raw data. Nevertheless, directly applying federated learning to real-world tasks faces two challenges: (1) heterogeneity in the data among different organizations; and (2) data noises inside individual organizations. In this paper, we propose a… ▽ More

    Submitted 16 April, 2022; originally announced April 2022.

  24. arXiv:2202.06188  [pdf, other

    math.ST math.PR stat.ME

    Testing the number of common factors by bootstrapped sample covariance matrix in high-dimensional factor models

    Authors: Long Yu, Peng Zhao, Wang Zhou

    Abstract: This paper studies the impact of bootstrap procedure on the eigenvalue distributions of the sample covariance matrix under a high-dimensional factor structure. We provide asymptotic distributions for the top eigenvalues of bootstrapped sample covariance matrix under mild conditions. After bootstrap, the spiked eigenvalues which are driven by common factors will converge weakly to Gaussian limits a… ▽ More

    Submitted 20 November, 2023; v1 submitted 12 February, 2022; originally announced February 2022.

    Comments: 102 pages, 9 figures, 6 tables

    MSC Class: 62H25; 60B20

  25. arXiv:2202.06151  [pdf, ps, other

    cs.LG stat.ML

    Corralling a Larger Band of Bandits: A Case Study on Switching Regret for Linear Bandits

    Authors: Haipeng Luo, Mengxiao Zhang, Peng Zhao, Zhi-Hua Zhou

    Abstract: We consider the problem of combining and learning over a set of adversarial bandit algorithms with the goal of adaptively tracking the best one on the fly. The CORRAL algorithm of Agarwal et al. (2017) and its variants (Foster et al., 2020a) achieve this goal with a regret overhead of order $\widetilde{O}(\sqrt{MT})$ where $M$ is the number of base algorithms and $T$ is the time horizon. The polyn… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

  26. arXiv:2202.06150  [pdf, ps, other

    cs.LG stat.ML

    Adaptive Bandit Convex Optimization with Heterogeneous Curvature

    Authors: Haipeng Luo, Mengxiao Zhang, Peng Zhao

    Abstract: We consider the problem of adversarial bandit convex optimization, that is, online learning over a sequence of arbitrary convex loss functions with only one function evaluation for each of them. While all previous works assume known and homogeneous curvature on these loss functions, we study a heterogeneous setting where each function has its own curvature that is only revealed after the learner m… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

  27. arXiv:2201.12736  [pdf, other

    cs.LG cs.GT stat.ML

    No-Regret Learning in Time-Varying Zero-Sum Games

    Authors: Mengxiao Zhang, Peng Zhao, Haipeng Luo, Zhi-Hua Zhou

    Abstract: Learning from repeated play in a fixed two-player zero-sum game is a classic problem in game theory and online learning. We consider a variant of this problem where the game payoff matrix changes over time, possibly in an adversarial manner. We first present three performance measures to guide the algorithmic design for this problem: 1) the well-studied individual regret, 2) an extension of dualit… ▽ More

    Submitted 30 January, 2022; originally announced January 2022.

  28. arXiv:2111.07337  [pdf, other

    cs.LG stat.ML

    $p$-Laplacian Based Graph Neural Networks

    Authors: Guoji Fu, Peilin Zhao, Yatao Bian

    Abstract: Graph neural networks (GNNs) have demonstrated superior performance for semi-supervised node classification on graphs, as a result of their ability to exploit node features and topological information simultaneously. However, most GNNs implicitly assume that the labels of nodes and their neighbors in a graph are the same or consistent, which does not hold in heterophilic graphs, where the labels o… ▽ More

    Submitted 23 June, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

    Comments: ICML'2022

  29. arXiv:2108.02668  [pdf, other

    stat.AP stat.ME

    Covariance Estimation and its Application in Large-Scale Online Controlled Experiments

    Authors: Tao Xiong, Yihan Bao, Penglei Zhao, Yong Wang

    Abstract: During the last few decades, online controlled experiments (also known as A/B tests) have been adopted as a golden standard for measuring business improvements in industry. In our company, there are more than a billion users participating in thousands of experiments simultaneously, and with statistical inference and estimations conducted to thousands of online metrics in those experiments routinel… ▽ More

    Submitted 5 August, 2021; originally announced August 2021.

  30. arXiv:2105.14244  [pdf, other

    cs.LG cs.SI stat.ML

    Learning Graphon Autoencoders for Generative Graph Modeling

    Authors: Hongteng Xu, Peilin Zhao, Junzhou Huang, Dixin Luo

    Abstract: Graphon is a nonparametric model that generates graphs with arbitrary sizes and can be induced from graphs easily. Based on this model, we propose a novel algorithmic framework called \textit{graphon autoencoder} to build an interpretable and scalable graph generative model. This framework treats observed graphs as induced graphons in functional space and derives their latent representations by an… ▽ More

    Submitted 29 May, 2021; originally announced May 2021.

  31. arXiv:2103.16082  [pdf, other

    cs.LG stat.ML

    Optimal Stochastic Nonconvex Optimization with Bandit Feedback

    Authors: Puning Zhao, Lifeng Lai

    Abstract: In this paper, we analyze the continuous armed bandit problems for nonconvex cost functions under certain smoothness and sublevel set assumptions. We first derive an upper bound on the expected cumulative regret of a simple bin splitting method. We then propose an adaptive bin splitting method, which can significantly improve the performance. Furthermore, a minimax lower bound is derived, which sh… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

  32. arXiv:2103.08450  [pdf, other

    stat.AP stat.ML

    Modeling Multivariate Cyber Risks: Deep Learning Dating Extreme Value Theory

    Authors: Mingyue Zhang Wu, **zhu Luo, Xing Fang, Maochao Xu, Peng Zhao

    Abstract: Modeling cyber risks has been an important but challenging task in the domain of cyber security. It is mainly because of the high dimensionality and heavy tails of risk patterns. Those obstacles have hindered the development of statistical modeling of the multivariate cyber risks. In this work, we propose a novel approach for modeling the multivariate cyber risks which relies on the deep learning… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 25 pages

  33. arXiv:2102.03758  [pdf, other

    cs.LG stat.ML

    Non-stationary Online Learning with Memory and Non-stochastic Control

    Authors: Peng Zhao, Yu-Hu Yan, Yu-Xiang Wang, Zhi-Hua Zhou

    Abstract: We study the problem of Online Convex Optimization (OCO) with memory, which allows loss functions to depend on past decisions and thus captures temporal effects of learning problems. In this paper, we introduce dynamic policy regret as the performance measure to design algorithms robust to non-stationary environments, which competes algorithms' decisions with a sequence of changing comparators. We… ▽ More

    Submitted 14 August, 2023; v1 submitted 7 February, 2021; originally announced February 2021.

    Journal ref: Journal of Machine Learning Research, 2023

  34. arXiv:2010.00438  [pdf, other

    stat.ML cs.LG

    Analysis of KNN Density Estimation

    Authors: Puning Zhao, Lifeng Lai

    Abstract: We analyze the $\ell_1$ and $\ell_\infty$ convergence rates of k nearest neighbor density estimation method. Our analysis includes two different cases depending on whether the support set is bounded or not. In the first case, the probability density function has a bounded support and is bounded away from zero. We show that kNN density estimation is minimax optimal under both $\ell_1$ and… ▽ More

    Submitted 29 September, 2020; originally announced October 2020.

  35. arXiv:2009.13714  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

    Authors: Pu Zhao, Parikshit Ram, Songtao Lu, Yuguang Yao, Djallel Bouneffouf, Xue Lin, Sijia Liu

    Abstract: Adversarial perturbations are critical for certifying the robustness of deep learning models. A universal adversarial perturbation (UAP) can simultaneously attack multiple images, and thus offers a more unified threat model, obviating an image-wise attack algorithm. However, the existing UAP generator is underdeveloped when images are drawn from different image sources (e.g., with different image… ▽ More

    Submitted 17 August, 2022; v1 submitted 28 September, 2020; originally announced September 2020.

  36. arXiv:2008.10320  [pdf, other

    cs.CV cs.AI stat.ML

    A Single Frame and Multi-Frame Joint Network for 360-degree Panorama Video Super-Resolution

    Authors: Hongying Liu, Zhubo Ruan, Chaowei Fang, Peng Zhao, Fanhua Shang, Yuanyuan Liu, Lijun Wang

    Abstract: Spherical videos, also known as \ang{360} (panorama) videos, can be viewed with various virtual reality devices such as computers and head-mounted displays. They attract large amount of interest since awesome immersion can be experienced when watching spherical videos. However, capturing, storing and transmitting high-resolution spherical videos are extremely expensive. In this paper, we propose a… ▽ More

    Submitted 24 August, 2020; originally announced August 2020.

    Comments: 10 pages, 5 figures, submitted to an international peer-review journal

  37. arXiv:2007.13518  [pdf, other

    cs.LG stat.ML

    FedML: A Research Library and Benchmark for Federated Machine Learning

    Authors: Chaoyang He, Songze Li, **hyun So, Xiao Zeng, Mi Zhang, Hongyi Wang, Xiaoyang Wang, Praneeth Vepakomma, Abhishek Singh, Hang Qiu, Xinghua Zhu, Jianzong Wang, Li Shen, Peilin Zhao, Yan Kang, Yang Liu, Ramesh Raskar, Qiang Yang, Murali Annavaram, Salman Avestimehr

    Abstract: Federated learning (FL) is a rapidly growing research field in machine learning. However, existing FL libraries cannot adequately support diverse algorithmic development; inconsistent dataset and model usage make fair algorithm comparison challenging. In this work, we introduce FedML, an open research library and benchmark to facilitate FL algorithm development and fair performance comparison. Fed… ▽ More

    Submitted 8 November, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: This is FedML white paper V3. Homepage: https://fedml.ai; GitHub: https://github.com/FedML-AI/FedML; In V3, More advanced algorithms and IoT device training are supported, please check here: https://github.com/FedML-AI/FedML/blob/master/fedml_iot/

  38. arXiv:2007.11280  [pdf, other

    cs.LG stat.ML

    Storage Fit Learning with Feature Evolvable Streams

    Authors: Bo-Jian Hou, Yu-Hu Yan, Peng Zhao, Zhi-Hua Zhou

    Abstract: Feature evolvable learning has been widely studied in recent years where old features will vanish and new features will emerge when learning with streams. Conventional methods usually assume that a label will be revealed after prediction at each time step. However, in practice, this assumption may not hold whereas no label will be given at most time steps. A good solution is to leverage the techni… ▽ More

    Submitted 23 February, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

  39. arXiv:2007.05970  [pdf, other

    cs.LG stat.ML

    Inverse Graph Identification: Can We Identify Node Labels Given Graph Labels?

    Authors: Tian Bian, Xi Xiao, Tingyang Xu, Yu Rong, Wenbing Huang, Peilin Zhao, Junzhou Huang

    Abstract: Graph Identification (GI) has long been researched in graph learning and is essential in certain applications (e.g. social community detection). Specifically, GI requires to predict the label/score of a target graph given its collection of node features and edge connections. While this task is common, more complex cases arise in practice---we are supposed to do the inverse thing by, for example, g… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

  40. arXiv:2007.03479  [pdf, ps, other

    cs.LG stat.ML

    Dynamic Regret of Convex and Smooth Functions

    Authors: Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou

    Abstract: We investigate online convex optimization in non-stationary environments and choose the dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible comparator sequence. Let $T$ be the time horizon and $P_T$ be the path-length that essentially reflects the non-stationarity of environments, the state-of-the-ar… ▽ More

    Submitted 28 November, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

  41. arXiv:2007.02192  [pdf, other

    math.ST stat.AP stat.CO stat.ME stat.ML

    Tail-adaptive Bayesian shrinkage

    Authors: Se Yoon Lee, Peng Zhao, Debdeep Pati, Bani K. Mallick

    Abstract: Robust Bayesian methods for high-dimensional regression problems under diverse sparse regimes are studied. Traditional shrinkage priors are primarily designed to detect a handful of signals from tens of thousands of predictors in the so-called ultra-sparsity domain. However, they may not perform desirably when the degree of sparsity is moderate. In this paper, we propose a robust sparse estimation… ▽ More

    Submitted 19 February, 2024; v1 submitted 4 July, 2020; originally announced July 2020.

  42. arXiv:2006.05933  [pdf, other

    cs.LG cs.IR stat.ML

    AMEIR: Automatic Behavior Modeling, Interaction Exploration and MLP Investigation in the Recommender System

    Authors: Pengyu Zhao, Kecheng Xiao, Yuanxing Zhang, Kaigui Bian, Wei Yan

    Abstract: Recently, deep learning models have been widely spread in the industrial recommender systems and boosted the recommendation quality. Though having achieved remarkable success, the design of task-aware recommender systems usually requires manual feature engineering and architecture engineering from domain experts. To relieve those human efforts, we explore the potential of neural architecture searc… ▽ More

    Submitted 14 June, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

  43. arXiv:2006.05876  [pdf, ps, other

    cs.LG stat.ML

    Improved Analysis for Dynamic Regret of Strongly Convex and Smooth Functions

    Authors: Peng Zhao, Lijun Zhang

    Abstract: In this paper, we present an improved analysis for dynamic regret of strongly convex and smooth functions. Specifically, we investigate the Online Multiple Gradient Descent (OMGD) algorithm proposed by Zhang et al. (2017). The original analysis shows that the dynamic regret of OMGD is at most $\mathcal{O}(\min\{\mathcal{P}_T,\mathcal{S}_T\})$, where $\mathcal{P}_T$ and $\mathcal{S}_T$ are path-len… ▽ More

    Submitted 14 April, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: To appear at L4DC 2021

  44. arXiv:2005.12172  [pdf, ps, other

    stat.ME

    Empirical Likelihood Inference With Public-Use Survey Data

    Authors: Puying Zhao, J. N. K. Rao, Changbao Wu

    Abstract: Public-use survey data are an important source of information for researchers in social science and health studies to build statistical models and make inferences on the target finite population. This paper presents two general inferential tools through the pseudo empirical likelihood and the sample empirical likelihood methods. Theoretical results on point estimation and linear or nonlinear hypot… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: 50 pages, including 11 pages of tables

    MSC Class: 62D05; 62G05; 62G10

  45. arXiv:2005.00060  [pdf, other

    cs.LG cs.CV stat.ML

    Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

    Authors: Pu Zhao, Pin-Yu Chen, Payel Das, Karthikeyan Natesan Ramamurthy, Xue Lin

    Abstract: Mode connectivity provides novel geometric insights on analyzing loss landscapes and enables building high-accuracy pathways between well-trained neural networks. In this work, we propose to employ mode connectivity in loss landscapes to study the adversarial robustness of deep neural networks, and provide novel methods for improving this robustness. Our experiments cover various types of adversar… ▽ More

    Submitted 2 July, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: accepted by ICLR 2020

  46. arXiv:2003.08793  [pdf

    cs.CV cs.LG stat.ML

    Deep Active Learning for Remote Sensing Object Detection

    Authors: Zhenshen Qu, **gda Du, Yong Cao, Qiuyu Guan, Pengbo Zhao

    Abstract: Recently, CNN object detectors have achieved high accuracy on remote sensing images but require huge labor and time costs on annotation. In this paper, we propose a new uncertainty-based active learning which can select images with more information for annotation and detector can still reach high performance with a fraction of the training images. Our method not only analyzes objects' classificati… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: 6 pages, 3 figures

  47. arXiv:2003.05610  [pdf, other

    cs.LG cs.AI stat.ML

    Privacy Preserving Point-of-interest Recommendation Using Decentralized Matrix Factorization

    Authors: Chaochao Chen, Ziqi Liu, Peilin Zhao, Jun Zhou, Xiaolong Li

    Abstract: Points of interest (POI) recommendation has been drawn much attention recently due to the increasing popularity of location-based networks, e.g., Foursquare and Yelp. Among the existing approaches to POI recommendation, Matrix Factorization (MF) based techniques have proven to be effective. However, existing MF approaches suffer from two major problems: (1) Expensive computations and storages due… ▽ More

    Submitted 12 March, 2020; originally announced March 2020.

    Comments: Accepted by AAAI'18

  48. arXiv:2003.03051  [pdf, other

    cs.LG stat.ML

    Cost-Sensitive Portfolio Selection via Deep Reinforcement Learning

    Authors: Yifan Zhang, Peilin Zhao, Qingyao Wu, Bin Li, Junzhou Huang, Mingkui Tan

    Abstract: Portfolio Selection is an important real-world financial task and has attracted extensive attention in artificial intelligence communities. This task, however, has two main difficulties: (i) the non-stationary price series and complex asset correlations make the learning of feature representation very hard; (ii) the practicality principle in financial markets requires controlling both transaction… ▽ More

    Submitted 6 March, 2020; originally announced March 2020.

    Journal ref: IEEE Transactions on Knowledge and Data Engineering (TKDE), 2020

  49. arXiv:2002.11599  [pdf, other

    cs.IT stat.ML

    Minimax Optimal Estimation of KL Divergence for Continuous Distributions

    Authors: Puning Zhao, Lifeng Lai

    Abstract: Estimating Kullback-Leibler divergence from identical and independently distributed samples is an important problem in various domains. One simple and effective estimator is based on the k nearest neighbor distances between these samples. In this paper, we analyze the convergence rates of the bias and variance of this estimator. Furthermore, we derive a lower bound of the minimax mean square error… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  50. arXiv:2002.07891  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Towards Query-Efficient Black-Box Adversary with Zeroth-Order Natural Gradient Descent

    Authors: Pu Zhao, Pin-Yu Chen, Siyue Wang, Xue Lin

    Abstract: Despite the great achievements of the modern deep neural networks (DNNs), the vulnerability/robustness of state-of-the-art DNNs raises security concerns in many application domains requiring high reliability. Various adversarial attacks are proposed to sabotage the learning performance of DNN models. Among those, the black-box adversarial attack methods have received special attentions owing to th… ▽ More

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: accepted by AAAI 2020