Skip to main content

Showing 1–21 of 21 results for author: Liang, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.19557  [pdf, other

    stat.ML cs.LG

    Neural Dynamic Data Valuation

    Authors: Zhangyong Liang, Huanhuan Gao, Ji Zhang

    Abstract: Data constitute the foundational component of the data economy and its marketplaces. Efficient and fair data valuation has emerged as a topic of significant interest.\ Many approaches based on marginal contribution have shown promising results in various downstream tasks. However, they are well known to be computationally expensive as they require training a large number of utility functions, whic… ▽ More

    Submitted 12 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 43 pages, 19 figures

  2. arXiv:2404.17561  [pdf, other

    stat.ME stat.ML

    Structured Conformal Inference for Matrix Completion with Applications to Group Recommender Systems

    Authors: Ziyi Liang, Tianmin Xie, Xin Tong, Matteo Sesia

    Abstract: We develop a conformal inference method to construct joint confidence regions for structured groups of missing entries within a sparsely observed matrix. This method is useful to provide reliable uncertainty estimation for group-level collaborative filtering; for example, it can be applied to help suggest a movie for a group of friends to watch together. Unlike standard conformal techniques, which… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  3. arXiv:2312.09393  [pdf

    stat.AP

    Bi-scale Car-following Model Calibration for Corridor Based on Trajectory

    Authors: Keke Long, Haotian Shi, Zhiwei Chen, Zhaohui Liang, Xiaopeng Li, Felipe de Souza

    Abstract: The precise estimation of macroscopic traffic parameters, such as travel time and fuel consumption, is essential for the optimization of traffic management systems. Despite its importance, the comprehensive acquisition of vehicle trajectory data for the calculation of these macroscopic measures presents a challenge. To bridge this gap, this study aims to calibrate car-following models capable of p… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  4. arXiv:2301.11721  [pdf, other

    stat.ML cs.AI cs.LG

    Single-Trajectory Distributionally Robust Reinforcement Learning

    Authors: Zhipeng Liang, Xiaoteng Ma, Jose Blanchet, Jiheng Zhang, Zhengyuan Zhou

    Abstract: As a framework for sequential decision-making, Reinforcement Learning (RL) has been regarded as an essential component leading to Artificial General Intelligence (AGI). However, RL is often criticized for having the same training environment as the test one, which also hinders its application in the real world. To mitigate this problem, Distributionally Robust RL (DRRL) is proposed to improve the… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: First two authors contribute equally

  5. arXiv:2301.11556  [pdf, other

    stat.ML cs.LG math.ST

    Conformal inference is (almost) free for neural networks trained with early stop**

    Authors: Ziyi Liang, Yanfei Zhou, Matteo Sesia

    Abstract: Early stop** based on hold-out data is a popular regularization technique designed to mitigate overfitting and increase the predictive accuracy of neural networks. Models trained with early stop** often provide relatively accurate predictions, but they generally still lack precise statistical guarantees unless they are further calibrated using independent hold-out data. This paper addresses th… ▽ More

    Submitted 26 June, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Updates: extension to quantile regression, some further details about methodology, more numerical experiments

  6. arXiv:2210.11050  [pdf, other

    cs.LG stat.ML

    Vertical Federated Linear Contextual Bandits

    Authors: Zeyu Cao, Zhipeng Liang, Shu Zhang, Hangyu Li, Ouyang Wen, Yu Rong, Peilin Zhao, Bingzhe Wu

    Abstract: In this paper, we investigate a novel problem of building contextual bandits in the vertical federated setting, i.e., contextual information is vertically distributed over different departments. This problem remains largely unexplored in the research community. To this end, we carefully design a customized encryption scheme named orthogonal matrix-based mask mechanism(O3M) for encrypting local con… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  7. arXiv:2209.06620  [pdf, other

    cs.LG cs.AI stat.ML

    Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

    Authors: Xiaoteng Ma, Zhipeng Liang, Jose Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou

    Abstract: Among the reasons hindering reinforcement learning (RL) applications to real-world problems, two factors are critical: limited data and the mismatch between the testing environment (real environment in which the policy is deployed) and the training environment (e.g., a simulator). This paper attempts to address these issues simultaneously with distributionally robust offline RL, where we learn a d… ▽ More

    Submitted 27 January, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: First two authors contribute equally

  8. arXiv:2208.11111  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Integrative conformal p-values for powerful out-of-distribution testing with labeled outliers

    Authors: Ziyi Liang, Matteo Sesia, Wenguang Sun

    Abstract: This paper develops novel conformal methods to test whether a new observation was sampled from the same distribution as a reference set. Blending inductive and transductive conformal inference in an innovative way, the described methods can re-weight standard conformal p-values based on dependent side information from known out-of-distribution data in a principled way, and can automatically take a… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

  9. arXiv:2206.08111  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    On Private Online Convex Optimization: Optimal Algorithms in $\ell_p$-Geometry and High Dimensional Contextual Bandits

    Authors: Yuxuan Han, Zhicong Liang, Zhipeng Liang, Yang Wang, Yuan Yao, Jiheng Zhang

    Abstract: Differentially private (DP) stochastic convex optimization (SCO) is ubiquitous in trustworthy machine learning algorithm design. This paper studies the DP-SCO problem with streaming data sampled from a distribution and arrives sequentially. We also consider the continual release model where parameters related to private information are updated and released upon each new data, often known as the on… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: This is the extended version of the paper appeared in the 39th International Conference on Machine Learning (ICML 2022): Optimal Private Streaming SCO in $\ell_p$-geometry with Applications in High Dimensional Online Decision Making

  10. arXiv:2204.07742  [pdf, other

    cs.LG cs.DC stat.ML

    DRFLM: Distributionally Robust Federated Learning with Inter-client Noise via Local Mixup

    Authors: Bingzhe Wu, Zhipeng Liang, Yuxuan Han, Yatao Bian, Peilin Zhao, Junzhou Huang

    Abstract: Recently, federated learning has emerged as a promising approach for training a global model using data from multiple organizations without leaking their raw data. Nevertheless, directly applying federated learning to real-world tasks faces two challenges: (1) heterogeneity in the data among different organizations; and (2) data noises inside individual organizations. In this paper, we propose a… ▽ More

    Submitted 16 April, 2022; originally announced April 2022.

  11. arXiv:2203.11461  [pdf, other

    stat.ME stat.ML

    Locally Adaptive Algorithms for Multiple Testing with Network Structure, with Application to Genome-Wide Association Studies

    Authors: Ziyi Liang, T. Tony Cai, Wenguang Sun, Yin Xia

    Abstract: Linkage analysis has provided valuable insights to the GWAS studies, particularly in revealing that SNPs in linkage disequilibrium (LD) can jointly influence disease phenotypes. However, the potential of LD network data has often been overlooked or underutilized in the literature. In this paper, we propose a locally adaptive structure learning algorithm (LASLA) that provides a principled and gener… ▽ More

    Submitted 16 August, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 33 pages, 7 figures

  12. arXiv:2106.03365  [pdf, ps, other

    stat.ML cs.LG

    Generalized Linear Bandits with Local Differential Privacy

    Authors: Yuxuan Han, Zhipeng Liang, Yang Wang, Jiheng Zhang

    Abstract: Contextual bandit algorithms are useful in personalized online decision-making. However, many applications such as personalized medicine and online advertising require the utilization of individual-specific information for effective learning, while user's data should remain private from the server due to privacy concerns. This motivates the introduction of local differential privacy (LDP), a strin… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  13. arXiv:2012.05577   

    stat.ME

    Context-dependent Ranking and Selection under a Bayesian Framework

    Authors: Haidong Li, Henry Lam, Zhe Liang, Yijie Peng

    Abstract: We consider a context-dependent ranking and selection problem. The best design is not universal but depends on the contexts. Under a Bayesian framework, we develop a dynamic sampling scheme for context-dependent optimization (DSCO) to efficiently learn and select the best designs in all contexts. The proposed sampling scheme is proved to be consistent. Numerical experiments show that the proposed… ▽ More

    Submitted 18 December, 2020; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: The article was published without the co-Author's notice, and it is withdrawn due to his objection

  14. arXiv:2009.03510  [pdf, other

    cs.LG cs.CR stat.ML

    FedCM: A Real-time Contribution Measurement Method for Participants in Federated Learning

    Authors: Boyi Liu, Bingjie Yan, Yize Zhou, Zhixuan Liang, Cheng-Zhong Xu

    Abstract: Federated Learning (FL) creates an ecosystem for multiple agents to collaborate on building models with data privacy consideration. The method for contribution measurement of each agent in the FL system is critical for fair credits allocation but few are proposed. In this paper, we develop a real-time contribution measurement method FedCM that is simple but powerful. The method defines the impact… ▽ More

    Submitted 11 February, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

  15. arXiv:2007.03254  [pdf, ps, other

    cs.LG stat.ML

    Auto-CASH: Autonomous Classification Algorithm Selection with Deep Q-Network

    Authors: Tianyu Mu, Hongzhi Wang, Chunnan Wang, Zheng Liang

    Abstract: The great amount of datasets generated by various data sources have posed the challenge to machine learning algorithm selection and hyperparameter configuration. For a specific machine learning task, it usually takes domain experts plenty of time to select an appropriate algorithm and configure its hyperparameters. If the problem of algorithm selection and hyperparameter optimization can be solved… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

  16. The Geometry of Nonlinear Embeddings in Kernel Discriminant Analysis

    Authors: Jiae Kim, Yoonkyung Lee, Zhiyu Liang

    Abstract: Fisher's linear discriminant analysis is a classical method for classification, yet it is limited to capturing linear features only. Kernel discriminant analysis as an extension is known to successfully alleviate the limitation through a nonlinear feature map**. We study the geometry of nonlinear embeddings in discriminant analysis with polynomial kernels and Gaussian kernel by identifying the p… ▽ More

    Submitted 12 May, 2020; originally announced May 2020.

  17. arXiv:2005.00218  [pdf, other

    cs.LG stat.ML

    Differentially Private Federated Learning with Laplacian Smoothing

    Authors: Zhicong Liang, Bao Wang, Quanquan Gu, Stanley Osher, Yuan Yao

    Abstract: Federated learning aims to protect data privacy by collaboratively learning a model without sharing private data among users. However, an adversary may still be able to infer the private training data by attacking the released model. Differential privacy provides a statistical protection against such attacks at the price of significantly degrading the accuracy or utility of the trained models. In… ▽ More

    Submitted 10 September, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

  18. Real-time Data-driven Quality Assessment for Continuous Manufacturing of Carbon Nanotube Buckypaper

    Authors: Xinran Shi, Xiaowei Yue, Zhiyong Liang, Jianjun Shi

    Abstract: Carbon nanotube (CNT) thin sheet, or buckypaper, has shown great potential as a multifunctional platform material due to its desirable properties, including its lightweight nature, high mechanical properties, and good conductivity. However, their mass adoption and applications by industry have run into significant bottlenecks because of large variability and uncertainty in quality during fabricati… ▽ More

    Submitted 19 April, 2020; originally announced April 2020.

  19. arXiv:1808.09940  [pdf, other

    q-fin.PM cs.LG stat.ML

    Adversarial Deep Reinforcement Learning in Portfolio Management

    Authors: Zhipeng Liang, Hao Chen, Junhao Zhu, Kangkang Jiang, Yanran Li

    Abstract: In this paper, we implement three state-of-art continuous reinforcement learning algorithms, Deep Deterministic Policy Gradient (DDPG), Proximal Policy Optimization (PPO) and Policy Gradient (PG)in portfolio management. All of them are widely-used in game playing and robot control. What's more, PPO has appealing theoretical propeties which is hopefully potential in portfolio management. We present… ▽ More

    Submitted 17 November, 2018; v1 submitted 29 August, 2018; originally announced August 2018.

  20. arXiv:1804.03109  [pdf

    stat.ME

    Tensor Mixed Effects Model with Applications in Nanomanufacturing Inspection

    Authors: Xiaowei Yue, ** Gyu Park, Zhiyong Liang, Jianjun Shi

    Abstract: Raman map** technique has been used to perform in-line quality inspections of nanomanufacturing processes. In such an application, massive high-dimensional Raman map** data with mixed effects is generated. In general, fixed effects and random effects in the multi-array Raman data are associated with different quality characteristics such as fabrication consistency, uniformity, defects, et al.… ▽ More

    Submitted 6 March, 2019; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: 29 pages, 8 figures

    Journal ref: Technometrics, 2019

  21. arXiv:1710.02944  [pdf, ps, other

    econ.EM stat.CO stat.ME

    A Unified Approach on the Local Power of Panel Unit Root Tests

    Authors: Zhongwen Liang

    Abstract: In this paper, a unified approach is proposed to derive the exact local asymptotic power for panel unit root tests, which is one of the most important issues in nonstationary panel data literature. Two most widely used panel unit root tests known as Levin-Lin-Chu (LLC, Levin, Lin and Chu (2002)) and Im-Pesaran-Shin (IPS, Im, Pesaran and Shin (2003)) tests are systematically studied for various sit… ▽ More

    Submitted 9 October, 2017; originally announced October 2017.

    Comments: 67 pages, 1 figure