Skip to main content

Showing 1–17 of 17 results for author: Qu, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.18697  [pdf, other

    stat.ML cs.LG cs.SI math.OC math.ST

    Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting

    Authors: Serina Chang, Frederic Koehler, Zhaonan Qu, Jure Leskovec, Johan Ugander

    Abstract: A common network inference problem, arising from real-world data constraints, is how to infer a dynamic network from its time-aggregated adjacency matrix and time-varying marginals (i.e., row and column sums). Prior approaches to this problem have repurposed the classic iterative proportional fitting (IPF) procedure, also known as Sinkhorn's algorithm, with promising empirical results. However, th… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  2. arXiv:2211.15125  [pdf, other

    stat.ME stat.AP

    Global Depths for Irregularly Observed Multivariate Functional Data

    Authors: Zhuo Qu, Wenlin Dai, Marc G. Genton

    Abstract: Two frameworks for multivariate functional depth based on multivariate depths are introduced in this paper. The first framework is multivariate functional integrated depth, and the second framework involves multivariate functional extremal depth, which is an extension of the extremal depth for univariate functional data. In each framework, global and local multivariate functional depths are propos… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: 29 pages, 6 figures

  3. arXiv:2209.00809  [pdf, other

    math.OC cs.DS cs.LG stat.ML

    Optimal Diagonal Preconditioning

    Authors: Zhaonan Qu, Wenzhi Gao, Oliver Hinder, Yinyu Ye, Zhengyuan Zhou

    Abstract: Preconditioning has long been a staple technique in optimization, often applied to reduce the condition number of a matrix and speed up the convergence of algorithms. Although there are many popular preconditioning techniques in practice, most lack guarantees on reductions in condition number. Moreover, the degree to which we can improve over existing heuristic preconditioners remains an important… ▽ More

    Submitted 4 November, 2022; v1 submitted 2 September, 2022; originally announced September 2022.

  4. arXiv:2206.05891  [pdf, other

    cs.LG cs.DC stat.ML

    Anchor Sampling for Federated Learning with Partial Client Participation

    Authors: Feijie Wu, Song Guo, Zhihao Qu, Shiqi He, Ziming Liu, **g Gao

    Abstract: Compared with full client participation, partial client participation is a more practical scenario in federated learning, but it may amplify some challenges in federated learning, such as data heterogeneity. The lack of inactive clients' updates in partial client participation makes it more likely for the model aggregation to deviate from the aggregation based on full client participation. Trainin… ▽ More

    Submitted 28 May, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: ICML 2023

  5. arXiv:2204.12135  [pdf, other

    stat.ME stat.CO

    Robust Two-Layer Partition Clustering of Sparse Multivariate Functional Data

    Authors: Zhuo Qu, Wenlin Dai, Marc G. Genton

    Abstract: A novel elastic time distance for sparse multivariate functional data is proposed and used to develop a robust distance-based two-layer partition clustering method. With this proposed distance, the new approach not only can detect correct clusters for sparse multivariate functional data under outlier settings but also can detect those outliers that do not belong to any clusters. Classical distance… ▽ More

    Submitted 18 March, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: 31 pages, 9 figures

    MSC Class: 62H30

  6. arXiv:2107.12420  [pdf, other

    stat.ME econ.EM math.ST

    Semiparametric Estimation of Treatment Effects in Observational Studies with Heterogeneous Partial Interference

    Authors: Zhaonan Qu, Ruoxuan Xiong, Jizhou Liu, Guido Imbens

    Abstract: In many observational studies in social science and medicine, subjects or units are connected, and one unit's treatment and attributes may affect another's treatment and outcome, violating the stable unit treatment value assumption (SUTVA) and resulting in interference. To enable feasible estimation and inference, many previous works assume exchangeability of interfering units (neighbors). However… ▽ More

    Submitted 22 June, 2024; v1 submitted 26 July, 2021; originally announced July 2021.

  7. Sparse Functional Boxplots for Multivariate Curves

    Authors: Zhuo Qu, Marc G. Genton

    Abstract: This paper introduces the sparse functional boxplot and the intensity sparse functional boxplot as practical exploratory tools. Besides being available for complete functional data, they can be used in sparse univariate and multivariate functional data. The sparse functional boxplot, based on the functional boxplot, displays sparseness proportions within the 50\% central region. The intensity spar… ▽ More

    Submitted 27 May, 2022; v1 submitted 14 March, 2021; originally announced March 2021.

    Comments: 33 pages, 7 figures

  8. arXiv:2007.05690  [pdf, other

    cs.LG stat.ML

    A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg

    Authors: Zhaonan Qu, Kaixiang Lin, Zhaojian Li, Jiayu Zhou, Zhengyuan Zhou

    Abstract: Federated learning (FL) learns a model jointly from a set of participating devices without sharing each other's privately held data. The characteristics of non-i.i.d. data across the network, low device participation, high communication costs, and the mandate that data remain private bring challenges in understanding the convergence of FL algorithms, particularly regarding how convergence scales w… ▽ More

    Submitted 31 December, 2023; v1 submitted 11 July, 2020; originally announced July 2020.

    Journal ref: Journal of Artificial Intelligence Research 78 (2023) 1143-1200

  9. arXiv:2007.03071  [pdf, other

    cs.LG stat.ML

    Deep Partial Updating: Towards Communication Efficient Updating for On-device Inference

    Authors: Zhongnan Qu, Cong Liu, Lothar Thiele

    Abstract: Emerging edge intelligence applications require the server to retrain and update deep neural networks deployed on remote edge nodes to leverage newly collected data samples. Unfortunately, it may be impossible in practice to continuously send fully updated weights to these edge nodes due to the highly constrained communication resource. In this paper, we propose the weight-wise deep partial updati… ▽ More

    Submitted 27 July, 2022; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: Published in ECCV 2022

  10. arXiv:2003.08793  [pdf

    cs.CV cs.LG stat.ML

    Deep Active Learning for Remote Sensing Object Detection

    Authors: Zhenshen Qu, **gda Du, Yong Cao, Qiuyu Guan, Pengbo Zhao

    Abstract: Recently, CNN object detectors have achieved high accuracy on remote sensing images but require huge labor and time costs on annotation. In this paper, we propose a new uncertainty-based active learning which can select images with more information for annotation and detector can still reach high performance with a fraction of the training images. Our method not only analyzes objects' classificati… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: 6 pages, 3 figures

  11. arXiv:2003.07545  [pdf, other

    cs.LG econ.EM math.ST stat.ML

    Interpretable Personalization via Policy Learning with Linear Decision Boundaries

    Authors: Zhaonan Qu, Isabella Qian, Zhengyuan Zhou

    Abstract: With the rise of the digital economy and an explosion of available information about consumers, effective personalization of goods and services has become a core business focus for companies to improve revenues and maintain a competitive edge. This paper studies the personalization problem through the lens of policy learning, where the goal is to learn a decision-making rule (a policy) that maps f… ▽ More

    Submitted 2 November, 2022; v1 submitted 17 March, 2020; originally announced March 2020.

  12. arXiv:2001.08277  [pdf, ps, other

    cs.LG stat.ML

    Intermittent Pulling with Local Compensation for Communication-Efficient Federated Learning

    Authors: Haozhao Wang, Zhihao Qu, Song Guo, Xin Gao, Ruixuan Li, Baoliu Ye

    Abstract: Federated Learning is a powerful machine learning paradigm to cooperatively train a global model with highly distributed data. A major bottleneck on the performance of distributed Stochastic Gradient Descent (SGD) algorithm for large-scale Federated Learning is the communication overhead on pushing local gradients and pulling global model. In this paper, to reduce the communication complexity of F… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

  13. arXiv:2001.02856  [pdf, other

    stat.ML cs.LG

    D-GCCA: Decomposition-based Generalized Canonical Correlation Analysis for Multi-view High-dimensional Data

    Authors: Hai Shu, Zhe Qu, Hongtu Zhu

    Abstract: Modern biomedical studies often collect multi-view data, that is, multiple types of data measured on the same set of objects. A popular model in high-dimensional multi-view data analysis is to decompose each view's data matrix into a low-rank common-source matrix generated by latent factors common across all data views, a low-rank distinctive-source matrix corresponding to each view, and an additi… ▽ More

    Submitted 16 September, 2022; v1 submitted 9 January, 2020; originally announced January 2020.

    Comments: The publisher's version is available at https://www.jmlr.org/papers/v23/20-021.html

    Journal ref: Journal of Machine Learning Research, 23(169):1-64, 2022

  14. arXiv:1912.09989  [pdf, other

    stat.ML cs.LG

    CDPA: Common and Distinctive Pattern Analysis between High-dimensional Datasets

    Authors: Hai Shu, Zhe Qu

    Abstract: A representative model in integrative analysis of two high-dimensional correlated datasets is to decompose each data matrix into a low-rank common matrix generated by latent factors shared across datasets, a low-rank distinctive matrix corresponding to each dataset, and an additive noise matrix. Existing decomposition methods claim that their common matrices capture the common pattern of the two d… ▽ More

    Submitted 5 April, 2022; v1 submitted 20 December, 2019; originally announced December 2019.

    Journal ref: Electronic Journal of Statistics, 2022, 16 (1), 2475-2517

  15. arXiv:1901.08669  [pdf, ps, other

    cs.LG math.OC stat.ML

    SAGA with Arbitrary Sampling

    Authors: Xu Qian, Zheng Qu, Peter Richtárik

    Abstract: We study the problem of minimizing the average of a very large number of smooth functions, which is of key importance in training supervised learning models. One of the most celebrated methods in this context is the SAGA algorithm. Despite years of research on the topic, a general-purpose version of SAGA---one that would include arbitrary importance sampling and minibatching schemes---does not exi… ▽ More

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: 27 pages, 8 Figures, 1 algorithm

  16. arXiv:1512.09103  [pdf, other

    math.OC cs.DS math.NA stat.ML

    Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling

    Authors: Zeyuan Allen-Zhu, Zheng Qu, Peter Richtárik, Yang Yuan

    Abstract: Accelerated coordinate descent is widely used in optimization due to its cheap per-iteration cost and scalability to large-scale problems. Up to a primal-dual transformation, it is also the same as accelerated stochastic gradient descent that is one of the central methods used in machine learning. In this paper, we improve the best known running time of accelerated coordinate descent by a factor… ▽ More

    Submitted 27 May, 2016; v1 submitted 30 December, 2015; originally announced December 2015.

    Comments: same result, but polished writing

  17. arXiv:1502.08053  [pdf, ps, other

    math.OC cs.LG stat.ML

    Stochastic Dual Coordinate Ascent with Adaptive Probabilities

    Authors: Dominik Csiba, Zheng Qu, Peter Richtárik

    Abstract: This paper introduces AdaSDCA: an adaptive variant of stochastic dual coordinate ascent (SDCA) for solving the regularized empirical risk minimization problems. Our modification consists in allowing the method adaptively change the probability distribution over the dual variables throughout the iterative process. AdaSDCA achieves provably better complexity bound than SDCA with the best fixed proba… ▽ More

    Submitted 27 February, 2015; originally announced February 2015.