Skip to main content

Showing 1–23 of 23 results for author: Hu, Q

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.18577  [pdf, other

    math.OC cs.LG stat.ML

    Single-loop Stochastic Algorithms for Difference of Max-Structured Weakly Convex Functions

    Authors: Quanqi Hu, Qi Qi, Zhaosong Lu, Tianbao Yang

    Abstract: In this paper, we study a class of non-smooth non-convex problems in the form of $\min_{x}[\max_{y\in Y}φ(x, y) - \max_{z\in Z}ψ(x, z)]$, where both $Φ(x) = \max_{y\in Y}φ(x, y)$ and $Ψ(x)=\max_{z\in Z}ψ(x, z)$ are weakly convex functions, and $φ(x, y), ψ(x, z)$ are strongly concave functions in terms of $y$ and $z$, respectively. It covers two families of problems that have been studied but are m… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2401.17646  [pdf, other

    stat.ME

    From Sparse to Dense Functional Data: Phase Transitions from a Simultaneous Inference Perspective

    Authors: Leheng Cai, Qirui Hu

    Abstract: We aim to develop simultaneous inference tools for the mean function of functional data from sparse to dense. First, we derive a unified Gaussian approximation to construct simultaneous confidence bands of mean functions based on the B-spline estimator. Then, we investigate the conditions of phase transitions by decomposing the asymptotic variance of the approximated Gaussian process. As an extens… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  3. arXiv:2312.07032  [pdf, ps, other

    cs.LG stat.ML

    Ahpatron: A New Budgeted Online Kernel Learning Machine with Tighter Mistake Bound

    Authors: Yun Liao, Junfan Li, Shizhong Liao, Qinghua Hu, Jianwu Dang

    Abstract: In this paper, we study the mistake bound of online kernel learning on a budget. We propose a new budgeted online kernel learning model, called Ahpatron, which significantly improves the mistake bound of previous work and resolves the open problem posed by Dekel, Shalev-Shwartz, and Singer (2005). We first present an aggressive variant of Perceptron, named AVP, a model without budget, which uses a… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  4. arXiv:2310.08843  [pdf

    stat.AP

    A Longitudinal Analysis about the Effect of Air Pollution on Astigmatism for Children and Young Adults

    Authors: Lin An, Qiuyue Hu, Jieying Guan, Yingting Zhu, Chenyao Jiang, Xiaoyun Zhong, Shuyue Ma, Dongmei Yu, Canyang Zhang, Yehong Zhuo, Peiwu Qin

    Abstract: Purpose: This study aimed to investigate the correlation between air pollution and astigmatism, considering the detrimental effects of air pollution on respiratory, cardiovascular, and eye health. Methods: A longitudinal study was conducted with 127,709 individuals aged 4-27 years from 9 cities in Guangdong Province, China, spanning from 2019 to 2021. Astigmatism was measured using cylinder values… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  5. arXiv:2310.03234  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Non-Smooth Weakly-Convex Finite-sum Coupled Compositional Optimization

    Authors: Quanqi Hu, Dixian Zhu, Tianbao Yang

    Abstract: This paper investigates new families of compositional optimization problems, called $\underline{\bf n}$on-$\underline{\bf s}$mooth $\underline{\bf w}$eakly-$\underline{\bf c}$onvex $\underline{\bf f}$inite-sum $\underline{\bf c}$oupled $\underline{\bf c}$ompositional $\underline{\bf o}$ptimization (NSWC FCCO). There has been a growing interest in FCCO due to its wide-ranging applications in machin… ▽ More

    Submitted 3 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

  6. arXiv:2308.01314  [pdf, other

    cs.LG cs.SE stat.ML

    Evaluating the Robustness of Test Selection Methods for Deep Neural Networks

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Wei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Testing deep learning-based systems is crucial but challenging due to the required time and labor for labeling collected raw data. To alleviate the labeling effort, multiple test selection methods have been proposed where only a subset of test data needs to be labeled while satisfying testing requirements. However, we observe that such methods with reported promising results are only evaluated und… ▽ More

    Submitted 29 July, 2023; originally announced August 2023.

    Comments: 12 pages

  7. arXiv:2306.10260  [pdf, other

    stat.ME

    Online Local Differential Private Quantile Inference via Self-normalization

    Authors: Yi Liu, Qirui Hu, Lei Ding, Bei Jiang, Linglong Kong

    Abstract: Based on binary inquiries, we developed an algorithm to estimate population quantiles under Local Differential Privacy (LDP). By self-normalizing, our algorithm provides asymptotically normal estimation with valid inference, resulting in tight confidence intervals without the need for nuisance parameters to be estimated. Our proposed method can be conducted fully online, leading to high computatio… ▽ More

    Submitted 7 August, 2023; v1 submitted 17 June, 2023; originally announced June 2023.

  8. arXiv:2305.18730  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Blockwise Stochastic Variance-Reduced Methods with Parallel Speedup for Multi-Block Bilevel Optimization

    Authors: Quanqi Hu, Zi-Hao Qiu, Zhishuai Guo, Lijun Zhang, Tianbao Yang

    Abstract: In this paper, we consider non-convex multi-block bilevel optimization (MBBO) problems, which involve $m\gg 1$ lower level problems and have important applications in machine learning. Designing a stochastic gradient and controlling its variance is more intricate due to the hierarchical sampling of blocks and data and the unique challenge of estimating hyper-gradient. We aim to achieve three nice… ▽ More

    Submitted 2 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  9. arXiv:2305.11965  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization

    Authors: Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang

    Abstract: In this paper, we aim to optimize a contrastive loss with individualized temperatures in a principled and systematic manner for self-supervised learning. The common practice of using a global temperature parameter $τ$ ignores the fact that ``not all semantics are created equal", meaning that different anchor data may have different numbers of samples with similar semantics, especially when data ex… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 33 pages, 11 figures, accepted by ICML2023

  10. arXiv:2206.02017  [pdf, ps, other

    stat.ME

    Feature screening for multi-response linear models by empirical likelihood

    Authors: Jun Lu, Qinqin Hu, Lu Lin

    Abstract: This paper proposes a new feature screening method for the multi-response ultrahigh dimensional linear model by empirical likelihood. Through a multivariate moment condition, the empirical likelihood induced ranking statistics can exploit the joint effect among responses, and thus result in a much better performance than the methods considering responses individually. More importantly, by the use… ▽ More

    Submitted 4 June, 2022; originally announced June 2022.

  11. arXiv:2206.00260  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Multi-block Min-max Bilevel Optimization with Applications in Multi-task Deep AUC Maximization

    Authors: Quanqi Hu, Yongjian Zhong, Tianbao Yang

    Abstract: In this paper, we study multi-block min-max bilevel optimization problems, where the upper level is non-convex strongly-concave minimax objective and the lower level is a strongly convex objective, and there are multiple blocks of dual variables and lower level problems. Due to the intertwined multi-block min-max bilevel structure, the computational cost at each iteration could be prohibitively hi… ▽ More

    Submitted 17 November, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

  12. arXiv:2203.16261  [pdf

    q-bio.GN cs.DC cs.SE stat.AP

    Packaging, containerization, and virtualization of computational omics methods: Advances, challenges, and opportunities

    Authors: Mohammed Alser, Sharon Waymost, Ram Ayyala, Brendan Lawlor, Richard J. Abdill, Neha Rajkumar, Nathan LaPierre, Jaqueline Brito, Andre M. Ribeiro-dos-Santos, Can Firtina, Nour Almadhoun, Varuni Sarwal, Eleazar Eskin, Qiyang Hu, Derek Strong, Byoung-Do, Kim, Malak S. Abedalthagafi, Onur Mutlu, Serghei Mangul

    Abstract: Omics software tools have reshaped the landscape of modern biology and become an essential component of biomedical research. The increasing dependence of biomedical scientists on these powerful tools creates a need for easier installation and greater usability. Packaging, virtualization, and containerization are different approaches to satisfy this need by wrap** omics tools in additional softwa… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

  13. arXiv:2202.12183  [pdf, other

    cs.LG cs.AI cs.IR math.OC stat.ML

    Large-scale Stochastic Optimization of NDCG Surrogates for Deep Learning with Provable Convergence

    Authors: Zi-Hao Qiu, Quanqi Hu, Yongjian Zhong, Lijun Zhang, Tianbao Yang

    Abstract: NDCG, namely Normalized Discounted Cumulative Gain, is a widely used ranking metric in information retrieval and machine learning. However, efficient and provable stochastic methods for maximizing NDCG are still lacking, especially for deep models. In this paper, we propose a principled approach to optimize NDCG and its top-$K$ variant. First, we formulate a novel compositional optimization proble… ▽ More

    Submitted 2 February, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: 32 pages, 12 figures; Accepted by ICML2022

  14. arXiv:2112.13356  [pdf, other

    stat.ME

    Transfer Learning in High-dimensional Semi-parametric Graphical Models with Application to Brain Connectivity Analysis

    Authors: Yong He, Qiushi Li, Qinqin Hu, Lei Liu

    Abstract: Transfer learning has drawn growing attention with the target of improving statistical efficiency of one study (dataset) by digging information from similar and related auxiliary studies (datasets). In the article, we consider transfer learning problem in estimating undirected semi-parametric graphical model. We propose an algorithm called Trans-Copula-CLIME for estimating undirected graphical mod… ▽ More

    Submitted 26 December, 2021; originally announced December 2021.

  15. arXiv:2112.10151  [pdf, ps, other

    math.ST stat.ME

    Edge differentially private estimation in the $β$-model via jittering and method of moments

    Authors: **yuan Chang, Qiao Hu, Eric D. Kolaczyk, Qiwei Yao, Fengting Yi

    Abstract: A standing challenge in data privacy is the trade-off between the level of privacy and the efficiency of statistical inference. Here we conduct an in-depth study of this trade-off for parameter estimation in the $β$-model (Chatterjee, Diaconis and Sly, 2011) for edge differentially private network data released via jittering (Karwa, Krivitsky and Slavković, 2017). Unlike most previous approaches b… ▽ More

    Submitted 2 April, 2024; v1 submitted 19 December, 2021; originally announced December 2021.

    Journal ref: Annals of Statistics 2024, Vol. 52, pp. 708-728

  16. arXiv:2007.06240  [pdf, other

    cs.CV cs.LG stat.ML

    Expert Training: Task Hardness Aware Meta-Learning for Few-Shot Classification

    Authors: Yucan Zhou, Yu Wang, Jianfei Cai, Yu Zhou, Qinghua Hu, Wei** Wang

    Abstract: Deep neural networks are highly effective when a large number of labeled samples are available but fail with few-shot classification tasks. Recently, meta-learning methods have received much attention, which train a meta-learner on massive additional tasks to gain the knowledge to instruct the few-shot classification. Usually, the training tasks are randomly sampled and performed indiscriminately,… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: 9 pages, 6 figures

  17. arXiv:2006.15284  [pdf, other

    cs.LG stat.ML

    Stochastic Batch Augmentation with An Effective Distilled Dynamic Soft Label Regularizer

    Authors: Qian Li, Qingyuan Hu, Yong Qi, Saiyu Qi, Jie Ma, Jian Zhang

    Abstract: Data augmentation have been intensively used in training deep neural network to improve the generalization, whether in original space (e.g., image space) or representation space. Although being successful, the connection between the synthesized data and the original data is largely ignored in training, without considering the distribution information that the synthesized samples are surrounding th… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

    Comments: Accepted by IJCAI 2020. SOLE copyright holder is IJCAI (international Joint Conferences on Artificial Intelligence)

  18. arXiv:2006.13044  [pdf, other

    cs.LG eess.SP stat.ML

    Scheduling Policy and Power Allocation for Federated Learning in NOMA Based MEC

    Authors: Xiang Ma, Haijian Sun, Rose Qingyang Hu

    Abstract: Federated learning (FL) is a highly pursued machine learning technique that can train a model centrally while kee** data distributed. Distributed computation makes FL attractive for bandwidth limited applications especially in wireless communications. There can be a large number of distributed edge devices connected to a central parameter server (PS) and iteratively download/upload data from/to… ▽ More

    Submitted 21 June, 2020; originally announced June 2020.

  19. arXiv:2001.06576  [pdf, other

    cs.LG cs.SI stat.ML

    Inference for Network Structure and Dynamics from Time Series Data via Graph Neural Network

    Authors: Mengyuan Chen, Jiang Zhang, Zhang Zhang, Lun Du, Qiao Hu, Shuo Wang, Jiaqi Zhu

    Abstract: Network structures in various backgrounds play important roles in social, technological, and biological systems. However, the observable network structures in real cases are often incomplete or unavailable due to measurement errors or private protection issues. Therefore, inferring the complete network structure is useful for understanding complex systems. The existing studies have not fully solve… ▽ More

    Submitted 17 January, 2020; originally announced January 2020.

  20. arXiv:1906.03768  [pdf, ps, other

    stat.ML cs.IR cs.LG

    A cost-reducing partial labeling estimator in text classification problem

    Authors: Jiangning Chen, Zhibo Dai, Juntao Duan, Qianli Hu, Ruilin Li, Heinrich Matzinger, Ionel Popescu, Haoyan Zhai

    Abstract: We propose a new approach to address the text classification problems when learning with partial labels is beneficial. Instead of offering each training sample a set of candidate labels, we assign negative-oriented labels to the ambiguous training examples if they are unlikely fall into certain classes. We construct our new maximum likelihood estimators with self-correction property, and prove tha… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

  21. Optimal covariance matrix estimation for high-dimensional noise in high-frequency data

    Authors: **yuan Chang, Qiao Hu, Cheng Liu, Cheng Yong Tang

    Abstract: We consider high-dimensional measurement errors with high-frequency data. Our objective is on recovering the high-dimensional cross-sectional covariance matrix of the random errors with optimality. In this problem, not all components of the random vector are observed at the same time and the measurement errors are latent variables, leading to major challenges besides high data dimensionality. We p… ▽ More

    Submitted 10 September, 2022; v1 submitted 19 December, 2018; originally announced December 2018.

    Journal ref: Journal of Econometrics 2024, Vol. 239, 105329

  22. arXiv:1810.13192  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Nearly-tight bounds on linear regions of piecewise linear neural networks

    Authors: Qiang Hu, Hao Zhang

    Abstract: The developments of deep neural networks (DNN) in recent years have ushered a brand new era of artificial intelligence. DNNs are proved to be excellent in solving very complex problems, e.g., visual recognition and text understanding, to the extent of competing with or even surpassing people. Despite inspiring and encouraging success of DNNs, thorough theoretical analyses still lack to unravel the… ▽ More

    Submitted 26 December, 2018; v1 submitted 31 October, 2018; originally announced October 2018.

    Comments: Counting linear regions of neural networks

  23. arXiv:1806.00580  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Detecting Adversarial Examples via Key-based Network

    Authors: Pinlong Zhao, Zhouyu Fu, Ou wu, Qinghua Hu, Jun Wang

    Abstract: Though deep neural networks have achieved state-of-the-art performance in visual classification, recent studies have shown that they are all vulnerable to the attack of adversarial examples. Small and often imperceptible perturbations to the input images are sufficient to fool the most powerful deep neural networks. Various defense methods have been proposed to address this issue. However, they ei… ▽ More

    Submitted 2 June, 2018; originally announced June 2018.

    Comments: 6 pages