Skip to main content

Showing 1–50 of 259 results for author: Qiu

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.02689  [pdf, ps, other

    cs.LG cs.DC math.OC stat.ML

    Accelerating Distributed Optimization: A Primal-Dual Perspective on Local Steps

    Authors: Junchi Yang, Murat Yildirim, Qiu Feng

    Abstract: In distributed machine learning, efficient training across multiple agents with different data distributions poses significant challenges. Even with a centralized coordinator, current algorithms that achieve optimal communication complexity typically require either large minibatches or compromise on gradient complexity. In this work, we tackle both centralized and decentralized settings across str… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2407.00529  [pdf, other

    cs.LG cs.SD eess.AS math.ST stat.ML

    Detecting and Identifying Selection Structure in Sequential Data

    Authors: Yujia Zheng, Zeyu Tang, Yiwen Qiu, Bernhard Schölkopf, Kun Zhang

    Abstract: We argue that the selective inclusion of data points based on latent objectives is common in practical situations, such as music sequences. Since this selection process often distorts statistical analysis, previous work primarily views it as a bias to be corrected and proposes various methods to mitigate its effect. However, while controlling this bias is crucial, selection also offers an opportun… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: ICML 2024

  3. arXiv:2406.16525  [pdf, other

    stat.ML cs.LG

    OAML: Outlier Aware Metric Learning for OOD Detection Enhancement

    Authors: Heng Gao, Zhuolin He, Shoumeng Qiu, Jian Pu

    Abstract: Out-of-distribution (OOD) detection methods have been developed to identify objects that a model has not seen during training. The Outlier Exposure (OE) methods use auxiliary datasets to train OOD detectors directly. However, the collection and learning of representative OOD samples may pose challenges. To tackle these issues, we propose the Outlier Aware Metric Learning (OAML) framework. The main… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.08819  [pdf, other

    cs.LG cs.AI stat.ML

    AIM: Attributing, Interpreting, Mitigating Data Unfairness

    Authors: Zhining Liu, Ruizhong Qiu, Zhichen Zeng, Yada Zhu, Hendrik Hamann, Hanghang Tong

    Abstract: Data collected in the real world often encapsulates historical discrimination against disadvantaged groups and individuals. Existing fair machine learning (FairML) research has predominantly focused on mitigating discriminative bias in the model prediction, with far less effort dedicated towards exploring how to trace biases present in the data, despite its importance for the transparency and inte… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, accepted by ACM SIGKDD 2024. Webpage: https://github.com/ZhiningLiu1998/AIM

  5. arXiv:2406.05637  [pdf, ps, other

    math.OC cs.LG math.PR stat.ML

    A Generalized Version of Chung's Lemma and its Applications

    Authors: Li Jiang, Xiao Li, Andre Milzarek, Junwen Qiu

    Abstract: Chung's lemma is a classical tool for establishing asymptotic convergence rates of (stochastic) optimization methods under strong convexity-type assumptions and appropriate polynomial diminishing step sizes. In this work, we develop a generalized version of Chung's lemma, which provides a simple non-asymptotic convergence framework for a more general family of step size rules. We demonstrate broad… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 43 pages, 5 figures

    MSC Class: 90C15; 90C30; 90C26

  6. arXiv:2405.20400  [pdf, other

    stat.ME cs.LG stat.CO stat.ML

    Fast leave-one-cluster-out cross-validation by clustered Network Information Criteria (NICc)

    Authors: Jiaxing Qiu, Douglas E. Lake, Teague R. Henry

    Abstract: This paper introduced a clustered estimator of the Network Information Criterion (NICc) to approximate leave-one-cluster-out cross-validated deviance, which can be used as an alternative to cluster-based cross-validation when modeling clustered data. Stone proved that Akaike Information Criterion (AIC) is an asymptotic equivalence to leave-one-observation-out cross-validation if the parametric mod… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  7. arXiv:2405.17478  [pdf, other

    cs.LG stat.ML

    ROSE: Register Assisted General Time Series Forecasting with Decomposed Frequency Learning

    Authors: Yihang Wang, Yuying Qiu, Peng Chen, Kai Zhao, Yang Shu, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo

    Abstract: With the increasing collection of time series data from various domains, there arises a strong demand for general time series forecasting models pre-trained on a large number of time-series datasets to support a variety of downstream prediction tasks. Enabling general time series forecasting faces two challenges: how to obtain unified representations from multi-domian time series data, and how to… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  8. arXiv:2405.07186  [pdf, other

    stat.ME stat.ML

    Adaptive-TMLE for the Average Treatment Effect based on Randomized Controlled Trial Augmented with Real-World Data

    Authors: Mark van der Laan, Sky Qiu, Lars van der Laan

    Abstract: We consider the problem of estimating the average treatment effect (ATE) when both randomized control trial (RCT) data and real-world data (RWD) are available. We decompose the ATE estimand as the difference between a pooled-ATE estimand that integrates RCT and RWD and a bias estimand that captures the conditional effect of RCT enrollment on the outcome. We introduce an adaptive targeted minimum l… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  9. arXiv:2404.17644  [pdf, other

    stat.ML cs.AI cs.LG

    A Conditional Independence Test in the Presence of Discretization

    Authors: Boyang Sun, Yu Yao, Huangyuan Hao, Yumou Qiu, Kun Zhang

    Abstract: Testing conditional independence has many applications, such as in Bayesian network learning and causal discovery. Different test methods have been proposed. However, existing methods generally can not work when only discretized observations are available. Specifically, consider $X_1$, $\tilde{X}_2$ and $X_3$ are observed variables, where $\tilde{X}_2$ is a discretization of latent variables… ▽ More

    Submitted 3 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

  10. arXiv:2404.10444  [pdf, other

    math.ST cs.LG stat.ML

    Semi-supervised Fréchet Regression

    Authors: Rui Qiu, Zhou Yu, Zhenhua Lin

    Abstract: This paper explores the field of semi-supervised Fréchet regression, driven by the significant costs associated with obtaining non-Euclidean labels. Methodologically, we propose two novel methods: semi-supervised NW Fréchet regression and semi-supervised kNN Fréchet regression, both based on graph distance acquired from all feature instances. These methods extend the scope of existing semi-supervi… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  11. arXiv:2404.04399  [pdf, other

    stat.ML cs.AI cs.LG stat.AP stat.ME

    Longitudinal Targeted Minimum Loss-based Estimation with Temporal-Difference Heterogeneous Transformer

    Authors: Toru Shirakawa, Yi Li, Yulun Wu, Sky Qiu, Yuxuan Li, Mingduo Zhao, Hiroyasu Iso, Mark van der Laan

    Abstract: We propose Deep Longitudinal Targeted Minimum Loss-based Estimation (Deep LTMLE), a novel approach to estimate the counterfactual mean of outcome under dynamic treatment policies in longitudinal problem settings. Our approach utilizes a transformer architecture with heterogeneous type embedding trained using temporal-difference learning. After obtaining an initial estimate using the transformer, f… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  12. arXiv:2404.01076  [pdf, other

    stat.ME

    Debiased calibration estimation using generalized entropy in survey sampling

    Authors: Yonghyun Kwon, Jae Kwang Kim, Yumou Qiu

    Abstract: Incorporating the auxiliary information into the survey estimation is a fundamental problem in survey sampling. Calibration weighting is a popular tool for incorporating the auxiliary information. The calibration weighting method of Deville and Sarndal (1992) uses a distance measure between the design weights and the final weights to solve the optimization problem with calibration constraints. Thi… ▽ More

    Submitted 2 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  13. arXiv:2403.00258  [pdf, ps, other

    stat.ML cs.LG

    "Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach

    Authors: Lingyu Gu, Yongqi Du, Yuan Zhang, Di Xie, Shiliang Pu, Robert C. Qiu, Zhenyu Liao

    Abstract: Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging. In an attempt to address this key limitation, efforts have been devoted to the compression (e.g., sparsification and/or quantization) of these large-scale machine learning models, s… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 32 pages, 4 figures, and 2 tables. Fixing typos in Theorems 1 and 2 from NeurIPS 2022 proceeding (https://proceedings.neurips.cc/paper_files/paper/2022/hash/185087ea328b4f03ea8fd0c8aa96f747-Abstract-Conference.html)

  14. arXiv:2402.18571  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

    Authors: Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang

    Abstract: Fine-grained control over large language models (LLMs) remains a significant challenge, hindering their adaptability to diverse user needs. While Reinforcement Learning from Human Feedback (RLHF) shows promise in aligning LLMs, its reliance on scalar rewards often limits its ability to capture diverse user preferences in real-world applications. To address this limitation, we introduce the Directi… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: The code and model are released at https://github.com/Haoxiang-Wang/directional-preference-alignment

  15. arXiv:2402.11025  [pdf, other

    cs.LG stat.ML

    Training Bayesian Neural Networks with Sparse Subspace Variational Inference

    Authors: Junbo Li, Zichen Miao, Qiang Qiu, Ruqi Zhang

    Abstract: Bayesian neural networks (BNNs) offer uncertainty quantification but come with the downside of substantially increased training and inference costs. Sparse BNNs have been investigated for efficient inference, typically by either slowly introducing sparsity throughout the training or by post-training compression of dense BNNs. The dilemma of how to cut down massive training costs remains, particula… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Journal ref: Published at International Conference on Learning Representations (ICLR) 2024

  16. arXiv:2402.02697  [pdf, ps, other

    cs.LG stat.ML

    Deep Equilibrium Models are Almost Equivalent to Not-so-deep Explicit Models for High-dimensional Gaussian Mixtures

    Authors: Zenan Ling, Longbo Li, Zhanbo Feng, Yixuan Zhang, Feng Zhou, Robert C. Qiu, Zhenyu Liao

    Abstract: Deep equilibrium models (DEQs), as a typical implicit neural network, have demonstrated remarkable success on various tasks. There is, however, a lack of theoretical understanding of the connections and differences between implicit DEQs and explicit neural network models. In this paper, leveraging recent advances in random matrix theory (RMT), we perform an in-depth analysis on the eigenspectra of… ▽ More

    Submitted 19 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024

  17. arXiv:2401.05424  [pdf, other

    cs.CY cs.IR cs.LG stat.AP

    A Toolbox for Modelling Engagement with Educational Videos

    Authors: Yuxiang Qiu, Karim Djemili, Denis Elezi, Aaneel Shalman, María Pérez-Ortiz, Emine Yilmaz, John Shawe-Taylor, Sahan Bulathwela

    Abstract: With the advancement and utility of Artificial Intelligence (AI), personalising education to a global population could be a cornerstone of new educational systems in the future. This work presents the PEEKC dataset and the TrueLearn Python library, which contains a dataset and a series of online learner state models that are essential to facilitate research on learner engagement modelling.TrueLear… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: In Proceedings of AAAI Conference on Artificial Intelligence 2024. arXiv admin note: text overlap with arXiv:2309.11527

    ACM Class: H.3.3; J.1; I.2.0

  18. arXiv:2312.17162  [pdf, other

    stat.ML cs.AI cs.LG

    Function-Space Regularization in Neural Networks: A Probabilistic Perspective

    Authors: Tim G. J. Rudner, Sanyam Kapoor, Shikai Qiu, Andrew Gordon Wilson

    Abstract: Parameter-space regularization in neural network optimization is a fundamental tool for improving generalization. However, standard parameter-space regularization methods make it challenging to encode explicit preferences about desired predictive functions into neural network training. In this work, we approach regularization in neural networks from a probabilistic perspective and show that by vie… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Published in Proceedings of the 40th International Conference on Machine Learning (ICML 2023)

  19. arXiv:2312.14420  [pdf, other

    math.ST stat.ME

    On eigenvalues of sample covariance matrices based on high dimensional compositional data

    Authors: Qianqian Jiang, Jiaxin Qiu, Zeng Li

    Abstract: This paper studies the asymptotic spectral properties of the sample covariance matrix for high dimensional compositional data, including the limiting spectral distribution, the limit of extreme eigenvalues, and the central limit theorem for linear spectral statistics. All asymptotic results are derived under the high-dimensional regime where the data dimension increases to infinity proportionally… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  20. arXiv:2312.06051  [pdf, other

    physics.ao-ph stat.AP

    Bangladesh's Accelerating Coastal Flood Hazard

    Authors: Jiangchao Qiu, Sai Ravela, Kerry Emanuel

    Abstract: The risk of extreme coastal flooding to Bangladesh's low-lying and densely populated coastal regions, already vulnerable to tropical cyclones, remains poorly quantified under a warming climate. Here, using a statistical-physical downscaling approach, our projections under the IPCC SSP5-8.5 scenario show that Bangladesh's 100-year coastal flood will likely intensify from 4.2m to 6.6m by the end of… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    MSC Class: 86A08

  21. Exploration of Superposition Theorem in Spectrum Space for Composite Event Analysis in an ADN

    Authors: Xing He, Qian Ai, Yuezhong Tang, Robert Qiu, Canbing Li

    Abstract: This study presents a formulation of the Superposition Theorem (ST) in the spectrum space, tailored for the analysis of composite events in an active distribution network (ADN). Our formulated ST enables a quantitative analysis on a composite event, uncovering the property of additivity among independent atom events in the spectrum space. This contribution is a significant addition to the existing… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 12 pages. Accepted by IEEE TPWRS

  22. arXiv:2311.15990  [pdf, other

    cs.LG stat.ML

    Should We Learn Most Likely Functions or Parameters?

    Authors: Shikai Qiu, Tim G. J. Rudner, Sanyam Kapoor, Andrew Gordon Wilson

    Abstract: Standard regularized training procedures correspond to maximizing a posterior distribution over parameters, known as maximum a posteriori (MAP) estimation. However, model parameters are of interest only insomuch as they combine with the functional form of a model to provide a function that can make good predictions. Moreover, the most likely parameters under the parameter posterior do not generall… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: NeurIPS 2023. Code available at https://github.com/activatedgeek/function-space-map

  23. arXiv:2311.04686  [pdf, other

    cs.LG cs.DC stat.ML

    Robust and Communication-Efficient Federated Domain Adaptation via Random Features

    Authors: Zhanbo Feng, Yuanjie Wang, Jie Li, Fan Yang, Jiong Lou, Tiebin Mi, Robert. C. Qiu, Zhenyu Liao

    Abstract: Modern machine learning (ML) models have grown to a scale where training them on a single machine becomes impractical. As a result, there is a growing trend to leverage federated learning (FL) techniques to train large ML models in a distributed and collaborative manner. These models, however, when deployed on new devices, might struggle to generalize well due to domain shifts. In this context, fe… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 21 pages

  24. arXiv:2310.19861  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

    Authors: Shuang Qiu, Ziyu Dai, Han Zhong, Zhaoran Wang, Zhuoran Yang, Tong Zhang

    Abstract: This paper investigates posterior sampling algorithms for competitive reinforcement learning (RL) in the context of general function approximations. Focusing on zero-sum Markov games (MGs) under two critical settings, namely self-play and adversarial learning, we first propose the self-play and adversarial generalized eluder coefficient (GEC) as complexity measures for function approximation, capt… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  25. arXiv:2310.09516  [pdf, other

    cs.LG stat.ML

    Efficient Link Prediction via GNN Layers Induced by Negative Sampling

    Authors: Yuxin Wang, Xiannian Hu, Quan Gan, Xuan**g Huang, Xipeng Qiu, David Wipf

    Abstract: Graph neural networks (GNNs) for link prediction can loosely be divided into two broad categories. First, \emph{node-wise} architectures pre-compute individual embeddings for each node that are later combined by a simple decoder to make predictions. While extremely efficient at inference time (since node embeddings are only computed once and repeatedly reused), model expressiveness is limited such… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: 19 pages, 5 figures

  26. arXiv:2310.07990  [pdf

    q-bio.GN cs.IR cs.LG stat.AP

    Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics

    Authors: Chen Zhao, Kuan-Jui Su, Chong Wu, Xuewei Cao, Qiuying Sha, Wu Li, Zhe Luo, Tian Qin, Chuan Qiu, Lan Juan Zhao, Anqi Liu, Lindong Jiang, Xiao Zhang, Hui Shen, Weihua Zhou, Hong-Wen Deng

    Abstract: Background: Missing data is a common challenge in mass spectrometry-based metabolomics, which can lead to biased and incomplete analyses. The integration of whole-genome sequencing (WGS) data with metabolomics data has emerged as a promising approach to enhance the accuracy of data imputation in metabolomics studies. Method: In this study, we propose a novel method that leverages the information f… ▽ More

    Submitted 12 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 19 pages, 3 figures

  27. arXiv:2310.06242  [pdf, other

    econ.EM stat.ME

    Treatment Choice, Mean Square Regret and Partial Identification

    Authors: Toru Kitagawa, Sokbae Lee, Chen Qiu

    Abstract: We consider a decision maker who faces a binary treatment choice when their welfare is only partially identified from data. We contribute to the literature by anchoring our finite-sample analysis on mean square regret, a decision criterion advocated by Kitagawa, Lee, and Qiu (2022). We find that optimal rules are always fractional, irrespective of the width of the identified set and precision of i… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  28. arXiv:2310.03258  [pdf, other

    cs.LG stat.ME

    Assessing Electricity Service Unfairness with Transfer Counterfactual Learning

    Authors: Song Wei, Xiangrui Kong, Alinson Santos Xavier, Shixiang Zhu, Yao Xie, Feng Qiu

    Abstract: Energy justice is a growing area of interest in interdisciplinary energy research. However, identifying systematic biases in the energy sector remains challenging due to confounding variables, intricate heterogeneity in counterfactual effects, and limited data availability. First, this paper demonstrates how one can evaluate counterfactual unfairness in a power system by analyzing the average caus… ▽ More

    Submitted 24 January, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: The preliminary version titled "Detecting Electricity Service Equity Issues with Transfer Counterfactual Learning on Large-Scale Outage Datasets" is presented at NeurIPS 2023 Workshops on Causal Representation Learning (CRL) and Algorithmic Fairness through the Lens of Time (AFT); See v1

  29. arXiv:2309.13825  [pdf, other

    stat.ML cs.LG stat.ME

    NSOTree: Neural Survival Oblique Tree

    Authors: Xiaotong Sun, Peijie Qiu

    Abstract: Survival analysis is a statistical method employed to scrutinize the duration until a specific event of interest transpires, known as time-to-event information characterized by censorship. Recently, deep learning-based methods have dominated this field due to their representational capacity and state-of-the-art performance. However, the black-box nature of the deep neural network hinders its inter… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: 12 pages

  30. arXiv:2309.11527  [pdf, other

    cs.IR cs.AI cs.CY cs.LG stat.ML

    TrueLearn: A Python Library for Personalised Informational Recommendations with (Implicit) Feedback

    Authors: Yuxiang Qiu, Karim Djemili, Denis Elezi, Aaneel Shalman, María Pérez-Ortiz, Sahan Bulathwela

    Abstract: This work describes the TrueLearn Python library, which contains a family of online learning Bayesian models for building educational (or more generally, informational) recommendation systems. This family of models was designed following the "open learner" concept, using humanly-intuitive user representations. For the sake of interpretability and putting the user in control, the TrueLearn library… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: To be presented at the ORSUM workshop at RecSys 2023

    ACM Class: H.3.3; J.1; I.2.0

  31. arXiv:2309.03097  [pdf, other

    stat.AP

    An Algorithm for Modelling Escalator Fixed Loss Energy for PHM and sustainable energy usage

    Authors: Xuwen Hu, Jiaqi Qiu, Yu Lin, Inez Maria Zwetsloot, William Ka Fai Lee, Edmond Yin San Yeung, Colman Yiu Wah Yeung, Chris Chun Long Wong

    Abstract: Prognostic Health Management (PHM) is designed to assess and monitor the health status of systems, anticipate the onset of potential failure, and prevent unplanned downtime. In recent decades, collecting massive amounts of real-time sensor data enabled condition monitoring (CM) and consequently, detection of abnormalities to support maintenance decision-making. Additionally, the utilization of PHM… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  32. arXiv:2309.01978  [pdf, other

    cs.LG stat.ME

    An LSTM-Based Predictive Monitoring Method for Data with Time-varying Variability

    Authors: Jiaqi Qiu, Yu Lin, Inez Zwetsloot

    Abstract: The recurrent neural network and its variants have shown great success in processing sequences in recent years. However, this deep neural network has not aroused much attention in anomaly detection through predictively process monitoring. Furthermore, the traditional statistic models work on assumptions and hypothesis tests, while neural network (NN) models do not need that many assumptions. This… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 19 pages, 9 figures, 6 tables

  33. arXiv:2309.01360  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Random Projections of Sparse Adjacency Matrices

    Authors: Frank Qiu

    Abstract: We analyze a random projection method for adjacency matrices, studying its utility in representing sparse graphs. We show that these random projections retain the functionality of their underlying adjacency matrices while having extra properties that make them attractive as dynamic graph representations. In particular, they can represent graphs of different sizes and vertex sets in the same space,… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 21 pages

    MSC Class: 65F50

  34. arXiv:2309.00870  [pdf, other

    stat.ME

    Robust estimation for number of factors in high dimensional factor modeling via Spearman correlation matrix

    Authors: Jiaxin Qiu, Zeng Li, Jianfeng Yao

    Abstract: Determining the number of factors in high-dimensional factor modeling is essential but challenging, especially when the data are heavy-tailed. In this paper, we introduce a new estimator based on the spectral properties of Spearman sample correlation matrix under the high-dimensional setting, where both dimension and sample size tend to infinity proportionally. Our estimator is robust against heav… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  35. arXiv:2308.16425  [pdf, other

    cs.LG stat.ML

    On the Equivalence between Implicit and Explicit Neural Networks: A High-dimensional Viewpoint

    Authors: Zenan Ling, Zhenyu Liao, Robert C. Qiu

    Abstract: Implicit neural networks have demonstrated remarkable success in various tasks. However, there is a lack of theoretical analysis of the connections and differences between implicit and explicit networks. In this paper, we study high-dimensional implicit neural networks and provide the high dimensional equivalents for the corresponding conjugate kernels and neural tangent kernels. Built upon this,… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted by Workshop on High-dimensional Learning Dynamics, ICML 2023, Honolulu, Hawaii

  36. arXiv:2308.05738  [pdf, other

    stat.CO q-bio.NC stat.AP stat.ME

    Continuous and Atlas-free Analysis of Brain Structural Connectivity

    Authors: William Consagra, Martin Cole, Xing Qiu, Zhengwu Zhang

    Abstract: Brain structural networks are often represented as discrete adjacency matrices with elements summarizing the connectivity between pairs of regions of interest (ROIs). These ROIs are typically determined a-priori using a brain atlas. The choice of atlas is often arbitrary and can lead to a loss of important connectivity information at the sub-ROI level. This work introduces an atlas-free framework… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  37. arXiv:2307.00836  [pdf, other

    stat.ML cs.LG

    Trading-Off Payments and Accuracy in Online Classification with Paid Stochastic Experts

    Authors: Dirk van der Hoeven, Ciara Pike-Burke, Hao Qiu, Nicolo Cesa-Bianchi

    Abstract: We investigate online classification with paid stochastic experts. Here, before making their prediction, each expert must be paid. The amount that we pay each expert directly influences the accuracy of their prediction through some unknown Lipschitz "productivity" function. In each round, the learner must decide how much to pay each expert and then make a prediction. They incur a cost equal to a w… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: ICML 2023

  38. arXiv:2306.16406  [pdf, other

    stat.ME math.ST stat.ML

    Efficient and Multiply Robust Risk Estimation under General Forms of Dataset Shift

    Authors: Hongxiang Qiu, Eric Tchetgen Tchetgen, Edgar Dobriban

    Abstract: Statistical machine learning methods often face the challenge of limited data available from the population of interest. One remedy is to leverage data from auxiliary source populations, which share some conditional distributions or are linked in other ways with the target domain. Techniques leveraging such \emph{dataset shift} conditions are known as \emph{domain adaptation} or \emph{transfer lea… ▽ More

    Submitted 7 June, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  39. arXiv:2306.11074  [pdf, other

    cs.LG stat.ML

    Simple and Fast Group Robustness by Automatic Feature Reweighting

    Authors: Shikai Qiu, Andres Potapczynski, Pavel Izmailov, Andrew Gordon Wilson

    Abstract: A major challenge to out-of-distribution generalization is reliance on spurious features -- patterns that are predictive of the class label in the training data distribution, but not causally related to the target. Standard methods for reducing the reliance on spurious features typically assume that we know what the spurious feature is, which is rarely true in the real world. Methods that attempt… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: ICML 23. Code available at https://github.com/AndPotap/afr

    Journal ref: 40th International Conference on Machine Learning 2023

  40. arXiv:2306.05436  [pdf, other

    stat.AP cs.CY

    Remaining Useful Life Modelling with an Escalator Health Condition Analytic System

    Authors: Inez M. Zwetsloot, Yu Lin, Jiaqi Qiu, Lishuai Li, William Ka Fai Lee, Edmond Yin San Yeung, Colman Yiu Wah Yeung, Chris Chun Long Wong

    Abstract: The refurbishment of an escalator is usually linked with its design life as recommended by the manufacturer. However, the actual useful life of an escalator should be determined by its operating condition which is affected by the runtime, workload, maintenance quality, vibration, etc., rather than age only. The objective of this project is to develop a comprehensive health condition analytic syste… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 14 pages, 12 figures, 7 tables

  41. arXiv:2306.03065  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    LibAUC: A Deep Learning Library for X-Risk Optimization

    Authors: Zhuoning Yuan, Dixian Zhu, Zi-Hao Qiu, Gang Li, Xuanhui Wang, Tianbao Yang

    Abstract: This paper introduces the award-winning deep learning (DL) library called LibAUC for implementing state-of-the-art algorithms towards optimizing a family of risk functions named X-risks. X-risks refer to a family of compositional functions in which the loss function of each data point is defined in a way that contrasts the data point with a large number of others. They have broad applications in A… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by KDD2023

  42. arXiv:2305.18730  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Blockwise Stochastic Variance-Reduced Methods with Parallel Speedup for Multi-Block Bilevel Optimization

    Authors: Quanqi Hu, Zi-Hao Qiu, Zhishuai Guo, Lijun Zhang, Tianbao Yang

    Abstract: In this paper, we consider non-convex multi-block bilevel optimization (MBBO) problems, which involve $m\gg 1$ lower level problems and have important applications in machine learning. Designing a stochastic gradient and controlling its variance is more intricate due to the hierarchical sampling of blocks and data and the unique challenge of estimating hyper-gradient. We aim to achieve three nice… ▽ More

    Submitted 2 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  43. arXiv:2305.13444  [pdf, other

    stat.ME

    Ordinal Outcome State-Space Models for Intensive Longitudinal Data

    Authors: Teague R. Henry, Lindley R. Slipetz, Ami Falk, Jiaxing Qiu, Meng Chen

    Abstract: Intensive longitudinal (IL) data are increasingly prevalent in psychological science, coinciding with technological advancements that make it simple to deploy study designs such as daily diary and ecological momentary assessments. IL data are characterized by a rapid rate of data collection (1+ collections per day), over a period of time, allowing for the capture of the dynamics that underlie psyc… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 28 pages, 6 figures, 7 pages supplementary materials

  44. arXiv:2305.11965  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization

    Authors: Zi-Hao Qiu, Quanqi Hu, Zhuoning Yuan, Denny Zhou, Lijun Zhang, Tianbao Yang

    Abstract: In this paper, we aim to optimize a contrastive loss with individualized temperatures in a principled and systematic manner for self-supervised learning. The common practice of using a global temperature parameter $τ$ ignores the fact that ``not all semantics are created equal", meaning that different anchor data may have different numbers of samples with similar semantics, especially when data ex… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 33 pages, 11 figures, accepted by ICML2023

  45. arXiv:2305.10636  [pdf, other

    cs.LG stat.ML

    Augmented Message Passing Stein Variational Gradient Descent

    Authors: Jiankui Zhou, Yue Qiu

    Abstract: Stein Variational Gradient Descent (SVGD) is a popular particle-based method for Bayesian inference. However, its convergence suffers from the variance collapse, which reduces the accuracy and diversity of the estimation. In this paper, we study the isotropy property of finite particles during the convergence process and show that SVGD of finite particles cannot spread across the entire sample spa… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

    MSC Class: 62-08; 62G09 ACM Class: G.3; I.2

  46. arXiv:2305.10572  [pdf, ps, other

    stat.ML cs.LG

    Tensor Products and Hyperdimensional Computing

    Authors: Frank Qiu

    Abstract: Following up on a previous analysis of graph embeddings, we generalize and expand some results to the general setting of vector symbolic architectures (VSA) and hyperdimensional computing (HDC). Importantly, we explore the mathematical relationship between superposition, orthogonality, and tensor product. We establish the tensor product representation as the central representation, with a suite of… ▽ More

    Submitted 20 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 18 pages

    MSC Class: 68T30

  47. arXiv:2305.08942  [pdf, other

    stat.ME physics.data-an stat.AP

    Probabilistic forecast of nonlinear dynamical systems with uncertainty quantification

    Authors: Mengyang Gu, Yizi Lin, Victor Chang Lee, Diana Qiu

    Abstract: Data-driven modeling is useful for reconstructing nonlinear dynamical systems when the underlying process is unknown or too expensive to compute. Having reliable uncertainty assessment of the forecast enables tools to be deployed to predict new scenarios unobserved before. In this work, we first extend parallel partial Gaussian processes for predicting the vector-valued transition function that li… ▽ More

    Submitted 30 October, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Journal ref: Physica D: Nonlinear Phenomena, 133938 (2023)

  48. An Efficient Doubly-robust Imputation Framework for Longitudinal Dropout, with an Application to an Alzheimer's Clinical Trial

    Authors: Yuqi Qiu, Karen Messer

    Abstract: We develop a novel doubly-robust (DR) imputation framework for longitudinal studies with monotone dropout, motivated by the informative dropout that is common in FDA-regulated trials for Alzheimer's disease. In this approach, the missing data are first imputed using a doubly-robust augmented inverse probability weighting (AIPW) estimator, then the imputed completed data are substituted into a full… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: To be published in The Annals of Applied Statistics

  49. Prediction method of cigarette draw resistance based on correlation analysis

    Authors: Linsheng Chen, Zhonghua Yu, Bo Zhang, Qiang Zhu, Hu Fan, Yucan Qiu

    Abstract: The cigarette draw resistance monitoring method is incomplete and single, and the lacks correlation analysis and preventive modeling, resulting in substandard cigarettes in the market. To address this problem without increasing the hardware cost, in this paper, multi-indicator correlation analysis is used to predict cigarette draw resistance. First, the monitoring process of draw resistance is ana… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Preprint, submitted to Computers and Electronics in Agriculture. For any suggestions or improvements, please contact me directly by e-mail

  50. arXiv:2304.03933  [pdf, other

    stat.ME cs.LG stat.CO stat.ML

    Efficient Multimodal Sampling via Tempered Distribution Flow

    Authors: Yixuan Qiu, Xiao Wang

    Abstract: Sampling from high-dimensional distributions is a fundamental problem in statistical research and practice. However, great challenges emerge when the target density function is unnormalized and contains isolated modes. We tackle this difficulty by fitting an invertible transformation map**, called a transport map, between a reference probability measure and the target distribution, so that sampl… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.