Showing 1–50 of 276 results for author: Chen, H

Searching in archive stat.
  1. arXiv:2406.12525  [pdf, other]

    cs.SI physics.soc-ph stat.AP

    Anatomy of Elite and Mass Polarization in Social Networks

    Authors: Ali Salloum, Ted Hsuan Yun Chen, Mikko Kivelä

    Abstract: Existing methods for quantifying polarization in social networks typically report a single value describing the amount of polarization in a social system. While this approach can be used to confirm the observation that many societies have witnessed an increase in political polarization in recent years, it misses the complexities that could be used to understand the reasons behind this phenomenon.… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2406.03396  [pdf, other]

    cs.LG math.FA stat.ML

    Noisy Data Visualization using Functional Data Analysis

    Authors: Haozhe Chen, Andres Felipe Duque Correa, Guy Wolf, Kevin R. Moon

    Abstract: Data visualization via dimensionality reduction is an important tool in exploratory data analysis. However, when the data are noisy, many existing methods fail to capture the underlying structure of the data. The method called Empirical Intrinsic Geometry (EIG) was previously proposed for performing dimensionality reduction on high dimensional dynamical processes while theoretically eliminating al… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2405.17836  [pdf, other]

    eess.SP cs.LG stat.ML

    An Innovative Networks in Federated Learning

    Authors: Zavareh Bozorgasl, Hao Chen

    Abstract: This paper presents the development and application of Wavelet Kolmogorov-Arnold Networks (Wav-KAN) in federated learning. We implemented Wav-KAN \cite{wav-kan} in the clients. Indeed, we have considered both continuous wavelet transform (CWT) and discrete wavelet transform (DWT) to enable multiresolution capability, which helps with heterogeneous data distribution across clients. Extensive exp… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Work in progress

  4. arXiv:2405.16663  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Private Edge Density Estimation for Random Graphs: Optimal, Efficient and Robust

    Authors: Hongjie Chen, Jingqiu Ding, Yiding Hua, David Steurer

    Abstract: We give the first polynomial-time, differentially node-private, and robust algorithm for estimating the edge density of Erdős-Rényi random graphs and their generalization, inhomogeneous random graphs. We further prove information-theoretical lower bounds, showing that the error rate of our algorithm is optimal up to logarithmic factors. Previous algorithms incur either exponential running time or… ▽ More

    Submitted 3 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: fix minor typos; add missing references

  5. arXiv:2405.15986  [pdf, ps, other]

    cs.LG cs.DC math.NA stat.ML

    Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity

    Authors: Haoxuan Chen, Yinuo Ren, Lexing Ying, Grant M. Rotskoff

    Abstract: Diffusion models have become a leading method for generative modeling of both image and scientific data. As these models are costly to train and evaluate, reducing the inference cost for diffusion models remains a major goal. Inspired by the recent empirical success in accelerating diffusion models via the parallel sampling technique~\cite{shih2024parallel}, we propose to divide the sampling proce… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2405.14953  [pdf, other]

    cs.LG cs.AI stat.ML

    Mallows-DPO: Fine-Tune Your LLM with Preference Dispersions

    Authors: Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang

    Abstract: Direct Preference Optimization (DPO) has recently emerged as a popular approach to improve reinforcement learning with human feedback (RLHF), leading to better techniques to fine-tune large language models (LLM). A weakness of DPO, however, lies in its lack of capability to characterize the diversity of human preferences. Inspired by Mallows' theory of preference ranking, we develop in this paper… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.12832  [pdf, other]

    cs.LG cs.AI eess.SP stat.ML

    Wav-KAN: Wavelet Kolmogorov-Arnold Networks

    Authors: Zavareh Bozorgasl, Hao Chen

    Abstract: In this paper, we introduce Wav-KAN, an innovative neural network architecture that leverages the Wavelet Kolmogorov-Arnold Networks (Wav-KAN) framework to enhance interpretability and performance. Traditional multilayer perceptrons (MLPs) and even recent advancements like Spl-KAN face challenges related to interpretability, training speed, robustness, computational efficiency, and performance. Wa… ▽ More

    Submitted 27 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Work in progress; code is available at https://github.com/zavareh1/Wav-KAN

  8. arXiv:2404.10262  [pdf, other]

    stat.CO math.OC

    Safe Feature Identification Rule for Fused Lasso by An Extra Dual Variable

    Authors: Pan Shang, Huangyue Chen, Lingchen Kong

    Abstract: Fused Lasso was proposed to characterize the sparsity of the coefficients and the sparsity of their successive differences for the linear regression. Due to its wide applications, there are many existing algorithms to solve fused Lasso. However, the computation of this model is time-consuming in high-dimensional data sets. To accelerate the calculation of fused Lasso in high-dimension data sets, w… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  9. arXiv:2404.08073  [pdf, other]

    math.OC cs.LG stat.ML

    Spurious Stationarity and Hardness Results for Mirror Descent

    Authors: He Chen, Jiajin Li, Anthony Man-Cho So

    Abstract: Despite the considerable success of Bregman proximal-type algorithms, such as mirror descent, in machine learning, a critical question remains: Can existing stationarity measures, often based on Bregman divergence, reliably distinguish between stationary and non-stationary points? In this paper, we present a groundbreaking finding: All existing stationarity measures necessarily imply the existence… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  10. arXiv:2404.06549  [pdf, other]

    cs.LG stat.ML

    Variational Stochastic Gradient Descent for Deep Neural Networks

    Authors: Haotian Chen, Anna Kuzina, Babak Esmaeili, Jakub M Tomczak

    Abstract: Optimizing deep neural networks is one of the main tasks in successful deep learning. Current state-of-the-art optimizers are adaptive gradient-based optimization methods such as Adam. Recently, there has been an increasing interest in formulating gradient-based optimizers in a probabilistic framework for better estimation of gradients and modeling uncertainties. Here, we propose to combine both a… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  11. arXiv:2404.03828  [pdf, other]

    cs.LG cs.AI stat.ML

    Outlier-Efficient Hopfield Layers for Large Transformer-Based Models

    Authors: Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Robin Luo, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu

    Abstract: We introduce an Outlier-Efficient Modern Hopfield Model (termed $\mathrm{OutEffHop}$) and use it to address the outlier inefficiency problem of {training} gigantic transformer-based models. Our main contribution is a novel associative memory model facilitating \textit{outlier-efficient} associative memory retrievals. Interestingly, this memory model manifests a model-based interpretation of an out… ▽ More

    Submitted 26 June, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted at ICML 2024; v2 updated to camera-ready version; Code available at https://github.com/MAGICS-LAB/OutEffHop; Models are on Hugging Face: https://huggingface.co/collections/magicslabnu/outeffhop-6610fcede8d2cda23009a98f

  12. arXiv:2403.12213  [pdf, ps, other]

    cs.DS cs.CC cs.LG stat.ML

    Private graphon estimation via sum-of-squares

    Authors: Hongjie Chen, Jingqiu Ding, Tommaso d'Orsi, Yiding Hua, Chih-Hung Liu, David Steurer

    Abstract: We develop the first pure node-differentially-private algorithms for learning stochastic block models and for graphon estimation with polynomial running time for any constant number of blocks. The statistical utility guarantees match those of the previous best information-theoretic (exponential-time) node-private mechanisms for these problems. The algorithm is based on an exponential mechanism for… ▽ More

    Submitted 18 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 71 pages, accepted to STOC 2024

  13. arXiv:2403.11332  [pdf, other]

    cs.LG cs.SI stat.ME

    Graph Machine Learning based Doubly Robust Estimator for Network Causal Effects

    Authors: Seyedeh Baharan Khatami, Harsh Parikh, Haowei Chen, Sudeepa Roy, Babak Salimi

    Abstract: We address the challenge of inferring causal effects in social network data. This results in challenges due to interference -- where a unit's outcome is affected by neighbors' treatments -- and network-induced confounding factors. While there is extensive literature focusing on estimating causal effects in social network setups, a majority of them make prior assumptions about the form of network-i… ▽ More

    Submitted 31 May, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  14. arXiv:2403.04015  [pdf, other]

    cs.LG cs.AI stat.ML

    Knockoff-Guided Feature Selection via A Single Pre-trained Reinforced Agent

    Authors: Xinyuan Wang, Dongjie Wang, Wangyang Ying, Rui Xie, Haifeng Chen, Yanjie Fu

    Abstract: Feature selection prepares the AI-readiness of data by eliminating redundant features. Prior research falls into two primary categories: i) Supervised Feature Selection, which identifies the optimal feature subset based on their relevance to the target variable; ii) Unsupervised Feature Selection, which reduces the feature space dimensionality by capturing the essential information within the feat… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  15. arXiv:2402.11425  [pdf, other]

    stat.ME cs.LG math.OC math.PR

    Online Local False Discovery Rate Control: A Resource Allocation Approach

    Authors: Ruicheng Ao, Hongyu Chen, David Simchi-Levi, Feng Zhu

    Abstract: We consider the problem of sequentially conducting multiple experiments where each experiment corresponds to a hypothesis testing task. At each time point, the experimenter must make an irrevocable decision of whether to reject the null hypothesis (or equivalently claim a discovery) before the next experimental result arrives. The goal is to maximize the number of discoveries while maintaining a l… ▽ More

    Submitted 1 April, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  16. arXiv:2402.08095  [pdf, ps, other]

    stat.ML cs.LG

    Convergence Analysis of Discrete Diffusion Model: Exact Implementation through Uniformization

    Authors: Hongrui Chen, Lexing Ying

    Abstract: Diffusion models have achieved huge empirical success in data generation tasks. Recently, some efforts have been made to adapt the framework of diffusion models to discrete state space, providing a more natural approach for modeling intrinsically discrete data, such as language and graphs. This is achieved by formulating both the forward noising process and the corresponding reversed process as Co… ▽ More

    Submitted 14 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 19 pages

  17. arXiv:2402.03933  [pdf]

    cs.SE stat.AP

    Development of a Evaluation Tool for Age-Appropriate Software in Aging Environments: A Delphi Study

    Authors: Zhenggang Bai, Yougxiang Fang, Hongtu Chen, Xinru Chen, Ning An, Min Zhang, Guoxin Rui, **g **

    Abstract: Objective: We aimed to develop a dependable, reliable tool for assessing software age-appropriateness. Methods: We conducted a systematic review to get the indicators of technology age-appropriateness from studies from January 2000 to April 2023. This study engaged 25 experts from the fields of anthropology, sociology, and social technology research across, three rounds of Delphi consultations were con… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  18. arXiv:2402.02357  [pdf, other]

    cs.LG stat.ME

    Multi-modal Causal Structure Learning and Root Cause Analysis

    Authors: Lecheng Zheng, Zhengzhang Chen, Jingrui He, Haifeng Chen

    Abstract: Effective root cause analysis (RCA) is vital for swiftly restoring services, minimizing losses, and ensuring the smooth operation and management of complex systems. Previous data-driven RCA methods, particularly those employing causal discovery techniques, have primarily focused on constructing dependency or causal graphs for backtracking the root causes. However, these methods often fall short as… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by the Web Conference 2024

  19. arXiv:2401.15645  [pdf, other]

    stat.CO cs.LG math.NA physics.comp-ph stat.ML

    Ensemble-Based Annealed Importance Sampling

    Authors: Haoxuan Chen, Lexing Ying

    Abstract: Sampling from a multimodal distribution is a fundamental and challenging problem in computational science and statistics. Among various approaches proposed for this task, one popular method is Annealed Importance Sampling (AIS). In this paper, we propose an ensemble-based version of AIS by combining it with population-based Monte Carlo methods to improve its efficiency. By keeping track of an ense… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 33 pages, 13 figures

    MSC Class: 65C05; 65C40; 65C60; 62P35

  20. arXiv:2401.03341  [pdf, other]

    cs.LG stat.ML

    Weakly Augmented Variational Autoencoder in Time Series Anomaly Detection

    Authors: Zhangkai Wu, Longbing Cao, Qi Zhang, Junxian Zhou, Hui Chen

    Abstract: Due to their unsupervised training and uncertainty estimation, deep Variational Autoencoders (VAEs) have become powerful tools for reconstruction-based Time Series Anomaly Detection (TSAD). Existing VAE-based TSAD methods, either statistical or deep, tune meta-priors to estimate the likelihood probability for effectively capturing spatiotemporal dependencies in the data. However, these methods con… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

  21. arXiv:2401.03126  [pdf, other]

    math.MG stat.OT

    Quotient geometry of bounded or fixed rank correlation matrices

    Authors: Hengchao Chen

    Abstract: This paper studies the quotient geometry of bounded or fixed-rank correlation matrices. We establish a bijection between the set of bounded-rank correlation matrices and a quotient set of a spherical product manifold by an orthogonal group. We show that it forms an orbit space, whose stratification is determined by the rank of the matrices, and the principal stratum has a compatible Riemannian quo… ▽ More

    Submitted 10 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    MSC Class: 51F99; 15B99; 53A99; 65K10

  22. arXiv:2312.11319  [pdf, other]

    stat.ME

    Uncertainty Quantification for Data-Driven Change-Point Learning via Cross-Validation

    Authors: Hui Chen, Yinxu Jia, Guanghui Wang, Changliang Zou

    Abstract: Accurately detecting multiple change-points is critical for various applications, but determining the optimal number of change-points remains a challenge. Existing approaches based on information criteria attempt to balance goodness-of-fit and model complexity, but their performance varies depending on the model. Recently, data-driven selection criteria based on cross-validation has been proposed,… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 11 pages, 1 figure, to appear at AAAI 2024

  23. arXiv:2312.05933  [pdf, other]

    cs.LG cs.AI stat.ML

    Temporal Supervised Contrastive Learning for Modeling Patient Risk Progression

    Authors: Shahriar Noroozizadeh, Jeremy C. Weiss, George H. Chen

    Abstract: We consider the problem of predicting how the likelihood of an outcome of interest for a patient changes over time as we observe more of the patient data. To solve this problem, we propose a supervised contrastive learning framework that learns an embedding representation for each time step of a patient time series. Our framework learns the embedding space to have the following properties: (1) nea… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Machine Learning for Health (ML4H 2023)

    Journal ref: In Machine Learning for Health (ML4H), pages 403-427. PMLR, 2023

  24. arXiv:2311.12736  [pdf, other]

    stat.AP

    Spatio-Temporal Modeling of Surface Water Quality Distribution in California (1956-2023)

    Authors: Houlin Chen, Meredith Franklin

    Abstract: Surface water quality has a direct impact on public health, ecosystems, and agriculture, in addition to being an important indicator of the overall health of the environment. California's diverse climate, extensive coastline, and varied topography lead to distinct spatial and temporal patterns in surface water. This study offers a comprehensive assessment of these patterns by leveraging around 70… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  25. arXiv:2311.05819  [pdf, other]

    stat.ME

    A flexible framework for synthesizing human activity patterns with application to sequential categorical data

    Authors: Zuofu Huang, Julian Wolfson, Jayne A. Fulkerson, Ryan Demmer, Helen N. Chen

    Abstract: The ability to synthesize realistic data in a parametrizable way is valuable for a number of reasons, including privacy, missing data imputation, and evaluating the performance of statistical and computational methods. When the underlying data generating process is complex, data synthesis requires approaches that balance realism and simplicity. In this paper, we address the problem of synthesizing… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  26. arXiv:2311.00923  [pdf, other]

    cs.LG stat.ME

    A Review and Roadmap of Deep Causal Model from Different Causal Structures and Representations

    Authors: Hang Chen, Keqing Du, Chenguang Li, Xinyu Yang

    Abstract: The fusion of causal models with deep learning introducing increasingly intricate data sets, such as the causal associations within images or between textual components, has surfaced as a focal research area. Nonetheless, the broadening of original causal concepts and theories to such complex, non-statistical data has been met with serious challenges. In response, our study proposes redefinitions… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: under review

  27. arXiv:2310.19726  [pdf, other]

    cs.LG cs.AI stat.ML

    A Path to Simpler Models Starts With Noise

    Authors: Lesia Semenova, Harry Chen, Ronald Parr, Cynthia Rudin

    Abstract: The Rashomon set is the set of models that perform approximately equally well on a given dataset, and the Rashomon ratio is the fraction of all models in a given hypothesis space that are in the Rashomon set. Rashomon ratios are often large for tabular datasets in criminal justice, healthcare, lending, education, and in other areas, which has practical implications about whether simpler models can… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  28. arXiv:2310.10434  [pdf, other]

    stat.ML cond-mat.mtrl-sci cs.LG physics.chem-ph

    Equivariant Matrix Function Neural Networks

    Authors: Ilyes Batatia, Lars L. Schaaf, Huajie Chen, Gábor Csányi, Christoph Ortner, Felix A. Faber

    Abstract: Graph Neural Networks (GNNs), especially message-passing neural networks (MPNNs), have emerged as powerful architectures for learning on graphs in diverse applications. However, MPNNs face challenges when modeling non-local interactions in graphs such as large conjugated molecules, and social networks due to oversmoothing and oversquashing. Although Spectral GNNs and traditional neural networks su… ▽ More

    Submitted 30 January, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: International Conference on Learning Representations, 2024

  29. arXiv:2310.09766  [pdf, other]

    stat.ML cs.LG

    Pseudo-Bayesian Optimization

    Authors: Haoxian Chen, Henry Lam

    Abstract: Bayesian Optimization is a popular approach for optimizing expensive black-box functions. Its key idea is to use a surrogate model to approximate the objective and, importantly, quantify the associated uncertainty that allows a sequential search of query points that balance exploitation-exploration. Gaussian process (GP) has been a primary candidate for the surrogate model, thanks to its Bayesian-… ▽ More

    Submitted 20 June, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  30. arXiv:2310.04934  [pdf, other]

    stat.ME math.ST

    UBSea: A Unified Community Detection Framework

    Authors: Xiancheng Lin, Hao Chen

    Abstract: Detecting communities in networks and graphs is an important task across many disciplines such as statistics, social science and engineering. There are generally three different kinds of mixing patterns for the case of two communities: assortative mixing, disassortative mixing and core-periphery structure. Modularity optimization is a classical way for fitting network models with communities. Howe… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  31. arXiv:2310.02941  [pdf, ps, other]

    stat.ML cs.LG math.PR

    Hoeffding's Inequality for Markov Chains under Generalized Concentrability Condition

    Authors: Hao Chen, Abhishek Gupta, Yin Sun, Ness Shroff

    Abstract: This paper studies Hoeffding's inequality for Markov chains under the generalized concentrability condition defined via integral probability metric (IPM). The generalized concentrability condition establishes a framework that interpolates and extends the existing hypotheses of Markov chain Hoeffding-type inequalities. The flexibility of our framework allows Hoeffding's inequality to be applied bey… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  32. arXiv:2307.15205  [pdf, other]

    stat.ME math.ST

    Robust graph-based methods for overcoming the curse of dimensionality

    Authors: Yejiong Zhu, Hao Chen

    Abstract: Graph-based two-sample tests and graph-based change-point detection that utilize a similarity graph provide a powerful tool for analyzing high-dimensional and non-Euclidean data as these methods do not impose distributional assumptions on data and have good performance across various scenarios. Current graph-based tests that deliver efficacy across a broad spectrum of alternatives typically rely… ▽ More

    Submitted 19 October, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  33. arXiv:2307.07264  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    On Interpolating Experts and Multi-Armed Bandits

    Authors: Houshuang Chen, Yuchen He, Chihao Zhang

    Abstract: Learning with expert advice and multi-armed bandit are two classic online decision problems which differ on how the information is observed in each round of the game. We study a family of problems interpolating the two. For a vector $\mathbf{m}=(m_1,\dots,m_K)\in \mathbb{N}^K$, an instance of $\mathbf{m}$-MAB indicates that the arms are partitioned into $K$ groups and the $i$-th group contains… ▽ More

    Submitted 4 August, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

  34. arXiv:2306.15199  [pdf, other]

    stat.ME

    A new classification framework for high-dimensional data

    Authors: Xiangbo Mo, Hao Chen

    Abstract: Classification is a classic problem but encounters lots of challenges when dealing with a large number of features, which is common in many modern applications, such as identifying tumor sub-types from genomic data or categorizing customer attitudes based on on-line reviews. We propose a new framework that utilizes the ranks of pairwise distances among observations and identifies a common pattern… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  35. arXiv:2306.14826  [pdf, other]

    stat.ME

    Incorporating increased variability in testing for cancer DNA methylation

    Authors: James Y. Dai, Heng Chen, Xiaoyu Wang, Wei Sun, Ying Huang, William M. Grady, Ziding Feng

    Abstract: Cancer development is associated with aberrant DNA methylation, including increased stochastic variability. Statistical tests for discovering cancer methylation biomarkers have focused on changes in mean methylation. To improve the power of detection, we propose to incorporate increased variability in testing for cancer differential methylation by two joint constrained tests: one for differential… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  36. arXiv:2306.10601  [pdf, other]

    stat.ME

    Sliced Wasserstein Regression

    Authors: Han Chen, Hans-Georg Müller

    Abstract: While statistical modeling of distributional data has gained increased attention, the case of multivariate distributions has been somewhat neglected despite its relevance in various applications. This is because the Wasserstein distance, commonly used in distributional data analysis, poses challenges for multivariate distributions. A promising alternative is the sliced Wasserstein distance, which… ▽ More

    Submitted 12 March, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

  37. arXiv:2306.09882  [pdf, other]

    cs.LG stat.ML stat.OT

    Uncertainty Quantification via Spatial-Temporal Tweedie Model for Zero-inflated and Long-tail Travel Demand Prediction

    Authors: Xinke Jiang, Dingyi Zhuang, Xianghui Zhang, Hao Chen, Jiayuan Luo, Xiaowei Gao

    Abstract: Understanding Origin-Destination (O-D) travel demand is crucial for transportation management. However, traditional spatial-temporal deep learning models grapple with addressing the sparse and long-tail characteristics in high-resolution O-D matrices and quantifying prediction uncertainty. This dilemma arises from the numerous zeros and over-dispersed demand patterns within these matrices, which c… ▽ More

    Submitted 30 January, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: In proceeding of CIKM 2023. Doi: https://dl.acm.org/doi/10.1145/3583780.3615215

  38. arXiv:2306.05722  [pdf, other]

    cs.LG stat.ML

    Ridge Estimation with Nonlinear Transformations

    Authors: Zheng Zhai, Hengchao Chen, Zhigang Yao

    Abstract: Ridge estimation is an important manifold learning technique. The goal of this paper is to examine the effects of nonlinear transformations on the ridge sets. The main result proves the inclusion relationship between ridges: $\cR(f\circ p)\subseteq \cR(p)$, provided that the transformation $f$ is strictly increasing and concave on the range of the function $p$. Additionally, given an underlying tr… ▽ More

    Submitted 4 August, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

  39. arXiv:2306.02833  [pdf, ps, other]

    stat.ML cs.LG math.ST

    The $L^\infty$ Learnability of Reproducing Kernel Hilbert Spaces

    Authors: Hongrui Chen, Jihao Long, Lei Wu

    Abstract: In this work, we analyze the learnability of reproducing kernel Hilbert spaces (RKHS) under the $L^\infty$ norm, which is critical for understanding the performance of kernel methods and random feature models in safety- and security-critical applications. Specifically, we relate the $L^\infty$ learnability of a RKHS to the spectrum decay of the associate kernel and both lower bounds and upper boun… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 20 pages

  40. arXiv:2306.00469  [pdf, other]

    stat.CO

    HiQR: An efficient algorithm for high-dimensional quadratic regression with penalties

    Authors: Cheng Wang, Haozhe Chen, Binyan Jiang

    Abstract: This paper investigates the efficient solution of penalized quadratic regressions in high-dimensional settings. A novel and efficient algorithm for ridge-penalized quadratic regression is proposed, leveraging the matrix structures of the regression with interactions. Additionally, an alternating direction method of multipliers (ADMM) framework is developed for penalized quadratic regression with g… ▽ More

    Submitted 2 December, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 20 pages

  41. arXiv:2305.19440  [pdf, other]

    cs.LG cond-mat.str-el stat.ML

    Machine learning with tree tensor networks, CP rank constraints, and tensor dropout

    Authors: Hao Chen, Thomas Barthel

    Abstract: Tensor networks approximate order-$N$ tensors with a reduced number of degrees of freedom that is only polynomial in $N$ and arranged as a network of partially contracted smaller tensors. As suggested in [arXiv:2205.15296] in the context of quantum many-body physics, computation costs can be further substantially reduced by imposing constraints on the canonical polyadic (CP) rank of the tensors in… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 7 pages, 8 figures

  42. arXiv:2305.19206  [pdf, other]

    math.OC stat.ML

    Gradient descent in matrix factorization: Understanding large initialization

    Authors: Hengchao Chen, Xin Chen, Mohamad Elmasri, Qiang Sun

    Abstract: Gradient Descent (GD) has been proven effective in solving various matrix factorization problems. However, its optimization behavior with large initial values remains less understood. To address this gap, this paper presents a novel theoretical framework for examining the convergence trajectory of GD with a large initialization. The framework is grounded in signal-to-noise ratio concepts and induc… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Published in the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  43. arXiv:2305.16908  [pdf, other]

    stat.ME

    On efficient covariate adjustment selection in causal effect estimation

    Authors: Hongyi Chen, Maurits Kaptein

    Abstract: In order to achieve unbiased and efficient estimators of causal effects from observational data, covariate selection for confounding adjustment becomes an important task in causal inference. Despite recent advancements in graphical criterion for constructing valid and efficient adjustment sets, these methods often rely on assumptions that may not hold in practice. We examine the properties of exis… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  44. arXiv:2305.16904  [pdf, other]

    stat.ME

    A novel framework extending cause-effect inference methods to multivariate causal discovery

    Authors: Hongyi Chen, Maurits Kaptein

    Abstract: We focus on the extension of bivariate causal learning methods into multivariate problem settings in a systematic manner via a novel framework. It is purposive to augment the scale to which bivariate causal discovery approaches can be applied since contrast to traditional causal discovery methods, bivariate methods render estimation in the form of a causal Directed Acyclic Graph (DAG) instead of i… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  45. arXiv:2305.16527  [pdf, other]

    math.ST cs.IT math.NA stat.ML

    When can Regression-Adjusted Control Variates Help? Rare Events, Sobolev Embedding and Minimax Optimality

    Authors: Jose Blanchet, Haoxuan Chen, Yiping Lu, Lexing Ying

    Abstract: This paper studies the use of a machine learning-based estimator as a control variate for mitigating the variance of Monte Carlo sampling. Specifically, we seek to uncover the key factors that influence the efficiency of control variates in reducing variance. We examine a prototype estimation problem that involves simulating the moments of a Sobolev function based on observations obtained from (ra…

    Submitted 25 May, 2023; originally announced May 2023.

  46. arXiv:2305.06862  [pdf, other]

    stat.ML cs.HC cs.LG

    A General Framework for Visualizing Embedding Spaces of Neural Survival Analysis Models Based on Angular Information

    Authors: George H. Chen

    Abstract: We propose a general framework for visualizing any intermediate embedding representation used by any neural survival analysis model. Our framework is based on so-called anchor directions in an embedding space. We show how to estimate these anchor directions using clustering or, alternatively, using user-supplied "concepts" defined by collections of raw inputs (e.g., feature vectors all from female…

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Conference on Health, Inference, and Learning (CHIL 2023)

  47. arXiv:2305.05642  [pdf, ps, other]

    stat.ML cs.LG

    A duality framework for generalization analysis of random feature models and two-layer neural networks

    Authors: Hongrui Chen, Jihao Long, Lei Wu

    Abstract: We consider the problem of learning functions in the $\mathcal{F}_{p,\pi}$ and Barron spaces, which are natural function spaces that arise in the high-dimensional analysis of random feature models (RFMs) and two-layer neural networks. Through a duality analysis, we reveal that the approximation and estimation of these spaces can be considered equivalent in a certain sense. This enables us to focus o…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 42 pages

  48. arXiv:2305.02640  [pdf, other]

    cs.LG cs.AI stat.ME

    Towards Causal Representation Learning and Deconfounding from Indefinite Data

    Authors: Hang Chen, Xinyu Yang, Qing Yang

    Abstract: Owing to the cross-pollination between causal discovery and deep learning, non-statistical data (e.g., images and text) encounters significant conflicts in terms of properties and methods with traditional causal data. To unify these data types of varying forms, we redefine causal data from two novel perspectives and then propose three data paradigms. Among them, the indefinite data (like dialog…

    Submitted 11 August, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  49. arXiv:2305.01143  [pdf, other]

    stat.ML cs.LG

    Understanding the Generalization Ability of Deep Learning Algorithms: A Kernelized Renyi's Entropy Perspective

    Authors: Yuxin Dong, Tieliang Gong, Hong Chen, Chen Li

    Abstract: Recently, information theoretic analysis has become a popular framework for understanding the generalization behavior of deep neural networks. It allows a direct analysis for stochastic gradient/Langevin descent (SGD/SGLD) learning algorithms without strong assumptions such as Lipschitz or convexity conditions. However, the current generalization error bounds within this framework are still far fr…

    Submitted 1 May, 2023; originally announced May 2023.

  50. arXiv:2305.00578  [pdf, other]

    stat.ME

    A new clustering framework

    Authors: Hao Chen, Xiancheng Lin

    Abstract: Detection of clusters is a crucial task across many disciplines such as statistics, engineering and bioinformatics. We mainly focus on the modern high dimensional scenario, where traditional methods could fail due to the curse of dimensionality. In this study, we propose a non-parametric framework for clustering that can be applied to arbitrary dimensions. Simulation results show that this new fra…

    Submitted 30 April, 2023; originally announced May 2023.