Skip to main content

Showing 1–30 of 30 results for author: Huo, X

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.12187  [pdf, ps, other

    stat.ML cs.LG math.ST

    Approximation of RKHS Functionals by Neural Networks

    Authors: Tian-Yi Zhou, Namjoon Suh, Guang Cheng, Xiaoming Huo

    Abstract: Motivated by the abundance of functional data such as time series and images, there has been a growing interest in integrating such data into neural networks and learning maps from function spaces to R (i.e., functionals). In this paper, we study the approximation of functionals on reproducing kernel Hilbert spaces (RKHS's) using neural networks. We establish the universality of the approximation… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  2. arXiv:2401.15262  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Asymptotic Behavior of Adversarial Training Estimator under $\ell_\infty$-Perturbation

    Authors: Yiling Xie, Xiaoming Huo

    Abstract: Adversarial training has been proposed to hedge against adversarial attacks in machine learning and statistical models. This paper focuses on adversarial training under $\ell_\infty$-perturbation, which has recently attracted much research attention. The asymptotic behavior of the adversarial training estimator is investigated in the generalized linear model. The results imply that the limiting di… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  3. arXiv:2401.04286  [pdf, ps, other

    stat.ML cs.LG

    Universal Consistency of Wide and Deep ReLU Neural Networks and Minimax Optimal Convergence Rates for Kolmogorov-Donoho Optimal Function Classes

    Authors: Hyunouk Ko, Xiaoming Huo

    Abstract: In this paper, we prove the universal consistency of wide and deep ReLU neural network classifiers trained on the logistic loss. We also give sufficient conditions for a class of probability measures for which classifiers based on neural networks achieve minimax optimal rates of convergence. The result applies to a wide range of known function classes. In particular, while most previous works impo… ▽ More

    Submitted 30 January, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  4. arXiv:2310.10767  [pdf, ps, other

    cs.LG stat.ML

    Wide Neural Networks as Gaussian Processes: Lessons from Deep Equilibrium Models

    Authors: Tianxiang Gao, Xiaokai Huo, Hailiang Liu, Hongyang Gao

    Abstract: Neural networks with wide layers have attracted significant attention due to their equivalence to Gaussian processes, enabling perfect fitting of training data while maintaining generalization performance, known as benign overfitting. However, existing results mainly focus on shallow or finite-depth networks, necessitating a comprehensive analysis of wide neural networks with infinite-depth layers… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  5. arXiv:2309.15075  [pdf, other

    stat.ML cs.LG math.ST

    On Excess Risk Convergence Rates of Neural Network Classifiers

    Authors: Hyunouk Ko, Namjoon Suh, Xiaoming Huo

    Abstract: The recent success of neural networks in pattern recognition and classification problems suggests that neural networks possess qualities distinct from other more classical classifiers such as SVMs or boosting classifiers. This paper studies the performance of plug-in classifiers based on neural networks in a binary classification setting as measured by their excess risks. Compared to the typical s… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  6. arXiv:2308.08030  [pdf, other

    stat.ML cs.LG math.ST

    Classification of Data Generated by Gaussian Mixture Models Using Deep ReLU Networks

    Authors: Tian-Yi Zhou, Xiaoming Huo

    Abstract: This paper studies the binary classification of unbounded data from ${\mathbb R}^d$ generated under Gaussian Mixture Models (GMMs) using deep ReLU neural networks. We obtain $\unicode{x2013}$ for the first time $\unicode{x2013}$ non-asymptotic upper bounds and convergence rates of the excess risk (excess misclassification error) for the classification without restrictions on model parameters. The… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  7. arXiv:2307.05109  [pdf, other

    cs.LG stat.ML

    Conformalization of Sparse Generalized Linear Models

    Authors: Etash Kumar Guha, Eugene Ndiaye, Xiaoming Huo

    Abstract: Given a sequence of observable variables $\{(x_1, y_1), \ldots, (x_n, y_n)\}$, the conformal prediction method estimates a confidence set for $y_{n+1}$ given $x_{n+1}$ that is valid for any finite sample size by merely assuming that the joint distribution of the data is permutation invariant. Although attractive, computing such a set is computationally infeasible in most regression problems. Indee… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: ICML 2023

  8. arXiv:2303.15579  [pdf, other

    stat.ML cs.LG

    Adjusted Wasserstein Distributionally Robust Estimator in Statistical Learning

    Authors: Yiling Xie, Xiaoming Huo

    Abstract: We propose an adjusted Wasserstein distributionally robust estimator -- based on a nonlinear transformation of the Wasserstein distributionally robust (WDRO) estimator in statistical learning. The classic WDRO estimator is asymptotically biased, while our adjusted WDRO estimator is asymptotically unbiased, resulting in a smaller asymptotic mean squared error. Further, under certain conditions, our… ▽ More

    Submitted 9 May, 2024; v1 submitted 27 March, 2023; originally announced March 2023.

  9. arXiv:2303.03576  [pdf, other

    stat.CO stat.ML

    A Survey of Numerical Algorithms that can Solve the Lasso Problems

    Authors: Yujie Zhao, Xiaoming Huo

    Abstract: In statistics, the least absolute shrinkage and selection operator (Lasso) is a regression method that performs both variable selection and regularization. There is a lot of literature available, discussing the statistical properties of the regression coefficients estimated by the Lasso method. However, there lacks a comprehensive review discussing the algorithms to solve the optimization problem… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  10. arXiv:2301.09675  [pdf, other

    math.OC stat.ML

    Improved Rate of First Order Algorithms for Entropic Optimal Transport

    Authors: Yiling Luo, Yiling Xie, Xiaoming Huo

    Abstract: This paper improves the state-of-the-art rate of a first-order algorithm for solving entropy regularized optimal transport. The resulting rate for approximating the optimal transport (OT) has been improved from $\widetilde{O}({n^{2.5}}/ε)$ to $\widetilde{O}({n^2}/ε)$, where $n$ is the problem size and $ε$ is the accuracy level. In particular, we propose an accelerated primal-dual stochastic mirror… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  11. arXiv:2212.01259  [pdf, other

    stat.ML cs.LG

    Covariance Estimators for the ROOT-SGD Algorithm in Online Learning

    Authors: Yiling Luo, Xiaoming Huo, Yajun Mei

    Abstract: Online learning naturally arises in many statistical and machine learning problems. The most widely used methods in online learning are stochastic first-order algorithms. Among this family of algorithms, there is a recently developed algorithm, Recursive One-Over-T SGD (ROOT-SGD). ROOT-SGD is advantageous in that it converges at a non-asymptotically fast rate, and its estimator further converges t… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  12. arXiv:2210.16645  [pdf, other

    math.OC stat.ML

    Solving a Special Type of Optimal Transport Problem by a Modified Hungarian Algorithm

    Authors: Yiling Xie, Yiling Luo, Xiaoming Huo

    Abstract: Computing the empirical Wasserstein distance in the Wasserstein-distance-based independence test is an optimal transport (OT) problem with a special structure. This observation inspires us to study a special type of OT problem and propose a modified Hungarian algorithm to solve it exactly. For the OT problem involving two marginals with $m$ and $n$ atoms ($m\geq n$), respectively, the computationa… ▽ More

    Submitted 28 February, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

  13. arXiv:2210.14184  [pdf, other

    stat.ML cs.LG

    Learning Ability of Interpolating Deep Convolutional Neural Networks

    Authors: Tian-Yi Zhou, Xiaoming Huo

    Abstract: It is frequently observed that overparameterized neural networks generalize well. Regarding such phenomena, existing theoretical work mainly devotes to linear settings or fully-connected neural networks. This paper studies the learning ability of an important family of deep neural networks, deep convolutional neural networks (DCNNs), under both underparameterized and overparameterized settings. We… ▽ More

    Submitted 16 August, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

  14. Hot-spots Detection in Count Data by Poisson Assisted Smooth Sparse Tensor Decomposition

    Authors: Yujie Zhao, Xiaoming Huo, Yajun Mei

    Abstract: Count data occur widely in many bio-surveillance and healthcare applications, e.g., the numbers of new patients of different types of infectious diseases from different cities/counties/states repeatedly over time, say, daily/weekly/monthly. For this type of count data, one important task is the quick detection and localization of hot-spots in terms of unusual infectious rates so that we can respon… ▽ More

    Submitted 1 June, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: 7 figures, 22 pages, 4 tables

    Journal ref: Journal of Applied Statistics, 2022

  15. The Directional Bias Helps Stochastic Gradient Descent to Generalize in Kernel Regression Models

    Authors: Yiling Luo, Xiaoming Huo, Yajun Mei

    Abstract: We study the Stochastic Gradient Descent (SGD) algorithm in nonparametric statistics: kernel regression in particular. The directional bias property of SGD, which is known in the linear regression setting, is generalized to the kernel regression. More specifically, we prove that SGD with moderate and annealing step-size converges along the direction of the eigenvector that corresponds to the large… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

  16. Implicit Regularization Properties of Variance Reduced Stochastic Mirror Descent

    Authors: Yiling Luo, Xiaoming Huo, Yajun Mei

    Abstract: In machine learning and statistical data analysis, we often run into objective function that is a summation: the number of terms in the summation possibly is equal to the sample size, which can be enormous. In such a setting, the stochastic mirror descent (SMD) algorithm is a numerically efficient method -- each iteration involving a very small subset of the data. The variance reduction version of… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

  17. arXiv:2203.00813  [pdf, other

    stat.ML cs.DS math.OC

    An Accelerated Stochastic Algorithm for Solving the Optimal Transport Problem

    Authors: Yiling Xie, Yiling Luo, Xiaoming Huo

    Abstract: A primal-dual accelerated stochastic gradient descent with variance reduction algorithm (PDASGD) is proposed to solve linear-constrained optimization problems. PDASGD could be applied to solve the discrete optimal transport (OT) problem and enjoys the best-known computational complexity -- $\widetilde{\mathcal{O}}(n^2/ε)$, where $n$ is the number of atoms, and $ε>0$ is the accuracy. In the literat… ▽ More

    Submitted 29 May, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: Compared with previous versions, both theoretical complexity and numerical performances have been improved for solving the OT problem in this version

  18. arXiv:2103.10231  [pdf, ps, other

    stat.ME

    Identification of Partial-Differential-Equations-Based Models from Noisy Data via Splines

    Authors: Yujie Zhao, Xiaoming Huo, Yajun Mei

    Abstract: We propose a two-stage method called \textit{Spline Assisted Partial Differential Equation based Model Identification (SAPDEMI)} to identify partial differential equation (PDE)-based models from noisy data. In the first stage, we employ the cubic splines to estimate unobservable derivatives. The underlying PDE is based on a subset of these derivatives. This stage is computationally efficient: its… ▽ More

    Submitted 6 March, 2023; v1 submitted 18 March, 2021; originally announced March 2021.

  19. arXiv:2103.07045  [pdf, ps, other

    math.NA stat.ML

    Asymptotic Theory of $\ell_1$-Regularized PDE Identification from a Single Noisy Trajectory

    Authors: Yuchen He, Namjoon Suh, Xiaoming Huo, Sungha Kang, Yajun Mei

    Abstract: We prove the support recovery for a general class of linear and nonlinear evolutionary partial differential equation (PDE) identification from a single noisy trajectory using $\ell_1$ regularized Pseudo-Least Squares model~($\ell_1$-PsLS). In any associative $\mathbb{R}$-algebra generated by finitely many differentiation operators that contain the unknown PDE operator, applying $\ell_1$-PsLS to a… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

    Comments: 38 pages, 6 figures

  20. arXiv:2010.13934  [pdf, other

    stat.ML cs.LG stat.CO

    Accelerate the Warm-up Stage in the Lasso Computation via a Homotopic Approach

    Authors: Yujie Zhao, Xiaoming Huo

    Abstract: In optimization, it is known that when the objective functions are strictly convex and well-conditioned, gradient-based approaches can be extremely effective, e.g., achieving the exponential rate of convergence. On the other hand, the existing Lasso-type estimator in general cannot achieve the optimal rate due to the undesirable behavior of the absolute function at the origin. A homotopic method i… ▽ More

    Submitted 6 March, 2023; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: 19 pages, 3 figures, 3 tables

  21. arXiv:2009.09310  [pdf, other

    math.ST stat.AP

    Fast and Asymptotically Powerful Detection for Filamentary Objects in Digital Images

    Authors: Kai Ni, Shanshan Cao, Xiaoming Huo

    Abstract: Given an inhomogeneous chain embedded in a noisy image, we consider the conditions under which such an embedded chain is detectable. Many applications, such as detecting moving objects, detecting ship wakes, can be abstracted as the detection on the existence of chains. In this work, we provide the detection algorithm with low order of computation complexity to detect the chain and the optimal the… ▽ More

    Submitted 19 September, 2020; originally announced September 2020.

    Comments: 13 pages, 8 figures

  22. arXiv:2001.00068  [pdf, other

    stat.AP eess.IV

    Asymptotic convergence rate of the longest run in an inflating Bernoulli net

    Authors: Kai Ni, Shanshan Cao, Xiaoming Huo

    Abstract: In image detection, one problem is to test whether the set, though mostly consisting of uniformly scattered points, also contains a small fraction of points sampled from some (a priori unknown) curve, for example, a curve with $C^α$-norm bounded by $β$. One approach is to analyze the data by counting membership in multiscale multianisotropic strips, which involves an algorithm that delves into the… ▽ More

    Submitted 31 December, 2019; originally announced January 2020.

  23. arXiv:1912.00524  [pdf, other

    stat.ML cs.LG

    Factor Analysis on Citation, Using a Combined Latent and Logistic Regression Model

    Authors: Namjoon Suh, Xiaoming Huo, Eric Heim, Lee Seversky

    Abstract: We propose a combined model, which integrates the latent factor model and the logistic regression model, for the citation network. It is noticed that neither a latent factor model nor a logistic regression model alone is sufficient to capture the structure of the data. The proposed model has a latent (i.e., factor analysis) model to represents the main technological trends (a.k.a., factors), and a… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: Citation network, matrix decomposition, latent variable model, logistic regression model, convex optimization, alternating direction method of multiplier

  24. arXiv:1911.03592  [pdf, other

    stat.AP

    Optimal Shape Control via $L_\infty$ Loss for Composite Fuselage Assembly

    Authors: Juan Du, Shanshan Cao, Jeffrey H. Hunt, Xiaoming Huo

    Abstract: Shape control is critical to ensure the quality of composite fuselage assembly. In current practice, the structures are adjusted to the design shape in terms of the $\ell_2$ loss for further assembly without considering the existing dimensional gap between two structures. Such practice has two limitations: (1) the design shape may not be the optimal shape in terms of a pair of incoming fuselages w… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 31 pages, 10 figures

  25. arXiv:1911.02753  [pdf, other

    stat.CO

    Optimal Projections in the Distance-Based Statistical Methods

    Authors: Chuan** Yu, Xiaoming Huo

    Abstract: This paper introduces a new way to calculate distance-based statistics, particularly when the data are multivariate. The main idea is to pre-calculate the optimal projection directions given the variable dimension, and to project multidimensional variables onto these pre-specified projection directions; by subsequently utilizing the fast algorithm that is developed in Huo and Székely [2016] for th… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

  26. arXiv:1903.00037  [pdf, other

    stat.ME

    Distance-Based Independence Screening for Canonical Analysis

    Authors: Yi** Ni, Chuan** Yu, Andy Ko, Xiaoming Huo

    Abstract: This paper introduces a novel method called Distance-Based Independence Screening for Canonical Analysis (DISCA) that performs simultaneous dimension reduction for a pair of random variables by optimizing the distance covariance (dCov). dCov is a statistic first proposed by Székely et al. [2009] for independence testing. Compared with sufficient dimension reduction (SDR) and canonical correlation… ▽ More

    Submitted 12 October, 2023; v1 submitted 28 February, 2019; originally announced March 2019.

    Comments: 33 pages

  27. arXiv:1707.04602  [pdf, other

    stat.ME

    An Efficient and Distribution-Free Two-Sample Test Based on Energy Statistics and Random Projections

    Authors: Cheng Huang, Xiaoming Huo

    Abstract: A common disadvantage in existing distribution-free two-sample testing approaches is that the computational complexity could be high. Specifically, if the sample size is $N$, the computational complexity of those two-sample tests is at least $O(N^2)$. In this paper, we develop an efficient algorithm with complexity $O(N \log N)$ for computing energy statistics in univariate cases. For multivariate… ▽ More

    Submitted 14 July, 2017; originally announced July 2017.

    Comments: 27 pages, 6 figures

  28. arXiv:1701.06054  [pdf, ps, other

    stat.ME

    A Statistically and Numerically Efficient Independence Test based on Random Projections and Distance Covariance

    Authors: Cheng Huang, Xiaoming Huo

    Abstract: Test of independence plays a fundamental role in many statistical techniques. Among the nonparametric approaches, the distance-based methods (such as the distance correlation based hypotheses testing for independence) have numerous advantages, comparing with many other alternatives. A known limitation of the distance-based method is that its computational complexity can be high. In general, when t… ▽ More

    Submitted 21 January, 2017; originally announced January 2017.

    Comments: 52 pages, 8 figures, technical paper

    MSC Class: Primary 62G10; 62H20; 62H15; secondary 62G20

  29. arXiv:1511.01443  [pdf, ps, other

    stat.ME cs.DC stat.ML

    A Distributed One-Step Estimator

    Authors: Cheng Huang, Xiaoming Huo

    Abstract: Distributed statistical inference has recently attracted enormous attention. Many existing work focuses on the averaging estimator. We propose a one-step approach to enhance a simple-averaging based distributed estimator. We derive the corresponding asymptotic properties of the newly proposed estimator. We find that the proposed one-step estimator enjoys the same asymptotic properties as the centr… ▽ More

    Submitted 10 November, 2015; v1 submitted 4 November, 2015; originally announced November 2015.

    Comments: 31 pages

  30. arXiv:1410.1503  [pdf, ps, other

    stat.CO stat.ME

    Fast Computing for Distance Covariance

    Authors: Xiaoming Huo, Gabor J. Szekely

    Abstract: Distance covariance and distance correlation have been widely adopted in measuring dependence of a pair of random variables or random vectors. If the computation of distance covariance and distance correlation is implemented directly accordingly to its definition then its computational complexity is O($n^2$) which is a disadvantage compared to other faster methods. In this paper we show that the c… ▽ More

    Submitted 6 October, 2014; originally announced October 2014.

    Comments: 38 pages, 6 tables, 5 figures. arXiv admin note: text overlap with arXiv:1205.4701 by other authors