
Showing 1–36 of 36 results for author: Gao, H

  1. arXiv:2406.16525  [pdf, other]

    stat.ML cs.LG

    OAML: Outlier Aware Metric Learning for OOD Detection Enhancement

    Authors: Heng Gao, Zhuolin He, Shoumeng Qiu, Jian Pu

    Abstract: Out-of-distribution (OOD) detection methods have been developed to identify objects that a model has not seen during training. The Outlier Exposure (OE) methods use auxiliary datasets to train OOD detectors directly. However, the collection and learning of representative OOD samples may pose challenges. To tackle these issues, we propose the Outlier Aware Metric Learning (OAML) framework. The main…

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.08709  [pdf, other]

    cs.LG stat.ME

    Introducing Diminutive Causal Structure into Graph Representation Learning

    Authors: Hang Gao, Peng Qiao, Yifan Jin, Fengge Wu, Jiangmeng Li, Changwen Zheng

    Abstract: When engaging in end-to-end graph representation learning with Graph Neural Networks (GNNs), the intricate causal relationships and rules inherent in graph data pose a formidable challenge for the model in accurately capturing authentic data relationships. A proposed mitigating strategy involves the direct integration of rules or relationships corresponding to the graph data into the model. Howeve…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2404.19557  [pdf, other]

    stat.ML cs.LG

    Neural Dynamic Data Valuation

    Authors: Zhangyong Liang, Huanhuan Gao, Ji Zhang

    Abstract: Data constitute the foundational component of the data economy and its marketplaces. Efficient and fair data valuation has emerged as a topic of significant interest. Many approaches based on marginal contribution have shown promising results in various downstream tasks. However, they are well known to be computationally expensive as they require training a large number of utility functions, whic…

    Submitted 12 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 43 pages, 19 figures

  4. arXiv:2404.01273  [pdf, other]

    cs.LG cs.CL stat.ME

    TWIN-GPT: Digital Twins for Clinical Trials via Large Language Model

    Authors: Yue Wang, Tianfan Fu, Yinlong Xu, Zihan Ma, Hongxia Xu, Yingzhou Lu, Bang Du, Honghao Gao, Jian Wu

    Abstract: Clinical trials are indispensable for medical research and the development of new treatments. However, clinical trials often involve thousands of participants and can span several years to complete, with a high probability of failure during the process. Recently, there has been a burgeoning interest in virtual clinical trials, which simulate real-world scenarios and hold the potential to significa…

    Submitted 28 June, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  5. arXiv:2402.17157  [pdf, other]

    cs.LG physics.comp-ph physics.flu-dyn stat.ML

    Generative Learning for Forecasting the Dynamics of Complex Systems

    Authors: Han Gao, Sebastian Kaltenbach, Petros Koumoutsakos

    Abstract: We introduce generative models for accelerating simulations of complex systems through learning and evolving their effective dynamics. In the proposed Generative Learning of Effective Dynamics (G-LED), instances of high dimensional data are down sampled to a lower dimensional manifold that is evolved through an auto-regressive attention mechanism. In turn, Bayesian diffusion models, that map this…

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2312.09613  [pdf, other]

    cs.LG cs.AI stat.ML

    Rethinking Causal Relationships Learning in Graph Neural Networks

    Authors: Hang Gao, Chengyu Yao, Jiangmeng Li, Lingyu Si, Yifan Jin, Fengge Wu, Changwen Zheng, Huaping Liu

    Abstract: Graph Neural Networks (GNNs) demonstrate their significance by effectively modeling complex interrelationships within graph-structured data. To enhance the credibility and robustness of GNNs, it becomes exceptionally crucial to bolster their ability to capture causal relationships. However, despite recent advancements that have indeed strengthened GNNs from a causal learning perspective, conductin…

    Submitted 15 December, 2023; originally announced December 2023.

  7. arXiv:2310.10767  [pdf, ps, other]

    cs.LG stat.ML

    Wide Neural Networks as Gaussian Processes: Lessons from Deep Equilibrium Models

    Authors: Tianxiang Gao, Xiaokai Huo, Hailiang Liu, Hongyang Gao

    Abstract: Neural networks with wide layers have attracted significant attention due to their equivalence to Gaussian processes, enabling perfect fitting of training data while maintaining generalization performance, known as benign overfitting. However, existing results mainly focus on shallow or finite-depth networks, necessitating a comprehensive analysis of wide neural networks with infinite-depth layers…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  8. arXiv:2303.10808  [pdf, other]

    stat.ME

    Dimension-agnostic Change Point Detection

    Authors: Hanjia Gao, Runmin Wang, Xiaofeng Shao

    Abstract: Change point testing for high-dimensional data has attracted a lot of attention in statistics and machine learning owing to the emergence of high-dimensional data with structural breaks from many fields. In practice, when the dimension is less than the sample size but is not small, it is often unclear whether a method that is tailored to high-dimensional data or simply a classical method that is d…

    Submitted 3 December, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

  9. arXiv:2302.12322  [pdf, other]

    stat.ME

    Testing Serial Independence of Object-Valued Time Series

    Authors: Feiyu Jiang, Hanjia Gao, Xiaofeng Shao

    Abstract: We propose a novel method for testing serial independence of object-valued time series in metric spaces, which are more general than Euclidean or Hilbert spaces. The proposed method is fully nonparametric, free of tuning parameters, and can capture all nonlinear pairwise dependence. The key concept used in this paper is the distance covariance in metric spaces, which is extended to auto distance co…

    Submitted 27 July, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

  10. arXiv:2212.02724  [pdf, other]

    cs.LG math.OC stat.ML

    Decentralized Stochastic Gradient Descent Ascent for Finite-Sum Minimax Problems

    Authors: Hongchang Gao

    Abstract: Minimax optimization problems have attracted significant attention in recent years due to their widespread application in numerous machine learning models. To solve the minimax problem, a wide variety of stochastic optimization methods have been proposed. However, most of them ignore the distributed setting where the training data is distributed on multiple workers. In this paper, we developed a n…

    Submitted 11 June, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

  11. arXiv:2209.15562  [pdf, other]

    cs.LG stat.ML

    On the optimization and generalization of overparameterized implicit neural networks

    Authors: Tianxiang Gao, Hongyang Gao

    Abstract: Implicit neural networks have become increasingly attractive in the machine learning community since they can achieve competitive performance while using far fewer computational resources. Recently, a line of theoretical works established global convergence for first-order methods such as gradient descent if the implicit networks are over-parameterized. However, as they train all layers together,…

    Submitted 30 September, 2022; originally announced September 2022.

  12. arXiv:2209.09018  [pdf, other]

    eess.SP cs.LG stat.AP

    A Causal Intervention Scheme for Semantic Segmentation of Quasi-periodic Cardiovascular Signals

    Authors: Xingyao Wang, Yuwen Li, Hongxiang Gao, Xianghong Cheng, Jianqing Li, Chengyu Liu

    Abstract: Precise segmentation is a vital first step in analyzing the semantic information of the cardiac cycle and capturing anomalies in cardiovascular signals. However, in the field of deep semantic segmentation, inference is often unilaterally confounded by the individual attributes of the data. For cardiovascular signals, quasi-periodicity is the essential characteristic to be learned, regarded as the synthesize of…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: submitted to IEEE Journal of Biomedical and Health Informatics (J-BHI)

  13. arXiv:2208.08584  [pdf, other]

    cs.LG stat.ME

    Robust Causal Graph Representation Learning against Confounding Effects

    Authors: Hang Gao, Jiangmeng Li, Wenwen Qiang, Lingyu Si, Bing Xu, Changwen Zheng, Fuchun Sun

    Abstract: The prevailing graph neural network models have achieved significant progress in graph representation learning. However, in this paper, we uncover an ever-overlooked phenomenon: the pre-trained graph representation learning model tested with full graphs underperforms the model tested with well-pruned graphs. This observation reveals that there exist confounders in graphs, which may interfere with…

    Submitted 10 February, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: Accepted by AAAI 2023 as Oral Presentation

  14. arXiv:2205.07463  [pdf, other]

    cs.LG math.OC stat.ML

    Gradient Descent Optimizes Infinite-Depth ReLU Implicit Networks with Linear Widths

    Authors: Tianxiang Gao, Hongyang Gao

    Abstract: Implicit deep learning has recently become popular in the machine learning community since these implicit models can achieve competitive performance with state-of-the-art deep networks while using significantly less memory and computational resources. However, our theoretical understanding of when and how first-order methods such as gradient descent (GD) converge on nonlinear implicit net…

    Submitted 16 May, 2022; originally announced May 2022.

  15. arXiv:2110.05645  [pdf, other]

    cs.LG math.OC stat.ML

    A global convergence theory for deep ReLU implicit networks via over-parameterization

    Authors: Tianxiang Gao, Hailiang Liu, Jia Liu, Hridesh Rajan, Hongyang Gao

    Abstract: Implicit deep learning has received increasing attention recently because it generalizes the recursive prediction rules of many commonly used neural network architectures. Its prediction rule is provided implicitly based on the solution of an equilibrium equation. Although a line of recent empirical studies has demonstrated its superior performance, the theoretical understanding of i…

    Submitted 18 February, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted by ICLR 2022

  16. arXiv:2008.10435  [pdf, other]

    cs.LG stat.ML

    Periodic Stochastic Gradient Descent with Momentum for Decentralized Training

    Authors: Hongchang Gao, Heng Huang

    Abstract: Decentralized training has been actively studied in recent years. Although a wide variety of methods have been proposed, the decentralized momentum SGD method is still underexplored. In this paper, we propose a novel periodic decentralized momentum SGD method, which employs the momentum schema and periodic communication for decentralized training. With these two strategies, as well as the topo…

    Submitted 24 August, 2020; originally announced August 2020.

  17. Towards Deeper Graph Neural Networks

    Authors: Meng Liu, Hongyang Gao, Shuiwang Ji

    Abstract: Graph neural networks have shown significant success in the field of graph representation learning. Graph convolutions perform neighborhood aggregation and represent one of the most important graph operations. Nevertheless, one layer of these neighborhood aggregation methods only considers immediate neighbors, and the performance decreases when going deeper to enable larger receptive fields. Severa…

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 11 pages, KDD2020

  18. arXiv:1910.04724  [pdf, other]

    cs.LG stat.ML

    Using Neural Networks for Programming by Demonstration

    Authors: Karan K. Budhraja, Hang Gao, Tim Oates

    Abstract: Agent-based modeling is a paradigm of modeling dynamic systems of interacting agents that are individually governed by specified behavioral rules. Training a model of such agents to produce an emergent behavior by specification of the emergent (as opposed to agent) behavior is easier from a demonstration perspective. Without the involvement of manual behavior specification via code or reliance on…

    Submitted 10 October, 2019; originally announced October 2019.

  19. arXiv:1910.04618  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Universal Adversarial Perturbation for Text Classification

    Authors: Hang Gao, Tim Oates

    Abstract: Given a state-of-the-art deep neural network text classifier, we show the existence of a universal and very small perturbation vector (in the embedding space) that causes natural text to be misclassified with high probability. Unlike images on which a single fixed-size adversarial perturbation can be found, text is of variable length, so we define the "universality" as "token-agnostic", where a si…

    Submitted 10 October, 2019; originally announced October 2019.

  20. arXiv:1908.06729  [pdf, other]

    stat.ML cs.DB cs.LG

    Autoregressive-Model-Based Methods for Online Time Series Prediction with Missing Values: an Experimental Evaluation

    Authors: Xi Chen, Hongzhi Wang, Yanjie Wei, Jianzhong Li, Hong Gao

    Abstract: Time series prediction with missing values is an important problem in time series analysis since complete data is usually hard to obtain in many real-world applications. To model the generation of time series, the autoregressive (AR) model is a basic and widely used choice, which assumes that each observation in the time series is a noisy linear combination of some previous observations along with a cons…

    Submitted 26 August, 2019; v1 submitted 10 August, 2019; originally announced August 2019.

  21. arXiv:1907.04652  [pdf, other]

    cs.LG stat.ML

    Graph Representation Learning via Hard and Channel-Wise Attention Networks

    Authors: Hongyang Gao, Shuiwang Ji

    Abstract: Attention operators have been widely applied in various fields, including computer vision, natural language processing, and network embedding learning. Attention operators on graph data enable learnable weights when aggregating information from neighboring nodes. However, graph attention operators (GAOs) consume excessive computational resources, preventing their applications on large graphs. In…

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: 9 pages, KDD19

  22. arXiv:1906.04819  [pdf, other]

    stat.ML cs.LG

    ADASS: Adaptive Sample Selection for Training Acceleration

    Authors: Shen-Yi Zhao, Hao Gao, Wu-Jun Li

    Abstract: Stochastic gradient descent (SGD) and its variants, including some accelerated variants, have become popular for training in machine learning. However, in SGD and all its existing variants, the sample size in each iteration (epoch) of training is the same as the size of the full training set. In this paper, we propose a new method, called adaptive sample selectio…

    Submitted 17 September, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

  23. arXiv:1905.12960  [pdf, ps, other]

    stat.ML cs.LG

    On the Convergence of Memory-Based Distributed SGD

    Authors: Shen-Yi Zhao, Hao Gao, Wu-Jun Li

    Abstract: Distributed stochastic gradient descent (DSGD) has been widely used for optimizing large-scale machine learning models, including both convex and non-convex models. With the rapid growth of model size, huge communication cost has been the bottleneck of traditional DSGD. Recently, many communication compression methods have been proposed. Memory-based distributed stochastic gradient descent (M-DSGD…

    Submitted 30 May, 2019; originally announced May 2019.

  24. arXiv:1905.12948  [pdf, other]

    stat.ML cs.LG

    Global Momentum Compression for Sparse Communication in Distributed Learning

    Authors: Chang-Wei Shi, Shen-Yi Zhao, Yin-Peng Xie, Hao Gao, Wu-Jun Li

    Abstract: With the rapid growth of data, distributed momentum stochastic gradient descent (DMSGD) has been widely used in distributed learning, especially for training large-scale deep models. Due to the latency and limited bandwidth of the network, communication has become the bottleneck of distributed learning. Communication compression with sparsified gradient, abbreviated as sparse communication,…

    Submitted 3 April, 2024; v1 submitted 30 May, 2019; originally announced May 2019.

  25. arXiv:1905.06310  [pdf, other]

    stat.AP stat.ME

    Fast Parameter Inference in a Biomechanical Model of the Left Ventricle using Statistical Emulation

    Authors: Vinny Davies, Umberto Noè, Alan Lazarus, Hao Gao, Benn Macdonald, Colin Berry, Xiaoyu Luo, Dirk Husmeier

    Abstract: A central problem in biomechanical studies of personalised human left ventricular (LV) modelling is estimating the material properties and biophysical parameters from in-vivo clinical measurements in a time frame suitable for use within a clinic. Understanding these properties can provide insight into heart function or dysfunction and help inform personalised medicine. However, finding a solution…

    Submitted 13 May, 2019; originally announced May 2019.

  26. arXiv:1905.05178  [pdf, other]

    cs.LG stat.ML

    Graph U-Nets

    Authors: Hongyang Gao, Shuiwang Ji

    Abstract: We consider the problem of representation learning for graph data. Convolutional neural networks can naturally operate on images, but have significant challenges in dealing with graph data. Given that images are special cases of graphs whose nodes lie on 2D lattices, graph embedding tasks have a natural correspondence with image pixel-wise prediction tasks such as segmentation. While encoder-decoder arc…

    Submitted 11 May, 2019; originally announced May 2019.

    Comments: 10 pages, ICML19

  27. arXiv:1904.01561  [pdf, other]

    cs.LG stat.ML

    Analyzing Learned Molecular Representations for Property Prediction

    Authors: Kevin Yang, Kyle Swanson, Wengong Jin, Connor Coley, Philipp Eiden, Hua Gao, Angel Guzman-Perez, Timothy Hopper, Brian Kelley, Miriam Mathea, Andrew Palmer, Volker Settels, Tommi Jaakkola, Klavs Jensen, Regina Barzilay

    Abstract: Advancements in neural machinery have led to a wide range of algorithmic solutions for molecular property prediction. Two classes of models in particular have yielded promising results: neural networks applied to computed molecular fingerprints or expert-crafted descriptors, and graph convolutional neural networks that construct a learned molecular representation by operating on the graph structur…

    Submitted 20 November, 2019; v1 submitted 2 April, 2019; originally announced April 2019.

    Journal ref: Journal of chemical information and modeling 59.8 (2019): 3370-3388

  28. arXiv:1901.03040  [pdf, other]

    cs.LG math.OC stat.ML

    Quantized Epoch-SGD for Communication-Efficient Distributed Learning

    Authors: Shen-Yi Zhao, Hao Gao, Wu-Jun Li

    Abstract: Due to its efficiency and ease of implementation, stochastic gradient descent (SGD) has been widely used in machine learning. In particular, SGD is one of the most popular optimization methods for distributed learning. Recently, quantized SGD (QSGD), which adopts quantization to reduce the communication cost in SGD-based distributed learning, has attracted much attention. Although several QSGD methods…

    Submitted 10 January, 2019; originally announced January 2019.

  29. arXiv:1810.11730  [pdf, other]

    cs.LG stat.ML

    Low-shot Learning via Covariance-Preserving Adversarial Augmentation Networks

    Authors: Hang Gao, Zheng Shou, Alireza Zareian, Hanwang Zhang, Shih-Fu Chang

    Abstract: Deep neural networks suffer from over-fitting and catastrophic forgetting when trained with small data. One natural remedy for this problem is data augmentation, which has been recently shown to be effective. However, previous works either assume that intra-class variances can always be generalized to new classes, or employ naive generation methods to hallucinate finite examples without modeling t…

    Submitted 13 December, 2018; v1 submitted 27 October, 2018; originally announced October 2018.

    Journal ref: In Advances in Neural Information Processing Systems, pp. 981-991. 2018

  30. arXiv:1810.04038  [pdf, other]

    cs.LG cs.AI stat.ML

    Understanding and Improving Recurrent Networks for Human Activity Recognition by Continuous Attention

    Authors: Ming Zeng, Haoxiang Gao, Tong Yu, Ole J. Mengshoel, Helge Langseth, Ian Lane, Xiaobing Liu

    Abstract: Deep neural networks, including recurrent networks, have been successfully applied to human activity recognition. Unfortunately, the final representation learned by recurrent networks might encode some noise (irrelevant signal components, unimportant sensor modalities, etc.). Besides, it is difficult to interpret the recurrent networks to gain insight into the models' behavior. To address these is…

    Submitted 7 October, 2018; originally announced October 2018.

    Comments: 8 pages; published in The International Symposium on Wearable Computers (ISWC) 2018

    Journal ref: The International Symposium on Wearable Computers (ISWC) 2018

  31. Large-Scale Learnable Graph Convolutional Networks

    Authors: Hongyang Gao, Zhengyang Wang, Shuiwang Ji

    Abstract: Convolutional neural networks (CNNs) have achieved great success on grid-like data such as images, but face tremendous challenges in learning from more generic data such as graphs. In CNNs, the trainable local filters enable the automatic extraction of high-level features. The computation with filters requires a fixed number of ordered units in the receptive fields. However, the number of neighbor…

    Submitted 12 August, 2018; originally announced August 2018.

    Journal ref: In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (pp. 1416-1424). ACM (2018)

  32. arXiv:1803.06071  [pdf, other]

    cs.DB cs.LG stat.ML

    Impacts of Dirty Data: an Experimental Evaluation

    Authors: Zhixin Qi, Hongzhi Wang, Jianzhong Li, Hong Gao

    Abstract: Data quality issues have attracted widespread attention due to the negative impacts of dirty data on data mining and machine learning results. The relationship between data quality and the accuracy of results could be applied to the selection of an appropriate algorithm, taking data quality into consideration, and to the determination of the share of data to clean. However, little research has focused on e…

    Submitted 26 April, 2021; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: 22 pages, 192 figures

  33. arXiv:1802.03604  [pdf, other]

    cs.LG cs.DC stat.ML

    Feature-Distributed SVRG for High-Dimensional Linear Classification

    Authors: Gong-Duo Zhang, Shen-Yi Zhao, Hao Gao, Wu-Jun Li

    Abstract: Linear classification has been widely used in many high-dimensional applications like text classification. To perform linear classification for large-scale tasks, we often need to design distributed learning methods on a cluster of multiple machines. In this paper, we propose a new distributed learning method, called feature-distributed stochastic variance reduced gradient (FD-SVRG) for high-dimen…

    Submitted 10 February, 2018; originally announced February 2018.

  34. arXiv:1711.00629  [pdf, other]

    stat.ML cs.LG q-bio.NC

    Sleep Stage Classification Based on Multi-level Feature Learning and Recurrent Neural Networks via Wearable Device

    Authors: Xin Zhang, Weixuan Kou, Eric I-Chao Chang, He Gao, Yubo Fan, Yan Xu

    Abstract: This paper proposes a practical approach for automatic sleep stage classification based on a multi-level feature learning framework and Recurrent Neural Network (RNN) classifier using heart rate and wrist actigraphy derived from a wearable device. The feature learning framework is designed to extract low- and mid-level features. Low-level features capture temporal and frequency domain properties a…

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: 11 pages, 10 figures

  35. arXiv:1705.06820  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Pixel Deconvolutional Networks

    Authors: Hongyang Gao, Hao Yuan, Zhengyang Wang, Shuiwang Ji

    Abstract: Deconvolutional layers have been widely used in a variety of deep models for up-sampling, including encoder-decoder networks for semantic segmentation and deep generative models for unsupervised learning. One of the key limitations of deconvolutional operations is that they result in the so-called checkerboard problem. This is caused by the fact that no direct relationship exists among adjacent pi…

    Submitted 26 November, 2017; v1 submitted 18 May, 2017; originally announced May 2017.

    Comments: 11 pages

  36. arXiv:1106.2124  [pdf]

    physics.med-ph cs.CV math.NA stat.AP

    Omni-tomography/Multi-tomography -- Integrating Multiple Modalities for Simultaneous Imaging

    Authors: Ge Wang, Jie Zhang, Hao Gao, Victor Weir, Hengyong Yu, Wenxiang Cong, Xiaochen Xu, Haiou Shen, James Bennett, Yue Wang, Michael Vannier

    Abstract: Current tomographic imaging systems need major improvements, especially when multi-dimensional, multi-scale, multi-temporal and multi-parametric phenomena are under investigation. Both preclinical and clinical imaging now depend on in vivo tomography, often requiring separate evaluations by different imaging modalities to define morphologic details, delineate interval changes due to disease or int…

    Submitted 10 June, 2011; originally announced June 2011.

    Comments: 43 pages, 15 figures, 99 references, provisional patent applications filed by Virginia Tech