Showing 1–50 of 56 results for author: Huang, D

Searching in archive stat.
  1. arXiv:2406.05428  [pdf, other]

    cs.IT math.ST stat.ML

    Information-Theoretic Thresholds for the Alignments of Partially Correlated Graphs

    Authors: Dong Huang, Xianwen Song, Pengkun Yang

    Abstract: This paper studies the problem of recovering the hidden vertex correspondence between two correlated random graphs. We propose the partially correlated Erdős-Rényi graphs model, wherein a pair of induced subgraphs with a certain number are correlated. We investigate the information-theoretic thresholds for recovering the latent correlated subgraphs and the hidden vertex correspondence. We prove th…

    Submitted 8 June, 2024; originally announced June 2024.

  2. arXiv:2406.03683  [pdf, other]

    cs.LG stat.ML

    Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models

    Authors: Ding Huang, Ting Li, Jian Huang

    Abstract: We propose a Bayesian framework for fine-tuning large diffusion models with a novel network structure called Bayesian Power Steering (BPS). We clarify the meaning behind adaptation from a large probability space to a small probability space and explore the task of fine-tuning pre-trained models using learnable modules from a Bayesian perspective. BPS extracts task-specific knowle…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 25 pages, 26 figures, and 4 tables

    MSC Class: 62G05; 68T07

  3. arXiv:2405.16672  [pdf, other]

    stat.ML cs.LG stat.ME

    Transfer Learning Under High-Dimensional Graph Convolutional Regression Model for Node Classification

    Authors: Jiachen Chen, Danyang Huang, Liyuan Wang, Kathryn L. Lunetta, Debarghya Mukherjee, Huimin Cheng

    Abstract: Node classification is a fundamental task, but obtaining node classification labels can be challenging and expensive in many real-world scenarios. Transfer learning has emerged as a promising solution to address this challenge by leveraging knowledge from source domains to enhance learning in a target domain. Existing transfer learning methods for node classification primarily focus on integrating…

    Submitted 26 May, 2024; originally announced May 2024.

  4. arXiv:2403.16773  [pdf, other]

    stat.ME econ.EM

    Privacy-Protected Spatial Autoregressive Model

    Authors: Danyang Huang, Ziyi Kong, Shuyuan Wu, Hansheng Wang

    Abstract: Spatial autoregressive (SAR) models are important tools for studying network effects. However, with an increasing emphasis on data privacy, data providers often implement privacy protection measures that make classical SAR models inapplicable. In this study, we introduce a privacy-protected SAR model with noise-added response and covariates to meet privacy-protection requirements. However, in this…

    Submitted 25 March, 2024; originally announced March 2024.

  5. arXiv:2403.13118  [pdf, other]

    stat.ME cs.LG math.DS math.SP stat.ML

    Modal Analysis of Spatiotemporal Data via Multivariate Gaussian Process Regression

    Authors: Jiwoo Song, Daning Huang

    Abstract: Modal analysis has become an essential tool to understand the coherent structure of complex flows. The classical modal analysis methods, such as dynamic mode decomposition (DMD) and spectral proper orthogonal decomposition (SPOD), rely on a sufficient amount of data that is regularly sampled in time. However, often one needs to deal with sparse temporally irregular data, e.g., due to experimental…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 43 pages, 35 figures

  6. arXiv:2403.11163  [pdf, ps, other]

    stat.ME cs.LG math.ST stat.CO

    A Selective Review on Statistical Methods for Massive Data Computation: Distributed Computing, Subsampling, and Minibatch Techniques

    Authors: Xuetong Li, Yuan Gao, Hong Chang, Danyang Huang, Yingying Ma, Rui Pan, Haobo Qi, Feifei Wang, Shuyuan Wu, Ke Xu, Jing Zhou, Xuening Zhu, Yingqiu Zhu, Hansheng Wang

    Abstract: This paper presents a selective review of statistical computation methods for massive data analysis. A large number of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three categories of statistical computation methods: (1) distributed computing, (2) subsampling methods, and (3) minibatch gradient techniques. The first clas…

    Submitted 17 March, 2024; originally announced March 2024.

  7. arXiv:2402.06031  [pdf, other]

    cs.LG math.ST stat.ML

    An operator learning perspective on parameter-to-observable maps

    Authors: Daniel Zhengyu Huang, Nicholas H. Nelsen, Margaret Trautner

    Abstract: Computationally efficient surrogates for parametrized physical models play a crucial role in science and engineering. Operator learning provides data-driven surrogates that map between function spaces. However, instead of full-field measurements, often the available data are only finite-dimensional parametrizations of model inputs or finite observables of model outputs. Building on Fourier Neural…

    Submitted 6 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 63 pages, 10 figures, 1 table

    MSC Class: 68T07; 62G20; 65J15

  8. arXiv:2402.01148  [pdf, other]

    math.ST cs.LG stat.ML

    The Optimality of Kernel Classifiers in Sobolev Space

    Authors: Jianfa Lai, Zhifan Li, Dongming Huang, Qian Lin

    Abstract: Kernel methods are widely used in machine learning, especially for classification problems. However, the theoretical analysis of kernel classification is still limited. This paper investigates the statistical performances of kernel classifiers. With some mild assumptions on the conditional probability $\eta(x)=\mathbb{P}(Y=1\mid X=x)$, we derive an upper bound on the classification excess risk of a k…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 21 pages, 2 figures

    MSC Class: 62G08 (Primary); 68T07; 46E22 (Secondary)

    ACM Class: G.3

  9. arXiv:2401.10903  [pdf, other]

    q-fin.ST cs.LG stat.AP

    Application of Machine Learning in Stock Market Forecasting: A Case Study of Disney Stock

    Authors: Dengxin Huang

    Abstract: This document presents a stock market analysis conducted on a dataset consisting of 750 instances and 16 attributes donated in 2014-10-23. The analysis includes an exploratory data analysis (EDA) section, feature engineering, data preparation, model selection, and insights from the analysis. The Fama French 3-factor model is also utilized in the analysis. The results of the analysis are presented,…

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: 9 pages, 7 figures

  10. arXiv:2312.08728  [pdf, other]

    stat.CO

    Mini-batch Gradient Descent with Buffer

    Authors: Haobo Qi, Du Huang, Yingqiu Zhu, Danyang Huang, Hansheng Wang

    Abstract: In this paper, we studied a buffered mini-batch gradient descent (BMGD) algorithm for training complex models on massive datasets. The algorithm studied here is designed for fast training on a GPU-CPU system, which contains two steps: the buffering step and the computation step. In the buffering step, a large batch of data (i.e., a buffer) is loaded from the hard drive to the graphical memory of G…

    Submitted 14 December, 2023; originally announced December 2023.

  11. arXiv:2312.05579  [pdf, other]

    stat.ML cs.LG

    Conditional Stochastic Interpolation for Generative Learning

    Authors: Ding Huang, Jian Huang, Ting Li, Guohao Shen

    Abstract: We propose a conditional stochastic interpolation (CSI) approach to learning conditional distributions. CSI learns probability flow equations or stochastic differential equations that transport a reference distribution to the target conditional distribution. This is achieved by first learning the drift function and the conditional score function based on conditional stochastic interpolation, which…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: 44 pages, 4 figures

  12. arXiv:2312.01815  [pdf, other]

    stat.ME

    Hypothesis Testing in Gaussian Graphical Models: Novel Goodness-of-Fit Tests and Conditional Randomization Tests

    Authors: Xiaotong Lin, Fangqiao Tian, Dongming Huang

    Abstract: We introduce novel hypothesis testing methods for Gaussian graphical models, whose foundation is an innovative algorithm that generates exchangeable copies from these models. We utilize the exchangeable copies to formulate a goodness-of-fit test, which is valid in both low and high-dimensional settings and flexible in choosing the test statistic. This test exhibits superior power performance, espe…

    Submitted 4 December, 2023; originally announced December 2023.

    MSC Class: 62F03; 62H15

  13. arXiv:2312.01411  [pdf, other]

    stat.ME

    Bayesian inference on Cox regression models using catalytic prior distributions

    Authors: Weihao Li, Dongming Huang

    Abstract: The Cox proportional hazards model (Cox model) is a popular model for survival data analysis. When the sample size is small relative to the dimension of the model, the standard maximum partial likelihood inference is often problematic. In this work, we propose the Cox catalytic prior distributions for Bayesian inference on Cox models, which is an extension of a general class of prior distributions…

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 34 pages

  14. arXiv:2310.03597  [pdf, other]

    stat.ML cs.LG math.DS math.NA

    Sampling via Gradient Flows in the Space of Probability Measures

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M Stuart

    Abstract: Sampling a target probability distribution with an unknown normalization constant is a fundamental challenge in computational science and engineering. Recent work shows that algorithms derived by considering gradient flows in the space of probability measures open up new avenues for algorithm development. This paper makes three contributions to this sampling approach by scrutinizing the design com…

    Submitted 9 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Related and text overlap with arXiv:2302.11024

  15. arXiv:2306.10915  [pdf, other]

    stat.ML cs.LG

    Practical Equivariances via Relational Conditional Neural Processes

    Authors: Daolang Huang, Manuel Haussmann, Ulpu Remes, ST John, Grégoire Clarté, Kevin Sebastian Luck, Samuel Kaski, Luigi Acerbi

    Abstract: Conditional Neural Processes (CNPs) are a class of metalearning models popular for combining the runtime efficiency of amortized inference with reliable uncertainty quantification. Many relevant machine learning tasks, such as in spatio-temporal modeling, Bayesian Optimization and continuous control, inherently contain equivariances -- for example to translation -- which the model can exploit for…

    Submitted 5 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 38 pages, 8 figures. Accepted at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  16. arXiv:2306.04111  [pdf, other]

    cs.LG cs.DC stat.ME

    Quasi-Newton Updating for Large-Scale Distributed Learning

    Authors: Shuyuan Wu, Danyang Huang, Hansheng Wang

    Abstract: Distributed computing is critically important for modern statistical analysis. Herein, we develop a distributed quasi-Newton (DQN) framework with excellent statistical, computation, and communication efficiency. In the DQN method, no Hessian matrix inversion or communication is needed. This considerably reduces the computation and communication complexity of the proposed method. Notably, related e…

    Submitted 11 June, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 56 pages, 3 figures

  17. arXiv:2305.15871  [pdf, other]

    stat.ML cs.LG stat.CO

    Learning Robust Statistics for Simulation-based Inference under Model Misspecification

    Authors: Daolang Huang, Ayush Bharti, Amauri Souza, Luigi Acerbi, Samuel Kaski

    Abstract: Simulation-based inference (SBI) methods such as approximate Bayesian computation (ABC), synthetic likelihood, and neural posterior estimation (NPE) rely on simulating statistics to infer parameters of intractable likelihood models. However, such methods are known to yield untrustworthy and misleading inference outcomes under model misspecification, thus hindering their widespread applicability. I…

    Submitted 5 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 22 pages, 13 figures, Published at NeurIPS 2023

  18. arXiv:2304.06900  [pdf, other]

    stat.ME

    Subsampling-Based Modified Bayesian Information Criterion for Large-Scale Stochastic Block Models

    Authors: Jiayi Deng, Danyang Huang, Xiangyu Chang, Bo Zhang

    Abstract: Identifying the number of communities is a fundamental problem in community detection, which has received increasing attention recently. However, rapid advances in technology have led to the emergence of large-scale networks in various disciplines, thereby making existing methods computationally infeasible. To address this challenge, we propose a novel subsampling-based modified Bayesian informati…

    Submitted 13 April, 2023; originally announced April 2023.

  19. arXiv:2302.11024  [pdf, other]

    stat.ML math.NA

    Gradient Flows for Sampling: Mean-Field Models, Gaussian Approximations and Affine Invariance

    Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

    Abstract: Sampling a probability distribution with an unknown normalization constant is a fundamental problem in computational science and engineering. This task may be cast as an optimization problem over all probability measures, and an initial distribution can be evolved to the desired minimizer dynamically via gradient flows. Mean-field models, whose law is governed by the gradient flow in the space of…

    Submitted 2 November, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: 82 pages, 8 figures (Welcome any feedback!)

  20. arXiv:2302.05872  [pdf, other]

    cs.CV cs.LG stat.ML

    I$^2$SB: Image-to-Image Schrödinger Bridge

    Authors: Guan-Horng Liu, Arash Vahdat, De-An Huang, Evangelos A. Theodorou, Weili Nie, Anima Anandkumar

    Abstract: We propose Image-to-Image Schrödinger Bridge (I$^2$SB), a new class of conditional diffusion models that directly learn the nonlinear diffusion processes between two given distributions. These diffusion bridges are particularly useful for image restoration, as the degraded images are structurally informative priors for reconstructing the clean images. I$^2$SB belongs to a tractable class of Schröd…

    Submitted 25 May, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: ICML camera ready (high-resolution figures)

  21. arXiv:2208.14123  [pdf, other]

    stat.ME

    Catalytic Priors: Using Synthetic Data to Specify Prior Distributions in Bayesian Analysis

    Authors: Dongming Huang, Feicheng Wang, Donald B. Rubin, S. C. Kou

    Abstract: Catalytic prior distributions provide general, easy-to-use, and interpretable specifications of prior distributions for Bayesian analysis. They are particularly beneficial when the observed data are inadequate to stably estimate a complex target model. A catalytic prior distribution is constructed by augmenting the observed data with synthetic data that are sampled from the predictive distribution…

    Submitted 22 September, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

  22. arXiv:2205.08364  [pdf, other]

    cs.LG stat.ME stat.ML

    Network Gradient Descent Algorithm for Decentralized Federated Learning

    Authors: Shuyuan Wu, Danyang Huang, Hansheng Wang

    Abstract: We study a fully decentralized federated learning algorithm, which is a novel gradient descent algorithm executed on a communication-based network. For convenience, we refer to it as a network gradient descent (NGD) method. In the NGD method, only statistics (e.g., parameter estimates) need to be communicated, minimizing the risk of privacy. Meanwhile, different clients communicate with each other…

    Submitted 5 May, 2022; originally announced May 2022.

  23. arXiv:2204.08247  [pdf, other]

    cs.CV cs.LG stat.ML

    Joint Multi-view Unsupervised Feature Selection and Graph Learning

    Authors: Si-Guo Fang, Dong Huang, Chang-Dong Wang, Yong Tang

    Abstract: Despite significant progress, previous multi-view unsupervised feature selection methods mostly suffer from two limitations. First, they generally utilize either cluster structure or similarity structure to guide the feature selection, neglecting the possibility of a joint formulation with mutual benefits. Second, they often learn the similarity structure by either global structure learning or…

    Submitted 11 August, 2023; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: To appear in IEEE Transactions on Emerging Topics in Computational Intelligence

  24. arXiv:2204.07615  [pdf, other]

    cs.LG stat.ML

    TabNAS: Rejection Sampling for Neural Architecture Search on Tabular Datasets

    Authors: Chengrun Yang, Gabriel Bender, Hanxiao Liu, Pieter-Jan Kindermans, Madeleine Udell, Yifeng Lu, Quoc Le, Da Huang

    Abstract: The best neural architecture for a given machine learning problem depends on many factors: not only the complexity and structure of the dataset, but also on resource constraints including latency, compute, energy consumption, etc. Neural architecture search (NAS) for tabular datasets is an important but under-explored problem. Previous NAS algorithms designed for image search spaces incorporate re…

    Submitted 20 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2022; 30 pages, 15 figures, 7 tables

  25. arXiv:2204.05552  [pdf]

    stat.AP

    The Effects of Dynamic Learning and the Forgetting Process on an Optimizing Modelling for Full-Service Repair Pricing Contracts for Medical Devices

    Authors: Aiping Jiang, Lin Li, Xuemin Xu, David Y. C. Huang

    Abstract: In order to improve the profitability and customer service management of original equipment manufacturers (OEMs) in a market where full-service (FS) and on-call service (OS) co-exist, this article extends the optimizing modelling for pricing FS repair contracts with the effects of dynamic learning and forgetting. Along with considering autonomous learning in maintenance practice, this study also a…

    Submitted 12 April, 2022; originally announced April 2022.

  26. Fast Multi-view Clustering via Ensembles: Towards Scalability, Superiority, and Simplicity

    Authors: Dong Huang, Chang-Dong Wang, Jian-Huang Lai

    Abstract: Despite significant progress, there remain three limitations to the previous multi-view clustering algorithms. First, they often suffer from high computational complexity, restricting their feasibility for large-scale datasets. Second, they typically fuse multi-view information via one-stage fusion, neglecting the possibilities in multi-stage fusions. Third, dataset-specific hyperparameter-tuning…

    Submitted 24 January, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: To appear in IEEE Transactions on Knowledge and Data Engineering

  27. arXiv:2203.02709  [pdf, other]

    stat.ME

    A Wasserstein distance-based spectral clustering method for transaction data analysis

    Authors: Yingqiu Zhu, Danyang Huang, Bo Zhang

    Abstract: With the rapid development of online payment platforms, it is now possible to record massive transaction data. Clustering on transaction data significantly contributes to analyzing merchants' behavior patterns. This enables payment platforms to provide differentiated services or implement risk management strategies. However, traditional methods exploit transactions by generating low-dimensional fe…

    Submitted 16 February, 2023; v1 submitted 5 March, 2022; originally announced March 2022.

  28. arXiv:2110.13613  [pdf, other]

    cs.SI stat.AP stat.ME

    Subsampling Spectral Clustering for Large-Scale Social Networks

    Authors: Jiayi Deng, Yi Ding, Yingqiu Zhu, Danyang Huang, Bingyi Jing, Bo Zhang

    Abstract: Online social network platforms such as Twitter and Sina Weibo have been extremely popular over the past 20 years. Identifying the network community of a social platform is essential to exploring and understanding the users' interests. However, the rapid development of science and technology has generated large amounts of social network data, creating great computational challenges for community d…

    Submitted 21 December, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

  29. arXiv:2110.10210  [pdf, other]

    math.PR cs.LG stat.ML

    Long Random Matrices and Tensor Unfolding

    Authors: Gérard Ben Arous, Daniel Zhengyu Huang, Jiaoyang Huang

    Abstract: In this paper, we consider the singular values and singular vectors of low rank perturbations of large rectangular random matrices, in the regime where the matrix is "long": we allow the number of rows (columns) to grow polynomially in the number of columns (rows). We prove there exists a critical signal-to-noise ratio (depending on the dimensions of the matrix), and the extreme singular values and sing…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: 29 pages, 4 figures

  30. arXiv:2108.00543  [pdf]

    stat.AP

    Statistical Learning in Preclinical Drug Proarrhythmic Assessment

    Authors: Nan Milex Xi, Yu-Yi Hsu, Qianyu Dang, Dalong Patrick Huang

    Abstract: Torsades de pointes (TdP) is an irregular heart rhythm characterized by faster beat rates that could potentially lead to sudden cardiac death. Much effort has been invested in understanding the drug-induced TdP in preclinical studies. However, a comprehensive statistical learning framework that can accurately predict the drug-induced TdP risk from preclinical data is still lacking. We proposed ordi…

    Submitted 7 January, 2022; v1 submitted 1 August, 2021; originally announced August 2021.

  31. arXiv:2009.08435  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Large Norms of CNN Layers Do Not Hurt Adversarial Robustness

    Authors: Youwei Liang, Dong Huang

    Abstract: Since the Lipschitz properties of convolutional neural networks (CNNs) are widely considered to be related to adversarial robustness, we theoretically characterize the $\ell_1$ norm and $\ell_\infty$ norm of 2D multi-channel convolutional layers and provide efficient methods to compute the exact $\ell_1$ norm and $\ell_\infty$ norm. Based on our theorem, we propose a novel regularization method te…

    Submitted 15 August, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: AAAI 2021, including Appendix, 15 pages, 4 figures

  32. arXiv:2008.10208  [pdf, other]

    cs.LG cs.CV stat.ML

    Multi-view Graph Learning by Joint Modeling of Consistency and Inconsistency

    Authors: Youwei Liang, Dong Huang, Chang-Dong Wang, Philip S. Yu

    Abstract: Graph learning has emerged as a promising technique for multi-view clustering with its ability to learn a unified and robust graph from multiple views. However, existing graph learning methods mostly focus on the multi-view consistency issue, yet often neglect the inconsistency across multiple views, which makes them vulnerable to possibly low-quality or noisy datasets. To overcome this limitation…

    Submitted 3 July, 2021; v1 submitted 24 August, 2020; originally announced August 2020.

    Comments: Preprint, under review

    ACM Class: I.5.3; I.5.1

  33. arXiv:2004.03260  [pdf, other]

    stat.ML cs.LG

    Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation

    Authors: Yingqiu Zhu, Yu Chen, Danyang Huang, Bo Zhang, Hansheng Wang

    Abstract: In deep learning tasks, the learning rate determines the update step size in each iteration, which plays a critical role in gradient-based optimization. However, the determination of the appropriate learning rate in practice typically relies on subjective judgement. In this work, we propose a novel optimization method based on local quadratic approximation (LQA). In each update step, given the gr…

    Submitted 7 April, 2020; originally announced April 2020.

    Comments: 10 pages, 5 figures

    MSC Class: 62-08; 41A99

    ACM Class: G.0; I.0

  34. arXiv:2004.02414  [pdf, other]

    stat.ME

    Efficient Estimation for Generalized Linear Models on a Distributed System with Nonrandomly Distributed Data

    Authors: Feifei Wang, Danyang Huang, Yingqiu Zhu, Hansheng Wang

    Abstract: Distributed systems have been widely used in practice to accomplish data analysis tasks of huge scales. In this work, we target the estimation problem of generalized linear models on a distributed system with nonrandomly distributed data. We develop a Pseudo-Newton-Raphson algorithm for efficient estimation. In this algorithm, we first obtain a pilot estimator based on a small random sample col…

    Submitted 6 April, 2020; originally announced April 2020.

  35. arXiv:1911.09781  [pdf, other]

    cs.LG cs.CV stat.ML

    Beyond Synthetic Noise: Deep Learning on Controlled Noisy Labels

    Authors: Lu Jiang, Di Huang, Mason Liu, Weilong Yang

    Abstract: Performing controlled experiments on noisy data is essential in understanding deep learning across noise levels. Due to the lack of suitable datasets, previous research has only examined deep learning on controlled synthetic label noise, and real-world label noise has never been studied in a controlled setting. This paper makes three contributions. First, we establish the first benchmark of contro…

    Submitted 27 August, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: published at ICML 2020

  36. arXiv:1909.04503  [pdf, other]

    cs.SE cs.LG stat.ML

    ArduCode: Predictive Framework for Automation Engineering

    Authors: Arquimedes Canedo, Palash Goyal, Di Huang, Amit Pandey, Gustavo Quiros

    Abstract: Automation engineering is the task of integrating, via software, various sensors, actuators, and controls for automating a real-world process. Today, automation engineering is supported by a suite of software tools including integrated development environments (IDE), hardware configurators, compilers, and runtimes. These tools focus on the automation code itself, but leave the automation engineer…

    Submitted 6 July, 2020; v1 submitted 6 September, 2019; originally announced September 2019.

  37. arXiv:1909.02811  [pdf, other]

    cs.SI cs.LG stat.ML

    Graph Representation Ensemble Learning

    Authors: Palash Goyal, Di Huang, Sujit Rokka Chhetri, Arquimedes Canedo, Jaya Shree, Evan Patterson

    Abstract: Representation learning on graphs has been gaining attention due to its wide applicability in predicting missing links, and classifying and recommending nodes. Most embedding methods aim to preserve certain properties of the original graph in the low dimensional space. However, real world graphs have a combination of several properties which are difficult to characterize and capture by a single ap…

    Submitted 12 September, 2019; v1 submitted 6 September, 2019; originally announced September 2019.

  38. arXiv:1905.11669  [pdf, other]

    cs.LG stat.ML

    CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks

    Authors: Weicheng Li, Rui Wang, Zhongzhi Luan, Di Huang, Zidong Du, Yunji Chen, Depei Qian

    Abstract: Convolutional Neural Network (CNN) based Deep Learning (DL) has achieved great progress in many real-life applications. Meanwhile, due to the complex model structures against strict latency and memory restriction, the implementation of CNN models on the resource-limited platforms is becoming more challenging. This work proposes a solution, called CompactNet\footnote{Project URL: \url{https://githu…

    Submitted 28 May, 2019; originally announced May 2019.

  39. arXiv:1904.10171  [pdf]

    cs.RO cs.LG stat.ML

    Driving Decision and Control for Autonomous Lane Change based on Deep Reinforcement Learning

    Authors: Tianyu Shi, Pin Wang, Xuxin Cheng, Ching-Yao Chan, Ding Huang

    Abstract: We apply Deep Q-network (DQN) with the consideration of safety during the task for deciding whether to conduct the maneuver. Furthermore, we design two similar Deep Q learning frameworks with quadratic approximator for deciding how to select a comfortable gap and just follow the preceding vehicle. Finally, a polynomial lane change trajectory is generated and Pure Pursuit Control is implemented for…

    Submitted 30 July, 2019; v1 submitted 23 April, 2019; originally announced April 2019.

    Comments: This Paper has been submitted to ITSC 2019

  40. arXiv:1903.02806  [pdf, ps, other]

    stat.ME

    Relaxing the Assumptions of Knockoffs by Conditioning

    Authors: Dongming Huang, Lucas Janson

    Abstract: The recent paper Candès et al. (2018) introduced model-X knockoffs, a method for variable selection that provably and non-asymptotically controls the false discovery rate with no restrictions or assumptions on the dimensionality of the data or the conditional distribution of the response given the covariates. The one requirement for the procedure is that the covariate samples are drawn independent…

    Submitted 12 June, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

    MSC Class: 62G10; 62B05; 62J02

  41. Ultra-Scalable Spectral Clustering and Ensemble Clustering

    Authors: Dong Huang, Chang-Dong Wang, Jian-Sheng Wu, Jian-Huang Lai, Chee-Keong Kwoh

    Abstract: This paper focuses on scalability and robustness of spectral clustering for extremely large-scale datasets with limited resources. Two novel algorithms are proposed, namely, ultra-scalable spectral clustering (U-SPEC) and ultra-scalable ensemble clustering (U-SENC). In U-SPEC, a hybrid representative selection strategy and a fast approximation method for K-nearest representatives are proposed for…

    Submitted 5 March, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: To appear in IEEE Transactions on Knowledge and Data Engineering, 2019

  42. arXiv:1902.09757  [pdf, other]

    cs.LG cs.IR cs.SI stat.ML

    Interaction-aware Factorization Machines for Recommender Systems

    Authors: Fuxing Hong, Dongbo Huang, Ge Chen

    Abstract: Factorization Machine (FM) is a widely used supervised learning approach that effectively models feature interactions. Despite the successful application of FM and its many deep learning variants, treating every feature interaction fairly may degrade the performance. For example, the interactions of a useless feature may introduce noise; the importance of a feature may also differ when interac…

    Submitted 26 February, 2019; originally announced February 2019.

  43. arXiv:1811.02172  [pdf, other]

    cs.CL cs.LG stat.ML

    Neural Phrase-to-Phrase Machine Translation

    Authors: Jiangtao Feng, Lingpeng Kong, Po-Sen Huang, Chong Wang, Da Huang, Jiayuan Mao, Kan Qiao, Dengyong Zhou

    Abstract: In this paper, we propose Neural Phrase-to-Phrase Machine Translation (NP$^2$MT). Our model uses a phrase attention mechanism to discover relevant input (source) segments that are used by a decoder to generate output (target) phrases. We also design an efficient dynamic programming algorithm to decode segments that allows the model to be trained faster than the existing neural phrase-based machine…

    Submitted 6 November, 2018; originally announced November 2018.

  44. Enhanced Ensemble Clustering via Fast Propagation of Cluster-wise Similarities

    Authors: Dong Huang, Chang-Dong Wang, Hongxing Peng, Jianhuang Lai, Chee-Keong Kwoh

    Abstract: Ensemble clustering has been a popular research topic in data mining and machine learning. Despite its significant progress in recent years, there are still two challenging issues in the current ensemble clustering research. First, most of the existing algorithms tend to investigate the ensemble information at the object-level, yet often lack the ability to explore the rich information at higher l…

    Submitted 30 October, 2018; originally announced October 2018.

    Comments: To appear in IEEE Transactions on Systems, Man, and Cybernetics: Systems. The MATLAB source code of this work is available at: http://www.researchgate.net/publication/328581758

  45. arXiv:1810.03064  [pdf, other]

    cs.LG cs.AI stat.ML

    CSI-Net: Unified Human Body Characterization and Pose Recognition

    Authors: Fei Wang, Jinsong Han, Shiyuan Zhang, Xu He, Dong Huang

    Abstract: We build CSI-Net, a unified Deep Neural Network (DNN), to learn the representation of WiFi signals. Using CSI-Net, we jointly solved two body characterization problems: biometrics estimation (including body fat, muscle, water, and bone rates) and person recognition. We also demonstrated the application of CSI-Net on two distinctive pose recognition tasks: the hand sign recognition (fine-scaled act…

    Submitted 22 January, 2019; v1 submitted 6 October, 2018; originally announced October 2018.

    Comments: 14 pages, 6 figures and 10 tables

  46. arXiv:1808.00079  [pdf, other]

    cs.LG cs.DC stat.ML

    Optimal Gradient Checkpoint Search for Arbitrary Computation Graphs

    Authors: Jianwei Feng, Dong Huang

    Abstract: Deep Neural Networks (DNNs) require huge GPU memory when training on modern image/video databases. Unfortunately, the GPU memory is physically finite, which limits the image resolutions and batch sizes that could be used in training for better DNN performance. Unlike solutions that require physically upgrading GPUs, the Gradient CheckPointing (GCP) training trades computation for more memory beyond ex…

    Submitted 18 March, 2021; v1 submitted 31 July, 2018; originally announced August 2018.

  47. arXiv:1807.04369  [pdf, other]

    cs.CR cs.LG stat.ML

    Differentially-Private "Draw and Discard" Machine Learning

    Authors: Vasyl Pihur, Aleksandra Korolova, Frederick Liu, Subhash Sankuratripati, Moti Yung, Dachuan Huang, Ruogu Zeng

    Abstract: In this work, we propose a novel framework for privacy-preserving client-distributed machine learning. It is motivated by the desire to achieve differential privacy guarantees in the local model of privacy in a way that satisfies all systems constraints using asynchronous client-server communication and provides attractive model learning properties. We call it "Draw and Discard" because it relies…

    Submitted 10 October, 2018; v1 submitted 11 July, 2018; originally announced July 2018.

  48. arXiv:1806.04166  [pdf, other]

    cs.LG cs.CV stat.ML

    Learning to Decompose and Disentangle Representations for Video Prediction

    Authors: Jun-Ting Hsieh, Bingbin Liu, De-An Huang, Li Fei-Fei, Juan Carlos Niebles

    Abstract: Our goal is to predict future video frames given a sequence of input frames. Despite large amounts of video data, this remains a challenging task because of the high-dimensionality of video frames. We address this challenge by proposing the Decompositional Disentangled Predictive Auto-Encoder (DDPAE), a framework that combines structured probabilistic models and deep networks to automatically (i)…

    Submitted 17 October, 2018; v1 submitted 11 June, 2018; originally announced June 2018.

  49. arXiv:1806.00608  [pdf, other]

    cs.LG cs.AI cs.LO stat.ML

    GamePad: A Learning Environment for Theorem Proving

    Authors: Daniel Huang, Prafulla Dhariwal, Dawn Song, Ilya Sutskever

    Abstract: In this paper, we introduce a system called GamePad that can be used to explore the application of machine learning methods to theorem proving in the Coq proof assistant. Interactive theorem provers such as Coq enable users to construct machine-checkable proofs in a step-by-step manner. Hence, they provide an opportunity to explore theorem proving with human supervision. We use GamePad to synthesi…

    Submitted 21 December, 2018; v1 submitted 2 June, 2018; originally announced June 2018.

  50. arXiv:1609.06789  [pdf, ps, other]

    stat.ME

    Krigings Over Space and Time Based on Latent Low-Dimensional Structures

    Authors: Da Huang, Qiwei Yao, Rongmao Zhang

    Abstract: We propose a new approach to represent nonparametrically the linear dependence structure of a spatio-temporal process in terms of latent common factors. Though it is formally similar to the existing reduced rank approximation methods (Section 7.1.3 of Cressie and Wikle, 2011), the fundamental difference is that the low-dimensional structure is completely unknown in our setting, which is learned fr…

    Submitted 18 March, 2018; v1 submitted 21 September, 2016; originally announced September 2016.

    Comments: 35 pages, 2 figures