Skip to main content

Showing 1–50 of 71 results for author: Zhao, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.13876  [pdf, other

    stat.ME

    An Empirical Bayes Jackknife Regression Framework for Covariance Matrix Estimation

    Authors: Huqin Xin, Sihai Dave Zhao

    Abstract: Covariance matrix estimation, a classical statistical topic, poses significant challenges when the sample size is comparable to or smaller than the number of features. In this paper, we frame covariance matrix estimation as a compound decision problem and apply an optimal decision rule to estimate covariance parameters. To approximate this rule, we introduce an algorithm that integrates jackknife… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 13 pages, 3 figures

    MSC Class: 62C25

  2. arXiv:2404.10004  [pdf

    cs.LG physics.soc-ph stat.AP

    A Strategy Transfer and Decision Support Approach for Epidemic Control in Experience Shortage Scenarios

    Authors: X. Xiao, P. Chen, X. Cao, K. Liu, L. Deng, D. Zhao, Z. Chen, Q. Deng, F. Yu, H. Zhang

    Abstract: Epidemic outbreaks can cause critical health concerns and severe global economic crises. For countries or regions with new infectious disease outbreaks, it is essential to generate preventive strategies by learning lessons from others with similar risk profiles. A Strategy Transfer and Decision Support Approach (STDSA) is proposed based on the profile similarity evaluation. There are four steps in… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 20 pages, 9 figures

  3. arXiv:2310.07187  [pdf, other

    stat.ML cs.LG

    Kernel Cox partially linear regression: building predictive models for cancer patients' survival

    Authors: Yaohua Rong, Sihai Dave Zhao, Xia Zheng, Yi Li

    Abstract: Wide heterogeneity exists in cancer patients' survival, ranging from a few months to several decades. To accurately predict clinical outcomes, it is vital to build an accurate predictive model that relates patients' molecular profiles with patients' survival. With complex relationships between survival and high-dimensional molecular predictors, it is challenging to conduct non-parametric modeling… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  4. arXiv:2306.07239  [pdf, ps, other

    stat.ME

    Nonparametric empirical Bayes biomarker imputation and estimation

    Authors: Alton Barbehenn, Sihai Dave Zhao

    Abstract: Biomarkers are often measured in bulk to diagnose patients, monitor patient conditions, and research novel drug pathways. The measurement of these biomarkers often suffers from detection limits that result in missing and untrustworthy measurements. Frequently, missing biomarkers are imputed so that down-stream analysis can be conducted with modern statistical methods that cannot normally handle da… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  5. arXiv:2306.00342  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Combining Explicit and Implicit Regularization for Efficient Learning in Deep Networks

    Authors: Dan Zhao

    Abstract: Works on implicit regularization have studied gradient trajectories during the optimization process to explain why deep networks favor certain kinds of solutions over others. In deep linear networks, it has been shown that gradient descent implicitly regularizes toward low-rank solutions on matrix completion/factorization tasks. Adding depth not only improves performance on these tasks but also ac… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Journal ref: Advances in Neural Information Processing Systems 35 (NeurIPS 2022), 3024--3038

  6. arXiv:2302.02092  [pdf, other

    cs.LG stat.ML

    Interpolation for Robust Learning: Data Augmentation on Wasserstein Geodesics

    Authors: Jiacheng Zhu, Jielin Qiu, Aritra Guha, Zhuolin Yang, Xuanlong Nguyen, Bo Li, Ding Zhao

    Abstract: We propose to study and promote the robustness of a model as per its performance through the interpolation of training data distributions. Specifically, (1) we augment the data by finding the worst-case Wasserstein barycenter on the geodesic connecting subpopulation distributions of different categories. (2) We regularize the model for smoother performance on the continuous geodesic path connectin… ▽ More

    Submitted 28 August, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: 34 pages, 3 figures, 18 tables

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:43129-43157, 2023

  7. Choosing statistical models to assess biological interaction as a departure from additivity of effects

    Authors: David M. Thompson, Yan Daniel Zhao

    Abstract: Vanderweele and Knol define biological interaction as an instance wherein "two exposures physically interact to bring about the outcome." A hallmark of biological interaction is that the total effect, produced when factors act together, differs from the sum of effects when the factors operate independently. Epidemiologists construct statistical models to assess biological interaction. The form of… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: 29 pages, 3 tables, 3 figures

  8. Dynamic Global Sensitivity for Differentially Private Contextual Bandits

    Authors: Huazheng Wang, David Zhao, Hongning Wang

    Abstract: Bandit algorithms have become a reference solution for interactive recommendation. However, as such algorithms directly interact with users for improved recommendations, serious privacy concerns have been raised regarding its practical use. In this work, we propose a differentially private linear contextual bandit algorithm, via a tree-based mechanism to add Laplace or Gaussian noise to model para… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: RecSys 2022

  9. arXiv:2208.01220  [pdf, other

    stat.ML cs.LG eess.SP

    GeoECG: Data Augmentation via Wasserstein Geodesic Perturbation for Robust Electrocardiogram Prediction

    Authors: Jiacheng Zhu, Jielin Qiu, Zhuolin Yang, Douglas Weber, Michael A. Rosenberg, Emerson Liu, Bo Li, Ding Zhao

    Abstract: There has been an increased interest in applying deep neural networks to automatically interpret and analyze the 12-lead electrocardiogram (ECG). The current paradigms with machine learning methods are often limited by the amount of labeled data. This phenomenon is particularly problematic for clinically-relevant data, where labeling at scale can be time-consuming and costly in terms of the specia… ▽ More

    Submitted 10 August, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: 26 pages, Figure 13, Machine Learning for Healthcare 2022

    Journal ref: Machine Learning for Healthcare 2022, JMLR Volume 182

  10. arXiv:2207.09081  [pdf, other

    cs.LG cs.AI cs.RO stat.ME

    Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning

    Authors: Wenhao Ding, Haohong Lin, Bo Li, Ding Zhao

    Abstract: As a pivotal component to attaining generalizable solutions in human intelligence, reasoning provides great potential for reinforcement learning (RL) agents' generalization towards varied goals by summarizing part-to-whole arguments and discovering cause-and-effect relations. However, how to discover and represent causalities remains a huge gap that hinders the development of causal RL. In this pa… ▽ More

    Submitted 17 May, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted to NeurIPS 2022

  11. arXiv:2205.14568  [pdf, other

    stat.ML astro-ph.IM cs.LG stat.ME

    Conditionally Calibrated Predictive Distributions by Probability-Probability Map: Application to Galaxy Redshift Estimation and Probabilistic Forecasting

    Authors: Biprateep Dey, David Zhao, Jeffrey A. Newman, Brett H. Andrews, Rafael Izbicki, Ann B. Lee

    Abstract: Uncertainty quantification is crucial for assessing the predictive ability of AI algorithms. Much research has been devoted to describing the predictive distribution (PD) $F(y|\mathbf{x})$ of a target variable $y \in \mathbb{R}$ given complex input features $\mathbf{x} \in \mathcal{X}$. However, off-the-shelf PDs (from, e.g., normalizing flows and Bayesian neural networks) often lack conditional c… ▽ More

    Submitted 17 July, 2023; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: 21 pages, 11 figures. Under review. Code available as a Python package https://github.com/lee-group-cmu/Cal-PIT

  12. arXiv:2204.02351  [pdf, other

    cs.LG cs.RO stat.ME

    Test Against High-Dimensional Uncertainties: Accelerated Evaluation of Autonomous Vehicles with Deep Importance Sampling

    Authors: Mansur Arief, Zhepeng Cen, Zhenyuan Liu, Zhiyuang Huang, Henry Lam, Bo Li, Ding Zhao

    Abstract: Evaluating the performance of autonomous vehicles (AV) and their complex subsystems to high precision under naturalistic circumstances remains a challenge, especially when failure or dangerous cases are rare. Rarity does not only require an enormous sample size for a naive method to achieve high confidence estimation, but it also causes dangerous underestimation of the true failure rate and it is… ▽ More

    Submitted 5 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  13. arXiv:2203.12595  [pdf, other

    eess.SP cs.LG stat.ML

    PhysioMTL: Personalizing Physiological Patterns using Optimal Transport Multi-Task Regression

    Authors: Jiacheng Zhu, Gregory Darnell, Agni Kumar, Ding Zhao, Bo Li, Xuanlong Nguyen, Shirley You Ren

    Abstract: Heart rate variability (HRV) is a practical and noninvasive measure of autonomic nervous system activity, which plays an essential role in cardiovascular health. However, using HRV to assess physiology status is challenging. Even in clinical settings, HRV is sensitive to acute stressors such as physical activity, mental stress, hydration, alcohol, and sleep. Wearable devices provide convenient HRV… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: 20 pages, 9 figures, accepted by CHIL 2022

  14. arXiv:2111.14829  [pdf, other

    cs.LG stat.ML

    Nonparametric Topological Layers in Neural Networks

    Authors: Dongfang Zhao

    Abstract: Various topological techniques and tools have been applied to neural networks in terms of network complexity, explainability, and performance. One fundamental assumption of this line of research is the existence of a global (Euclidean) coordinate system upon which the topological layer is constructed. Despite promising results, such a \textit{topologization} method has yet to be widely adopted bec… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

  15. arXiv:2111.02204  [pdf, other

    stat.ME stat.ML

    Certifiable Deep Importance Sampling for Rare-Event Simulation of Black-Box Systems

    Authors: Mansur Arief, Yuanlu Bai, Wenhao Ding, Shengyi He, Zhiyuan Huang, Henry Lam, Ding Zhao

    Abstract: Rare-event simulation techniques, such as importance sampling (IS), constitute powerful tools to speed up challenging estimation of rare catastrophic events. These techniques often leverage the knowledge and analysis on underlying system structures to endow desirable efficiency guarantees. However, black-box problems, especially those arising from recent safety-critical applications of AI-driven p… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: The conference version of this paper has appeared in AISTATS 2021 (arXiv:2006.15722)

  16. arXiv:2110.15209  [pdf, other

    astro-ph.IM cs.LG stat.ME stat.ML

    Re-calibrating Photometric Redshift Probability Distributions Using Feature-space Regression

    Authors: Biprateep Dey, Jeffrey A. Newman, Brett H. Andrews, Rafael Izbicki, Ann B. Lee, David Zhao, Markus Michael Rau, Alex I. Malz

    Abstract: Many astrophysical analyses depend on estimates of redshifts (a proxy for distance) determined from photometric (i.e., imaging) data alone. Inaccurate estimates of photometric redshift uncertainties can result in large systematic errors. However, probability distribution outputs from many photometric redshift methods do not follow the frequentist definition of a Probability Density Function (PDF)… ▽ More

    Submitted 27 January, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Fourth Workshop on Machine Learning and the Physical Sciences (NeurIPS 2021)

  17. arXiv:2110.12573  [pdf, other

    math.ST stat.ME

    Over-Conservativeness of Variance-Based Efficiency Criteria and Probabilistic Efficiency in Rare-Event Simulation

    Authors: Yuanlu Bai, Zhiyuan Huang, Henry Lam, Ding Zhao

    Abstract: In rare-event simulation, an importance sampling (IS) estimator is regarded as efficient if its relative error, namely the ratio between its standard deviation and mean, is sufficiently controlled. It is widely known that when a rare-event set contains multiple "important regions" encoded by the so-called dominating points, IS needs to account for all of them via mixing to achieve efficiency. We a… ▽ More

    Submitted 28 October, 2022; v1 submitted 24 October, 2021; originally announced October 2021.

  18. arXiv:2110.04137   

    stat.AP

    A surrogate-based reliability analysis method of the motion of large flexible space structures

    Authors: Dongyu Zhao

    Abstract: Satellites and their instruments are subject to the motion stability throughout their lifetimes. The reliability of the large flexible space structures (LFSS) is particularly important for the motion stability of satellites and their instruments. In this paper, the reliability analysis of large flexible space structures is conducted based on Bayesian support vector regression (SVR). The kinematic… ▽ More

    Submitted 18 April, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: The paper is not completed and should be revised

  19. arXiv:2107.03920  [pdf, other

    stat.ML cs.LG

    Likelihood-Free Frequentist Inference: Bridging Classical Statistics and Machine Learning for Reliable Simulator-Based Inference

    Authors: Niccolò Dalmasso, Luca Masserano, David Zhao, Rafael Izbicki, Ann B. Lee

    Abstract: Many areas of science make extensive use of computer simulators that implicitly encode intractable likelihood functions of complex systems. Classical statistical methods are poorly suited for these so-called likelihood-free inference (LFI) settings, especially outside asymptotic and low-dimensional regimes. At the same time, traditional LFI methods - such as Approximate Bayesian Computation or mor… ▽ More

    Submitted 19 November, 2023; v1 submitted 8 July, 2021; originally announced July 2021.

    Comments: 45 pages, 6 figures, code available at https://github.com/lee-group-cmu/lf2i, supplementary material available at https://lucamasserano.github.io/data/LF2I_supplementary_material.pdf

  20. arXiv:2107.03280  [pdf, other

    stat.ML cs.LG

    MD-split+: Practical Local Conformal Inference in High Dimensions

    Authors: Benjamin LeRoy, David Zhao

    Abstract: Quantifying uncertainty in model predictions is a common goal for practitioners seeking more than just point predictions. One tool for uncertainty quantification that requires minimal assumptions is conformal inference, which can help create probabilistically valid prediction regions for black box models. Classical conformal prediction only provides marginal validity, whereas in many situations lo… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Appearing in ICML 2021 workshop on distribution-free uncertainty quantification

  21. arXiv:2104.08970  [pdf, other

    stat.ME

    Linear shrinkage for predicting responses in large-scale multivariate linear regression

    Authors: Yihe Wang, Sihai Dave Zhao

    Abstract: We propose a new prediction method for multivariate linear regression problems where the number of features is less than the sample size but the number of outcomes is extremely large. Many popular procedures, such as penalized regression procedures, require parameter tuning that is computationally untenable in such large-scale problems. We take a different approach, motivated by ideas from simulta… ▽ More

    Submitted 18 April, 2021; originally announced April 2021.

  22. arXiv:2104.08157  [pdf, other

    cs.LG stat.ME

    Capturing patterns of variation unique to a specific dataset

    Authors: Robin Tu, Alexander H. Foss, Sihai D. Zhao

    Abstract: Capturing patterns of variation present in a dataset is important in exploratory data analysis and unsupervised learning. Contrastive dimension reduction methods, such as contrastive principal component analysis (cPCA), find patterns unique to a target dataset of interest by contrasting with a carefully chosen background dataset representing unwanted or uninteresting variation. However, such metho… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  23. arXiv:2102.10473  [pdf, other

    stat.ME

    Diagnostics for Conditional Density Models and Bayesian Inference Algorithms

    Authors: David Zhao, Niccolò Dalmasso, Rafael Izbicki, Ann B. Lee

    Abstract: There has been growing interest in the AI community for precise uncertainty quantification. Conditional density models f(y|x), where x represents potentially high-dimensional features, are an integral part of uncertainty quantification in prediction and Bayesian inference. However, it is challenging to assess conditional density estimates and gain insight into modes of failure. While existing diag… ▽ More

    Submitted 23 July, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Appearing in 37th Conference on Uncertainty in Artificial Intelligence (UAI 2021), Spotlight Talk; camera-ready version

  24. arXiv:2102.03895  [pdf, other

    stat.ML cs.LG stat.AP

    Functional optimal transport: map estimation and domain adaptation for functional data

    Authors: Jiacheng Zhu, Aritra Guha, Dat Do, Mengdi Xu, XuanLong Nguyen, Ding Zhao

    Abstract: We introduce a formulation of optimal transport problem for distributions on function spaces, where the stochastic map between functional domains can be partially represented in terms of an (infinite-dimensional) Hilbert-Schmidt operator map** a Hilbert space of functions to another. For numerous machine learning tasks, data can be naturally viewed as samples drawn from spaces of functions, such… ▽ More

    Submitted 28 August, 2023; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 48 pages, 10 figures, 3 tables

  25. arXiv:2011.02147  [pdf, other

    stat.ML cs.LG

    Capped norm linear discriminant analysis and its applications

    Authors: Jiakou Liu, Xiong Xiong, Pei-Wei Ren, Da Zhao, Chun-Na Li, Yuan-Hai Shao

    Abstract: Classical linear discriminant analysis (LDA) is based on squared Frobenious norm and hence is sensitive to outliers and noise. To improve the robustness of LDA, in this paper, we introduce capped l_{2,1}-norm of a matrix, which employs non-squared l_2-norm and "capped" operation, and further propose a novel capped l_{2,1}-norm linear discriminant analysis, called CLDA. Due to the use of capped l_{… ▽ More

    Submitted 4 November, 2020; originally announced November 2020.

  26. arXiv:2010.04890  [pdf, other

    cs.LG math.ST stat.ML

    Rare-Event Simulation for Neural Network and Random Forest Predictors

    Authors: Yuanlu Bai, Zhiyuan Huang, Henry Lam, Ding Zhao

    Abstract: We study rare-event simulation for a class of problems where the target hitting sets of interest are defined via modern machine learning tools such as neural networks and random forests. This problem is motivated from fast emerging studies on the safety evaluation of intelligent systems, robustness quantification of learning models, and other potential applications to large-scale simulation in whi… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  27. arXiv:2009.09822  [pdf, other

    cs.DB cs.LG stat.ML

    TODS: An Automated Time Series Outlier Detection System

    Authors: Kwei-Herng Lai, Daochen Zha, Guanchu Wang, Junjie Xu, Yue Zhao, Devesh Kumar, Yile Chen, Purav Zumkhawaka, Minyang Wan, Diego Martinez, Xia Hu

    Abstract: We present TODS, an automated Time Series Outlier Detection System for research and industrial applications. TODS is a highly modular system that supports easy pipeline construction. The basic building block of TODS is primitive, which is an implementation of a function with hyperparameters. TODS currently supports 70 primitives, including data processing, time series processing, feature analysis,… ▽ More

    Submitted 7 January, 2021; v1 submitted 18 September, 2020; originally announced September 2020.

    Comments: Accepted by AAAI'21 demo track

  28. arXiv:2009.08886  [pdf, other

    cs.CV stat.ML

    BNAS-v2: Memory-efficient and Performance-collapse-prevented Broad Neural Architecture Search

    Authors: Zixiang Ding, Yaran Chen, Nannan Li, Dongbin Zhao

    Abstract: In this paper, we propose BNAS-v2 to further improve the efficiency of NAS, embodying both superiorities of BCNN simultaneously. To mitigate the unfair training issue of BNAS, we employ continuous relaxation strategy to make each edge of cell in BCNN relevant to all candidate operations for over-parameterized BCNN construction. Moreover, the continuous relaxation strategy relaxes the choice of a c… ▽ More

    Submitted 25 January, 2021; v1 submitted 18 September, 2020; originally announced September 2020.

    Comments: 12 pages, 11 figures, 3 tables

  29. arXiv:2009.08311  [pdf, other

    cs.LG cs.RO stat.ML

    Multimodal Safety-Critical Scenarios Generation for Decision-Making Algorithms Evaluation

    Authors: Wenhao Ding, Baiming Chen, Bo Li, Kim Ji Eun, Ding Zhao

    Abstract: Existing neural network-based autonomous systems are shown to be vulnerable against adversarial attacks, therefore sophisticated evaluation on their robustness is of great importance. However, evaluating the robustness only under the worst-case scenarios based on known attacks is not comprehensive, not to mention that some of them even rarely occur in the real world. In addition, the distribution… ▽ More

    Submitted 26 December, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: 8 pages, 7 figures

  30. arXiv:2009.07415  [pdf, other

    cs.LG stat.ML

    Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning

    Authors: Daochen Zha, Kwei-Herng Lai, Mingyang Wan, Xia Hu

    Abstract: High false-positive rate is a long-standing challenge for anomaly detection algorithms, especially in high-stake applications. To identify the true anomalies, in practice, analysts or domain experts will be employed to investigate the top instances one by one in a ranked list of anomalies identified by an anomaly detection system. This verification procedure generates informative labels that can b… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

    Comments: Accepted by ICDM 2020

  31. arXiv:2006.15722  [pdf, other

    cs.LG stat.ML

    Deep Probabilistic Accelerated Evaluation: A Robust Certifiable Rare-Event Simulation Methodology for Black-Box Safety-Critical Systems

    Authors: Mansur Arief, Zhiyuan Huang, Guru Koushik Senthil Kumar, Yuanlu Bai, Shengyi He, Wenhao Ding, Henry Lam, Ding Zhao

    Abstract: Evaluating the reliability of intelligent physical systems against rare safety-critical events poses a huge testing burden for real-world applications. Simulation provides a useful platform to evaluate the extremal risks of these systems before their deployments. Importance Sampling (IS), while proven to be powerful for rare-event simulation, faces challenges in handling these learning-based syste… ▽ More

    Submitted 8 March, 2021; v1 submitted 28 June, 2020; originally announced June 2020.

  32. arXiv:2006.15097  [pdf, other

    cs.LG stat.ML

    Policy-GNN: Aggregation Optimization for Graph Neural Networks

    Authors: Kwei-Herng Lai, Daochen Zha, Kaixiong Zhou, Xia Hu

    Abstract: Graph data are pervasive in many real-world applications. Recently, increasing attention has been paid on graph neural networks (GNNs), which aim to model the local graph structures and capture the hierarchical patterns by aggregating the information from neighbors with stackable network modules. Motivated by the observation that different nodes often require different iterations of aggregation to… ▽ More

    Submitted 26 June, 2020; originally announced June 2020.

    Comments: Accepted by ACM SIGKDD'20 research track

  33. arXiv:2006.11441  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes

    Authors: Mengdi Xu, Wenhao Ding, Jiacheng Zhu, Zuxin Liu, Baiming Chen, Ding Zhao

    Abstract: Continuously learning to solve unseen tasks with limited experience has been extensively pursued in meta-learning and continual learning, but with restricted assumptions such as accessible task distributions, independently and identically distributed tasks, and clear task delineations. However, real-world physical tasks frequently violate these assumptions, resulting in performance degradation. Th… ▽ More

    Submitted 30 November, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 16 pages, 6 figures

  34. arXiv:2006.11321  [pdf, other

    cs.LG stat.ML

    AutoOD: Automated Outlier Detection via Curiosity-guided Search and Self-imitation Learning

    Authors: Yuening Li, Zhengzhang Chen, Daochen Zha, Kaixiong Zhou, Haifeng **, Haifeng Chen, Xia Hu

    Abstract: Outlier detection is an important data mining task with numerous practical applications such as intrusion detection, credit card fraud detection, and video surveillance. However, given a specific complicated task with big data, the process of building a powerful deep learning based system for outlier detection still highly relies on human expertise and laboring trials. Although Neural Architecture… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

  35. arXiv:2006.10241  [pdf, other

    cs.LG cs.MA cs.RO stat.AP stat.ML

    Robust Unsupervised Learning of Temporal Dynamic Interactions

    Authors: Aritra Guha, Rayleigh Lei, Jiacheng Zhu, XuanLong Nguyen, Ding Zhao

    Abstract: Robust representation learning of temporal dynamic interactions is an important problem in robotic learning in general and automated unsupervised learning in particular. Temporal dynamic interactions can be described by (multiple) geometric trajectories in a suitable space over which unsupervised learning techniques may be applied to extract useful features from raw and high-dimensional data measu… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  36. arXiv:2006.06972  [pdf, other

    cs.LG stat.ML

    Towards Deeper Graph Neural Networks with Differentiable Group Normalization

    Authors: Kaixiong Zhou, Xiao Huang, Yuening Li, Daochen Zha, Rui Chen, Xia Hu

    Abstract: Graph neural networks (GNNs), which learn the representation of a node by aggregating its neighbors, have become an effective computational tool in downstream applications. Over-smoothing is one of the key issues which limit the performance of GNNs as the number of layers increases. It is because the stacked aggregators would make node representations converge to indistinguishable vectors. Several… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  37. arXiv:2006.05891  [pdf, other

    cs.LG stat.ML

    On Noise Injection in Generative Adversarial Networks

    Authors: Ruili Feng, Deli Zhao, Zhengjun Zha

    Abstract: Noise injection has been proved to be one of the key technique advances in generating high-fidelity images. Despite its successful usage in GANs, the mechanism of its validity is still unclear. In this paper, we propose a geometric framework to theoretically analyze the role of noise injection in GANs. Based on Riemannian geometry, we successfully model the noise injection framework as fuzzy equiv… ▽ More

    Submitted 22 May, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: Deep Learning Theory; Generative Adversarial Network; Machine Learning

  38. arXiv:2005.08479  [pdf, other

    cs.LG cs.CR stat.ML

    Large-Scale Secure XGB for Vertical Federated Learning

    Authors: Wen**g Fang, Derun Zhao, ** Tan, Chaochao Chen, Chaofan Yu, Li Wang, Lei Wang, Jun Zhou, Benyu Zhang

    Abstract: Privacy-preserving machine learning has drawn increasingly attention recently, especially with kinds of privacy regulations come into force. Under such situation, Federated Learning (FL) appears to facilitate privacy-preserving joint modeling among multiple parties. Although many federated algorithms have been extensively studied, there is still a lack of secure and practical gradient tree boostin… ▽ More

    Submitted 2 September, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: accepted by cikm21

  39. arXiv:2005.05441  [pdf, other

    cs.LG cs.MA stat.ML

    Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments

    Authors: Baiming Chen, Mengdi Xu, Zuxin Liu, Liang Li, Ding Zhao

    Abstract: Action and observation delays exist prevalently in the real-world cyber-physical systems which may pose challenges in reinforcement learning design. It is particularly an arduous task when handling multi-agent systems where the delay of one agent could spread to other agents. To resolve this problem, this paper proposes a novel framework to deal with delays as well as the non-stationary training i… ▽ More

    Submitted 28 August, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

  40. arXiv:2005.05440  [pdf, other

    cs.LG cs.AI stat.ML

    Delay-Aware Model-Based Reinforcement Learning for Continuous Control

    Authors: Baiming Chen, Mengdi Xu, Liang Li, Ding Zhao

    Abstract: Action delays degrade the performance of reinforcement learning in many real-world systems. This paper proposes a formal definition of delay-aware Markov Decision Process and proves it can be transformed into standard MDP with augmented states using the Markov reward process. We develop a delay-aware model-based reinforcement learning framework that can incorporate the multi-step delay into the le… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Journal ref: Neurocomputing Volume 450, 25 August 2021, Pages 119-128

  41. arXiv:2005.04549  [pdf, other

    stat.ME math.ST

    A Compound Decision Approach to Covariance Matrix Estimation

    Authors: Huiqin Xin, Sihai Dave Zhao

    Abstract: Covariance matrix estimation is a fundamental statistical task in many applications, but the sample covariance matrix is sub-optimal when the sample size is comparable to or less than the number of features. Such high-dimensional settings are common in modern genomics, where covariance matrix estimation is frequently employed as a method for inferring gene networks. To achieve estimation accuracy… ▽ More

    Submitted 2 June, 2022; v1 submitted 9 May, 2020; originally announced May 2020.

    Comments: 20 pages, 4 figures. Biometrics (2022)

    MSC Class: 62C12 (Primary) 62C25 (Secondary)

  42. arXiv:2003.05602  [pdf, other

    cs.LG cs.AI stat.ML

    PyODDS: An End-to-end Outlier Detection System with Automated Machine Learning

    Authors: Yuening Li, Daochen Zha, Praveen Kumar Venugopal, Na Zou, Xia Hu

    Abstract: Outlier detection is an important task for various data mining applications. Current outlier detection techniques are often manually designed for specific domains, requiring large human efforts of database setup, algorithm selection, and hyper-parameter tuning. To fill this gap, we present PyODDS, an automated end-to-end Python system for Outlier Detection with Database Support, which automaticall… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

    Comments: In Companion Proceedings of the Web Conference 2020 (WWW 20)

  43. BNAS:An Efficient Neural Architecture Search Approach Using Broad Scalable Architecture

    Authors: Zixiang Ding, Yaran Chen, Nannan Li, Dongbin Zhao, Zhiquan Sun, C. L. Philip Chen

    Abstract: In this paper, we propose Broad Neural Architecture Search (BNAS) where we elaborately design broad scalable architecture dubbed Broad Convolutional Neural Network (BCNN) to solve the above issue. On one hand, the proposed broad scalable architecture has fast training speed due to its shallow topology. Moreover, we also adopt reinforcement learning and parameter sharing used in ENAS as the optimiz… ▽ More

    Submitted 20 January, 2021; v1 submitted 18 January, 2020; originally announced January 2020.

    Comments: 15 pages, 12 figures, 5 tables

  44. arXiv:1912.10233  [pdf, other

    cs.LG cs.CV stat.ML

    Latent Variables on Spheres for Autoencoders in High Dimensions

    Authors: Deli Zhao, Jiapeng Zhu, Bo Zhang

    Abstract: Variational Auto-Encoder (VAE) has been widely applied as a fundamental generative model in machine learning. For complex samples like imagery objects or scenes, however, VAE suffers from the dimensional dilemma between reconstruction precision that needs high-dimensional latent codes and probabilistic inference that favors a low-dimensional latent space. By virtue of high-dimensional geometry, we… ▽ More

    Submitted 16 February, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

  45. arXiv:1911.11819  [pdf, other

    q-fin.TR q-fin.ST stat.ML

    Cryptocurrency Price Prediction and Trading Strategies Using Support Vector Machines

    Authors: David Zhao, Alessandro Rinaldo, Christopher Brookins

    Abstract: Few assets in financial history have been as notoriously volatile as cryptocurrencies. While the long term outlook for this asset class remains unclear, we are successful in making short term price predictions for several major crypto assets. Using historical data from July 2015 to November 2019, we develop a large number of technical indicators to capture patterns in the cryptocurrency market. We… ▽ More

    Submitted 28 November, 2019; v1 submitted 26 November, 2019; originally announced November 2019.

    Comments: Corrected typos

  46. arXiv:1910.12457  [pdf, ps, other

    stat.ME

    Estimation and inference for the indirect effect in high-dimensional linear mediation models

    Authors: Ruixuan Rachel Zhou, Liewei Wang, Sihai Dave Zhao

    Abstract: Mediation analysis is difficult when the number of potential mediators is larger than the sample size. In this paper we propose new inference procedures for the indirect effect in the presence of high-dimensional mediators for linear mediation models. We develop methods for both incomplete mediation, where a direct effect may exist, as well as complete mediation, where the direct effect is known t… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: To appear in Biometrika

  47. arXiv:1910.09323  [pdf, other

    cs.LG stat.ML

    Recurrent Attentive Neural Process for Sequential Data

    Authors: Shenghao Qin, Jiacheng Zhu, Jimmy Qin, Wenshuo Wang, Ding Zhao

    Abstract: Neural processes (NPs) learn stochastic processes and predict the distribution of target output adaptively conditioned on a context set of observed input-output pairs. Furthermore, Attentive Neural Process (ANP) improved the prediction accuracy of NPs by incorporating attention mechanism among contexts and targets. In a number of real-world applications such as robotics, finance, speech, and biolo… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: 12 pages, 6 figures, NeurIPS 2019 Workshop

  48. arXiv:1910.02575  [pdf, other

    cs.LG cs.DB stat.ML

    PyODDS: An End-to-End Outlier Detection System

    Authors: Yuening Li, Daochen Zha, Na Zou, Xia Hu

    Abstract: PyODDS is an end-to end Python system for outlier detection with database support. PyODDS provides outlier detection algorithms which meet the demands for users in different fields, w/wo data science or machine learning background. PyODDS gives the ability to execute machine learning algorithms in-database without moving data out of the database server or over the network. It also provides access… ▽ More

    Submitted 11 October, 2019; v1 submitted 6 October, 2019; originally announced October 2019.

    Comments: 6 Pages, 2 Figures

  49. arXiv:1910.00099  [pdf, other

    cs.LG cs.CV cs.RO eess.IV stat.ML

    CMTS: Conditional Multiple Trajectory Synthesizer for Generating Safety-critical Driving Scenarios

    Authors: Wenhao Ding, Mengdi Xu, Ding Zhao

    Abstract: Naturalistic driving trajectories are crucial for the performance of autonomous driving algorithms. However, most of the data is collected in safe scenarios leading to the duplication of trajectories which are easy to be handled by currently developed algorithms. When considering safety, testing algorithms in near-miss scenarios that rarely show up in off-the-shelf datasets is a vital part of the… ▽ More

    Submitted 2 October, 2019; v1 submitted 17 September, 2019; originally announced October 2019.

    Comments: Submitted to ICRA 2020, 8 pages, 7 figures

  50. arXiv:1909.07843  [pdf, other

    cs.LG cs.RO stat.ML

    Active Learning for Risk-Sensitive Inverse Reinforcement Learning

    Authors: Rui Chen, Wenshuo Wang, Zirui Zhao, Ding Zhao

    Abstract: One typical assumption in inverse reinforcement learning (IRL) is that human experts act to optimize the expected utility of a stochastic cost with a fixed distribution. This assumption deviates from actual human behaviors under ambiguity. Risk-sensitive inverse reinforcement learning (RS-IRL) bridges such gap by assuming that humans act according to a random cost with respect to a set of subjecti… ▽ More

    Submitted 23 September, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: 8 pages without acknowledgment, 7 figures, submitted to RA-L and ICRA 2020 for the IEEE Robotics and Automation Letters (RA-L)