Showing 1–39 of 39 results for author: Ding, X

Searching in archive stat.
  1. arXiv:2405.12317  [pdf, other]

    stat.ML cs.LG

    Kernel spectral joint embeddings for high-dimensional noisy datasets using duo-landmark integral operators

    Authors: Xiucai Ding, Rong Ma

    Abstract: Integrative analysis of multiple heterogeneous datasets has become standard practice in many research fields, especially in single-cell genomics and medical informatics. Existing approaches oftentimes suffer from limited power in capturing nonlinear structures, insufficient account of noisiness and effects of high-dimensionality, lack of adaptivity to signals and sample sizes imbalance, and their…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 32 pages, 5 figures; comments are welcome

  2. arXiv:2402.03701  [pdf, other]

    cs.LG stat.ML

    Improving and Unifying Discrete&Continuous-time Discrete Denoising Diffusion

    Authors: Lingxiao Zhao, Xueying Ding, Lijun Yu, Leman Akoglu

    Abstract: Discrete diffusion models have seen a surge of attention with applications on naturally discrete data such as language and graphs. Although discrete-time discrete diffusion has been established for a while, only recently Campbell et al. (2022) introduced the first framework for continuous-time discrete diffusion. However, their training and sampling processes differ significantly from the discrete…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Unify Discrete Denoising Diffusion

  3. arXiv:2402.03687  [pdf, other]

    cs.LG stat.ML

    Pard: Permutation-Invariant Autoregressive Diffusion for Graph Generation

    Authors: Lingxiao Zhao, Xueying Ding, Leman Akoglu

    Abstract: Graph generation has been dominated by autoregressive models due to their simplicity and effectiveness, despite their sensitivity to ordering. Yet diffusion models have garnered increasing attention, as they offer comparable performance while being permutation-invariant. Current graph diffusion models generate graphs in a one-shot fashion, but they require extra features and thousands of denoising…

    Submitted 23 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Diffusion Model on Graphs

  4. arXiv:2401.15778  [pdf, ps, other]

    math.ST stat.ME

    On the partial autocorrelation function for locally stationary time series: characterization, estimation and inference

    Authors: Xiucai Ding, Zhou Zhou

    Abstract: For stationary time series, it is common to use the plots of partial autocorrelation function (PACF) or PACF-based tests to explore the temporal dependence structure of such processes. To our best knowledge, such analogs for non-stationary time series have not been fully established yet. In this paper, we fill this gap for locally stationary time series with short-range dependence. First, we chara…

    Submitted 30 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: 26 pages, 6 figures

  5. arXiv:2312.10796  [pdf, other]

    stat.ME math.ST

    Two sample test for covariance matrices in ultra-high dimension

    Authors: Xiucai Ding, Yichen Hu, Zhenggang Wang

    Abstract: In this paper, we propose a new test for testing the equality of two population covariance matrices in the ultra-high dimensional setting that the dimension is much larger than the sizes of both of the two samples. Our proposed methodology relies on a data splitting procedure and a comparison of a set of well selected eigenvalues of the sample covariance matrices on the split data sets. Compared t…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 43 pages, 1 figure

  6. arXiv:2310.20699  [pdf, other]

    physics.chem-ph cs.LG physics.comp-ph physics.data-an stat.AP

    Bayesian Multistate Bennett Acceptance Ratio Methods

    Authors: Xinqiang Ding

    Abstract: The multistate Bennett acceptance ratio (MBAR) method is a prevalent approach for computing free energies of thermodynamic states. In this work, we introduce BayesMBAR, a Bayesian generalization of the MBAR method. By integrating configurations sampled from thermodynamic states with a prior distribution, BayesMBAR computes a posterior distribution of free energies. Using the posterior distribution…

    Submitted 14 February, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  7. arXiv:2309.09222  [pdf, other]

    cs.LG stat.ML

    Data-driven Modeling and Inference for Bayesian Gaussian Process ODEs via Double Normalizing Flows

    Authors: Jian Xu, Shian Du, Junmei Yang, Xinghao Ding, John Paisley, Delu Zeng

    Abstract: Recently, Gaussian processes have been used to model the vector field of continuous dynamical systems, referred to as GPODEs, which are characterized by a probabilistic ODE equation. Bayesian inference for these models has been extensively studied and applied in tasks such as time series prediction. However, the use of standard GPs with basic kernels like squared exponential kernels has been commo…

    Submitted 2 January, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

  8. arXiv:2308.13018  [pdf, other]

    stat.AP astro-ph.IM

    A Robust Bayesian Meta-Analysis for Estimating the Hubble Constant via Time Delay Cosmography

    Authors: Hyungsuk Tak, Xuheng Ding

    Abstract: We propose a Bayesian meta-analysis to infer the current expansion rate of the Universe, called the Hubble constant ($H_0$), via time delay cosmography. Inputs of the meta-analysis are estimates of two properties for each pair of gravitationally lensed images; time delay and Fermat potential difference estimates with their standard errors. A meta-analysis can be appealing in practice because obtai…

    Submitted 24 August, 2023; originally announced August 2023.

  9. arXiv:2303.03532  [pdf, ps, other]

    stat.ME

    Extreme eigenvalues of sample covariance matrices under generalized elliptical models with applications

    Authors: Xiucai Ding, Jiahui Xie, Long Yu, Wang Zhou

    Abstract: We consider the extreme eigenvalues of the sample covariance matrix $Q=YY^*$ under the generalized elliptical model that $Y=Σ^{1/2}XD.$ Here $Σ$ is a bounded $p \times p$ positive definite deterministic matrix representing the population covariance structure, $X$ is a $p \times n$ random matrix containing either independent columns sampled from the unit sphere in $\mathbb{R}^p$ or i.i.d. centered…

    Submitted 19 April, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 90 pages, 6 figures, some typos are corrected

  10. arXiv:2209.04968  [pdf, other]

    stat.CO

    Population-Based Hierarchical Non-negative Matrix Factorization for Survey Data

    Authors: Xiaofu Ding, Xinyu Dong, Olivia McGough, Chenxin Shen, Annie Ulichney, Ruiyao Xu, William Swartworth, Jocelyn T. Chi, Deanna Needell

    Abstract: Motivated by the problem of identifying potential hierarchical population structure on modern survey data containing a wide range of complex data types, we introduce population-based hierarchical non-negative matrix factorization (PHNMF). PHNMF is a variant of hierarchical non-negative matrix factorization based on feature similarity. As such, it enables an automatic and interpretable approach for…

    Submitted 11 September, 2022; originally announced September 2022.

  11. arXiv:2206.07647  [pdf, other]

    cs.LG cs.AI stat.ME

    Hyperparameter Sensitivity in Deep Outlier Detection: Analysis and a Scalable Hyper-Ensemble Solution

    Authors: Xueying Ding, Lingxiao Zhao, Leman Akoglu

    Abstract: Outlier detection (OD) literature exhibits numerous algorithms as it applies to diverse domains. However, given a new detection task, it is unclear how to choose an algorithm to use, nor how to set its hyperparameter(s) (HPs) in unsupervised settings. HP tuning is an ever-growing problem with the arrival of many new detectors based on deep learning, which usually come with a long list of HPs. Surp…

    Submitted 18 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: 19 pages, The code is available at: https://github.com/xyvivian/ROBOD

  12. arXiv:2203.00126  [pdf, other]

    stat.ML cs.LG stat.ME

    Learning Low-Dimensional Nonlinear Structures from High-Dimensional Noisy Data: An Integral Operator Approach

    Authors: Xiucai Ding, Rong Ma

    Abstract: We propose a kernel-spectral embedding algorithm for learning low-dimensional nonlinear structures from high-dimensional and noisy observations, where the datasets are assumed to be sampled from an intrinsically low-dimensional manifold and corrupted by high-dimensional noise. The algorithm employs an adaptive bandwidth selection procedure which does not rely on prior knowledge of the underlying m…

    Submitted 6 July, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: Accepted to the Annals of Statistics

  13. arXiv:2112.08545  [pdf, ps, other]

    math.ST stat.ME

    Simultaneous Sieve Inference for Time-Inhomogeneous Nonlinear Time Series Regression

    Authors: Xiucai Ding, Zhou Zhou

    Abstract: In this paper, we consider the time-inhomogeneous nonlinear time series regression for a general class of locally stationary time series. On one hand, we propose sieve nonparametric estimators for the time-varying regression functions which can achieve the min-max optimal rate. On the other hand, we develop a unified simultaneous inferential theory which can be used to conduct both structural and…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 57 pages, 8 figures

  14. arXiv:2112.00693  [pdf, ps, other]

    math.ST stat.ME

    Auto-Regressive Approximations to Non-stationary Time Series, with Inference and Applications

    Authors: Xiucai Ding, Zhou Zhou

    Abstract: Understanding the time-varying structure of complex temporal systems is one of the main challenges of modern time series analysis. In this paper, we show that every uniformly-positive-definite-in-covariance and sufficiently short-range dependent non-stationary and nonlinear time series can be well approximated globally by a white-noise-driven auto-regressive (AR) process of slowly diverging order.…

    Submitted 24 April, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Comments: Final version accepted to The Annals of Statistics. arXiv admin note: text overlap with arXiv:1912.12937

  15. arXiv:2111.10940  [pdf, ps, other]

    stat.ML cs.LG

    How do kernel-based sensor fusion algorithms behave under high dimensional noise?

    Authors: Xiucai Ding, Hau-Tieng Wu

    Abstract: We study the behavior of two kernel based sensor fusion algorithms, nonparametric canonical correlation analysis (NCCA) and alternating diffusion (AD), under the nonnull setting that the clean datasets collected from two sensors are modeled by a common low dimensional manifold embedded in a high dimensional Euclidean space and the datasets are corrupted by high dimensional noise. We establish the…

    Submitted 21 November, 2021; originally announced November 2021.

  16. Distilling and Transferring Knowledge via cGAN-generated Samples for Image Classification and Regression

    Authors: Xin Ding, Yongwei Wang, Zuheng Xu, Z. Jane Wang, William J. Welch

    Abstract: Knowledge distillation (KD) has been actively studied for image classification tasks in deep learning, aiming to improve the performance of a student based on the knowledge from a teacher. However, applying KD in image regression with a scalar response variable has been rarely studied, and there exists no KD method applicable to both classification and regression tasks yet. Moreover, existing KD m…

    Submitted 26 December, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

  17. arXiv:2103.11166  [pdf, other]

    cs.CV cs.LG stat.ML

    Efficient Subsampling of Realistic Images From GANs Conditional on a Class or a Continuous Variable

    Authors: Xin Ding, Yongwei Wang, Z. Jane Wang, William J. Welch

    Abstract: Recently, subsampling or refining images generated from unconditional GANs has been actively studied to improve the overall image quality. Unfortunately, these methods are often observed less effective or inefficient in handling conditional GANs (cGANs) -- conditioning on a class (aka class-conditional GANs) or a continuous variable (aka continuous cGANs or CcGANs). In this work, we introduce an e…

    Submitted 20 April, 2022; v1 submitted 20 March, 2021; originally announced March 2021.

  18. arXiv:2011.12508  [pdf, other]

    cs.LG stat.ML

    Causal inference using deep neural networks

    Authors: Ye Yuan, Xueying Ding, Ziv Bar-Joseph

    Abstract: Causal inference from observation data is a core problem in many scientific fields. Here we present a general supervised deep learning framework that infers causal interactions by transforming the input vectors to an image-like representation for every pair of inputs. Given a training dataset we first construct a normalized empirical probability density distribution (NEPDF) matrix. We then train a…

    Submitted 24 November, 2020; originally announced November 2020.

  19. arXiv:2011.07466  [pdf, other]

    cs.CV cs.LG stat.ML

    Continuous Conditional Generative Adversarial Networks: Novel Empirical Losses and Label Input Mechanisms

    Authors: Xin Ding, Yongwei Wang, Zuheng Xu, William J. Welch, Z. Jane Wang

    Abstract: This work proposes the continuous conditional generative adversarial network (CcGAN), the first generative model for image generation conditional on continuous, scalar conditions (termed regression labels). Existing conditional GANs (cGANs) are mainly designed for categorical conditions (eg, class labels); conditioning on regression labels is mathematically distinct and raises two fundamental prob…

    Submitted 30 October, 2023; v1 submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted by IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

  20. arXiv:2010.03700  [pdf, other]

    stat.ME stat.AP

    Multivariate functional responses low rank regression with an application to brain imaging data

    Authors: Xiucai Ding, Dengdeng Yu, Zhengwu Zhang, Dehan Kong

    Abstract: We propose a multivariate functional responses low rank regression model with possible high dimensional functional responses and scalar covariates. By expanding the slope functions on a set of sieve basis, we reconstruct the basis coefficients as a matrix. To estimate these coefficients, we propose an efficient procedure using nuclear norm regularization. We also derive error bounds for our estima…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Canadian Journal of Statistics(accepted)

  21. arXiv:2009.02251  [pdf, other]

    cs.LG cs.IR math.NA stat.ML

    Efficient Model-Based Collaborative Filtering with Fast Adaptive PCA

    Authors: Xiangyun Ding, Wenjian Yu, Yuyang Xie, Shenghua Liu

    Abstract: A model-based collaborative filtering (CF) approach utilizing fast adaptive randomized singular value decomposition (SVD) is proposed for the matrix completion problem in recommender system. Firstly, a fast adaptive PCA framework is presented which combines the fixed-precision randomized matrix factorization algorithm [1] and accelerating skills for handling large sparse data. Then, a novel termina…

    Submitted 4 September, 2020; originally announced September 2020.

  22. arXiv:2007.07052  [pdf, ps, other]

    stat.ME cs.LG

    Predicting feature imputability in the absence of ground truth

    Authors: Niamh McCombe, Xuemei Ding, Girijesh Prasad, David P. Finn, Stephen Todd, Paula L. McClean, KongFatt Wong-Lin

    Abstract: Data imputation is the most popular method of dealing with missing values, but in most real life applications, large missing data can occur and it is difficult or impossible to evaluate whether data has been imputed accurately (lack of ground truth). This paper addresses these issues by proposing an effective and simple principal component based method for determining whether individual data featu…

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: 5 pages, 3 figures, 1 table. In: Proceedings of the 37th International Conference on Machine Learning (ICML), 2020

  23. arXiv:2007.03260  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting

    Authors: Xiaohan Ding, Tianxiang Hao, Jianchao Tan, Ji Liu, Jungong Han, Yuchen Guo, Guiguang Ding

    Abstract: We propose ResRep, a novel method for lossless channel pruning (a.k.a. filter pruning), which slims down a CNN by reducing the width (number of output channels) of convolutional layers. Inspired by the neurobiology research about the independence of remembering and forgetting, we propose to re-parameterize a CNN into the remembering parts and forgetting parts, where the former learn to maintain th…

    Submitted 14 August, 2021; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: ICCV 2021

  24. arXiv:2005.12108  [pdf, other]

    cs.LG stat.ML

    Gradient Monitored Reinforcement Learning

    Authors: Mohammed Sharafath Abdul Hameed, Gavneet Singh Chadha, Andreas Schwung, Steven X. Ding

    Abstract: This paper presents a novel neural network training approach for faster convergence and better generalization abilities in deep reinforcement learning. Particularly, we focus on the enhancement of training and evaluation performance in reinforcement learning algorithms by systematically reducing gradient's variance and thereby providing a more targeted learning process. The proposed method which w…

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: 14 pages, 15 images

  25. arXiv:2004.04019  [pdf, other]

    stat.OT cs.LG q-bio.PE stat.ML

    A machine learning methodology for real-time forecasting of the 2019-2020 COVID-19 outbreak using Internet searches, news alerts, and estimates from mechanistic models

    Authors: Dianbo Liu, Leonardo Clemente, Canelle Poirier, Xiyu Ding, Matteo Chinazzi, Jessica T Davis, Alessandro Vespignani, Mauricio Santillana

    Abstract: We present a timely and novel methodology that combines disease estimates from mechanistic models with digital traces, via interpretable machine-learning methodologies, to reliably forecast COVID-19 activity in Chinese provinces in real-time. Specifically, our method is able to produce stable and accurate forecasts 2 days ahead of current time, and uses as inputs (a) official health reports from C…

    Submitted 8 April, 2020; originally announced April 2020.

  26. arXiv:2002.03222  [pdf, other]

    cs.LG cs.IR stat.ML

    SUOD: Toward Scalable Unsupervised Outlier Detection

    Authors: Yue Zhao, Xueying Ding, Jianing Yang, Haoping Bai

    Abstract: Outlier detection is a key field of machine learning for identifying abnormal data objects. Due to the high expense of acquiring ground truth, unsupervised models are often chosen in practice. To compensate for the unstable nature of unsupervised algorithms, practitioners from high-stakes fields like finance, health, and security, prefer to build a large number of models for further combination an…

    Submitted 8 February, 2020; originally announced February 2020.

    Comments: In AAAI-20 Workshop on Artificial Intelligence for Cyber Security (AICS)

  27. arXiv:1910.07988  [pdf, other]

    cs.LG cs.IR stat.ML

    Combining Machine Learning Models using combo Library

    Authors: Yue Zhao, Xuejian Wang, Cheng Cheng, Xueying Ding

    Abstract: Model combination, often regarded as a key sub-field of ensemble learning, has been widely used in both academic research and industry applications. To facilitate this process, we propose and implement an easy-to-use Python toolkit, combo, to aggregate models and scores under various scenarios, including classification, clustering, and anomaly detection. In a nutshell, combo provides a unified and…

    Submitted 23 November, 2019; v1 submitted 21 September, 2019; originally announced October 2019.

    Comments: In Proceedings of Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

  28. arXiv:1909.12778  [pdf, other]

    cs.LG cs.CV stat.ML

    Global Sparse Momentum SGD for Pruning Very Deep Neural Networks

    Authors: Xiaohan Ding, Guiguang Ding, Xiangxin Zhou, Yuchen Guo, Jungong Han, Ji Liu

    Abstract: Deep Neural Network (DNN) is powerful but computationally expensive and memory intensive, thus impeding its practical usage on resource-constrained front-end devices. DNN pruning is an approach for deep model compression, which aims at eliminating some parameters with tolerable performance degradation. In this paper, we propose a novel momentum-SGD-based optimization method to reduce the network c…

    Submitted 25 October, 2019; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: Accepted by NeurIPS 2019

  29. Subsampling Generative Adversarial Networks: Density Ratio Estimation in Feature Space with Softplus Loss

    Authors: Xin Ding, Z. Jane Wang, William J. Welch

    Abstract: Filtering out unrealistic images from trained generative adversarial networks (GANs) has attracted considerable attention recently. Two density ratio based subsampling methods---Discriminator Rejection Sampling (DRS) and Metropolis-Hastings GAN (MH-GAN)---were recently proposed, and their effectiveness in improving GANs was demonstrated on multiple datasets. However, DRS and MH-GAN are based on di…

    Submitted 20 February, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

  30. arXiv:1909.04217  [pdf, other]

    cs.LG cs.CV stat.ML

    Swapped Face Detection using Deep Learning and Subjective Assessment

    Authors: Xinyi Ding, Zohreh Raziei, Eric C. Larson, Eli V. Olinick, Paul Krueger, Michael Hahsler

    Abstract: The tremendous success of deep learning for imaging applications has resulted in numerous beneficial advances. Unfortunately, this success has also been a catalyst for malicious uses such as photo-realistic face swapping of parties without consent. Transferring one person's face from a source image to a target image of another person, while keeping the image photo-realistic overall has become incr…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 8 pages, 5 figures

  31. arXiv:1907.04924  [pdf, other]

    cs.IR cs.LG stat.ML

    Infer Implicit Contexts in Real-time Online-to-Offline Recommendation

    Authors: Xichen Ding, Jie Tang, Tracy Liu, Cheng Xu, Yaping Zhang, Feng Shi, Qixia Jiang, Dan Shen

    Abstract: Understanding users' context is essential for successful recommendations, especially for Online-to-Offline (O2O) recommendation, such as Yelp, Groupon, and Koubei. Different from traditional recommendation where individual preference is mostly static, O2O recommendation should be dynamic to capture variation of users' purposes across time and location. However, precisely inferring users' real-time…

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 9 pages,KDD,KDD2019

  32. arXiv:1906.04904  [pdf, other]

    stat.ML cs.LG

    Learning Deep Generative Models with Annealed Importance Sampling

    Authors: Xinqiang Ding, David J. Freedman

    Abstract: Variational inference (VI) and Markov chain Monte Carlo (MCMC) are two main approximate approaches for learning deep generative models by maximizing marginal likelihood. In this paper, we propose using annealed importance sampling for learning deep generative models. Our proposed approach bridges VI with MCMC. It generalizes VI methods such as variational auto-encoders and importance weighted auto…

    Submitted 29 November, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Journal ref: NeurIPS 2020 Workshop on Machine Learning and the Physical Sciences

  33. arXiv:1905.04748  [pdf, other]

    cs.LG cs.CV stat.ML

    Approximated Oracle Filter Pruning for Destructive CNN Width Optimization

    Authors: Xiaohan Ding, Guiguang Ding, Yuchen Guo, Jungong Han, Chenggang Yan

    Abstract: It is not easy to design and run Convolutional Neural Networks (CNNs) due to: 1) finding the optimal number of filters (i.e., the width) at each layer is tricky, given an architecture; and 2) the computational intensity of CNNs impedes the deployment on computationally limited devices. Oracle Pruning is designed to remove the unimportant filters from a well-trained CNN, which estimates the filters…

    Submitted 12 May, 2019; originally announced May 2019.

    Comments: ICML 2019

  34. arXiv:1904.03837  [pdf, other]

    cs.LG cs.CV stat.ML

    Centripetal SGD for Pruning Very Deep Convolutional Networks with Complicated Structure

    Authors: Xiaohan Ding, Guiguang Ding, Yuchen Guo, Jungong Han

    Abstract: The redundancy is widely recognized in Convolutional Neural Networks (CNNs), which enables to remove unimportant filters from convolutional layers so as to slim the network with acceptable performance drop. Inspired by the linear and combinational properties of convolution, we seek to make some filters increasingly close and eventually identical for network slimming. To this end, we propose Centri…

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: CVPR 2019

  35. arXiv:1902.10060  [pdf]

    stat.AP stat.ME

    Electronic Health Record Phenotyping with Internally Assessable Performance (PhIAP) using Anchor-Positive and Unlabeled Patients

    Authors: Lingjiao Zhang, Xiruo Ding, Yanyuan Ma, Naveen Muthu, Imran Ajmal, Jason H. Moore, Daniel S. Herman, Jinbo Chen

    Abstract: Building phenotype models using electronic health record (EHR) data conventionally requires manually labeled cases and controls. Assigning labels is labor intensive and, for some phenotypes, identifying gold-standard controls is prohibitive. To facilitate comprehensive clinical decision support and research, we sought to develop an accurate EHR phenotyping approach that assesses its performance wi…

    Submitted 30 January, 2019; originally announced February 2019.

  36. arXiv:1810.10172   

    stat.ME cs.LG math.ST stat.ML

    Modified Multidimensional Scaling and High Dimensional Clustering

    Authors: Xiucai Ding, Qiang Sun

    Abstract: Multidimensional scaling is an important dimension reduction tool in statistics and machine learning. Yet few theoretical results characterizing its statistical performance exist, not to mention any in high dimensions. By considering a unified framework that includes low, moderate and high dimensions, we study multidimensional scaling in the setting of clustering noisy data. Our results suggest th…

    Submitted 29 March, 2022; v1 submitted 23 October, 2018; originally announced October 2018.

    Comments: This paper will be subsumed by another paper

  37. arXiv:1705.04524  [pdf, other]

    cs.LG cs.AI math.DS stat.ML

    Long-term Blood Pressure Prediction with Deep Recurrent Neural Networks

    Authors: Peng Su, Xiao-Rong Ding, Yuan-Ting Zhang, Jing Liu, Fen Miao, Ni Zhao

    Abstract: Existing methods for arterial blood pressure (BP) estimation directly map the input physiological signals to output BP values without explicitly modeling the underlying temporal dependencies in BP dynamics. As a result, these models suffer from accuracy decay over a long time and thus require frequent calibration. In this work, we address this issue by formulating BP estimation as a sequence predi…

    Submitted 14 January, 2018; v1 submitted 12 May, 2017; originally announced May 2017.

    Comments: To appear in IEEE BHI 2018

  38. arXiv:1604.04002  [pdf, other]

    math.ST stat.AP

    Sparse transition matrix estimation for high-dimensional and locally stationary vector autoregressive models

    Authors: Xin Ding, Ziyi Qiu, Xiaohui Chen

    Abstract: We consider the estimation of the transition matrix in the high-dimensional time-varying vector autoregression (TV-VAR) models. Our model builds on a general class of locally stationary VAR processes that evolve smoothly in time. We propose a hybridized kernel smoothing and $\ell^1$-regularized method to directly estimate the sequence of time-varying transition matrices. Under the sparsity assumpt…

    Submitted 29 September, 2017; v1 submitted 13 April, 2016; originally announced April 2016.

  39. arXiv:1302.2712  [pdf, other]

    cs.CV physics.med-ph stat.AP

    Bayesian Nonparametric Dictionary Learning for Compressed Sensing MRI

    Authors: Yue Huang, John Paisley, Qin Lin, Xinghao Ding, Xueyang Fu, Xiao-Ping Zhang

    Abstract: We develop a Bayesian nonparametric model for reconstructing magnetic resonance images (MRI) from highly undersampled k-space data. We perform dictionary learning as part of the image reconstruction process. To this end, we use the beta process as a nonparametric dictionary learning prior for representing an image patch as a sparse combination of dictionary elements. The size of the dictionary and…

    Submitted 26 July, 2014; v1 submitted 12 February, 2013; originally announced February 2013.