Skip to main content

Showing 1–32 of 32 results for author: Yan, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.02539  [pdf

    cs.RO cs.AI cs.LG stat.ML

    Research on Autonomous Robots Navigation based on Reinforcement Learning

    Authors: Zixiang Wang, Hao Yan, Yining Wang, Zhengjia Xu, Zhuoyue Wang, Zhizhong Wu

    Abstract: Reinforcement learning continuously optimizes decision-making based on real-time feedback reward signals through continuous interaction with the environment, demonstrating strong adaptive and self-learning capabilities. In recent years, it has become one of the key methods to achieve autonomous navigation of robots. In this work, an autonomous robot navigation method based on reinforcement learnin… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2407.01015  [pdf, other

    stat.ML cs.LG

    Bayesian Entropy Neural Networks for Physics-Aware Prediction

    Authors: Rahul Rathnakumar, Jiayu Huang, Hao Yan, Yongming Liu

    Abstract: This paper addresses the need for deep learning models to integrate well-defined constraints into their outputs, driven by their application in surrogate models, learning with limited data and partial information, and scenarios requiring flexible model behavior to incorporate non-data sample information. We introduce Bayesian Entropy Neural Networks (BENN), a framework grounded in Maximum Entropy… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 15 pages

    ACM Class: I.5.1

  3. arXiv:2404.04403  [pdf, other

    stat.ME cs.AI

    Low-Rank Robust Subspace Tensor Clustering for Metro Passenger Flow Modeling

    Authors: Jiuyun Hu, Ziyue Li, Chen Zhang, Fugee Tsung, Hao Yan

    Abstract: Tensor clustering has become an important topic, specifically in spatio-temporal modeling, due to its ability to cluster spatial modes (e.g., stations or road segments) and temporal modes (e.g., time of the day or day of the week). Our motivating example is from subway passenger flow modeling, where similarities between stations are commonly found. However, the challenges lie in the innate high-di… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Conditionally Accepted in INFORMS Journal of Data Science

  4. arXiv:2310.20224  [pdf, other

    stat.ML cs.AI cs.LG stat.AP

    Choose A Table: Tensor Dirichlet Process Multinomial Mixture Model with Graphs for Passenger Trajectory Clustering

    Authors: Ziyue Li, Hao Yan, Chen Zhang, Lijun Sun, Wolfgang Ketter, Fugee Tsung

    Abstract: Passenger clustering based on trajectory records is essential for transportation operators. However, existing methods cannot easily cluster the passengers due to the hierarchical structure of the passenger trip information, including multiple trips within each passenger and multi-dimensional information about each trip. Furthermore, existing approaches rely on an accurate specification of the clus… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted in ACM SIGSPATIAL 2023. arXiv admin note: substantial text overlap with arXiv:2306.13794

  5. arXiv:2310.08721  [pdf, other

    stat.AP

    Drug Supply Chain Optimization for Adaptive Clinical Trials

    Authors: **cheng Pang, Hong Yan, Zoe Hua

    Abstract: With increasing interest in adaptive clinical trial designs, challenges are present to drug supply chain management which may offset the benefit of adaptive designs. Thus, it is necessary to develop an optimization tool to facilitate the decision making and analysis of drug supply chain planning. The challenges include the uncertainty of maximum drug supply needed, the shifting of supply requireme… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  6. arXiv:2309.03439  [pdf, other

    cs.LG stat.ME

    Personalized Tucker Decomposition: Modeling Commonality and Peculiarity on Tensor Data

    Authors: Jiuyun Hu, Naichen Shi, Raed Al Kontar, Hao Yan

    Abstract: We propose personalized Tucker decomposition (perTucker) to address the limitations of traditional tensor decomposition methods in capturing heterogeneity across different datasets. perTucker decomposes tensor data into shared global components and personalized local components. We introduce a mode orthogonality assumption and develop a proximal gradient regularized block coordinate descent algori… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  7. arXiv:2306.13794  [pdf, other

    stat.ML cs.LG

    Tensor Dirichlet Process Multinomial Mixture Model for Passenger Trajectory Clustering

    Authors: Ziyue Li, Hao Yan, Chen Zhang, Andi Wang, Wolfgang Ketter, Lijun Sun, Fugee Tsung

    Abstract: Passenger clustering based on travel records is essential for transportation operators. However, existing methods cannot easily cluster the passengers due to the hierarchical structure of the passenger trip information, namely: each passenger has multiple trips, and each trip contains multi-dimensional multi-mode information. Furthermore, existing approaches rely on an accurate specification of th… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: Under Review of Transportation Research Part C: Emerging Technologies

  8. arXiv:2305.14767  [pdf

    stat.ME

    Interpretation and visualization of distance covariance through additive decomposition of correlations formula

    Authors: Andi Wang, Hao Yan, Juan Du

    Abstract: Distance covariance is a widely used statistical methodology for testing the dependency between two groups of variables. Despite the appealing properties of consistency and superior testing power, the testing results of distance covariance are often hard to be interpreted. This paper presents an elementary interpretation of the mechanism of distance covariance through an additive decomposition of… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

  9. arXiv:2208.08855  [pdf, other

    eess.SP stat.AP stat.ME

    Adaptive Partially-Observed Sequential Change Detection and Isolation

    Authors: Xinyu Zhao, Jiuyun Hu, Yajun Mei, Hao Yan

    Abstract: High-dimensional data has become popular due to the easy accessibility of sensors in modern industrial applications. However, one specific challenge is that it is often not easy to obtain complete measurements due to limited sensing powers and resource constraints. Furthermore, distinct failure patterns may exist in the systems, and it is necessary to identify the true failure pattern. This work f… ▽ More

    Submitted 25 August, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted in Technometrics

  10. arXiv:2208.05045  [pdf, other

    cs.LG stat.AP stat.ME

    Adaptive Resources Allocation CUSUM for Binomial Count Data Monitoring with Application to COVID-19 Hotspot Detection

    Authors: Jiuyun Hu, Yajun Mei, Sarah Holte, Hao Yan

    Abstract: In this paper, we present an efficient statistical method (denoted as "Adaptive Resources Allocation CUSUM") to robustly and efficiently detect the hotspot with limited sampling resources. Our main idea is to combine the multi-arm bandit (MAB) and change-point detection methods to balance the exploration and exploitation of resource allocation for hotspot detection. Further, a Bayesian weighted up… ▽ More

    Submitted 17 August, 2022; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted in Journal of Applied Statistics

  11. arXiv:2112.04492  [pdf, other

    cs.LG stat.ME

    Daily peak electrical load forecasting with a multi-resolution approach

    Authors: Yvenn Amara-Ouali, Matteo Fasiolo, Yannig Goude, Hui Yan

    Abstract: In the context of smart grids and load balancing, daily peak load forecasting has become a critical activity for stakeholders of the energy industry. An understanding of peak magnitude and timing is paramount for the implementation of smart grid strategies such as peak shaving. The modelling approach proposed in this paper leverages high-resolution and low-resolution information to forecast daily… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

  12. arXiv:2105.08180  [pdf, other

    cs.LG cs.AI stat.AP

    Deep Multistage Multi-Task Learning for Quality Prediction of Multistage Manufacturing Systems

    Authors: Hao Yan, Nurretin Dorukhan Sergin, William A. Brenneman, Stephen Joseph Lange, Shan Ba

    Abstract: In multistage manufacturing systems, modeling multiple quality indices based on the process sensing variables is important. However, the classic modeling technique predicts each quality variable one at a time, which fails to consider the correlation within or between stages. We propose a deep multistage multi-task learning framework to jointly predict all output sensing variables in a unified end-… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted by Journal of Quality Technology

  13. arXiv:2101.06839  [pdf, other

    stat.ME

    Adaptive Change Point Monitoring for High-Dimensional Data

    Authors: Teng Wu, Runmin Wang, Hao Yan, Xiaofeng Shao

    Abstract: In this paper, we propose a class of monitoring statistics for a mean shift in a sequence of high-dimensional observations. Inspired by the recent U-statistic based retrospective tests developed by Wang et al.(2019) and Zhang et al.(2020), we advance the U-statistic based approach to the sequential monitoring problem by develo** a new adaptive monitoring procedure that can detect both dense and… ▽ More

    Submitted 17 January, 2021; originally announced January 2021.

  14. arXiv:2009.10645  [pdf, other

    stat.ML cs.LG stat.AP

    Partially Observable Online Change Detection via Smooth-Sparse Decomposition

    Authors: Jie Guo, Hao Yan, Chen Zhang, Steven Hoi

    Abstract: We consider online change detection of high dimensional data streams with sparse changes, where only a subset of data streams can be observed at each sensing time point due to limited sensing capacities. On the one hand, the detection scheme should be able to deal with partially observable data and meanwhile have efficient detection power for sparse changes. On the other, the scheme should be able… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 48 pages

  15. arXiv:2008.10030  [pdf, other

    cs.LG cs.CV stat.ML

    Unsupervised Domain Adaptation via Discriminative Manifold Propagation

    Authors: You-Wei Luo, Chuan-Xian Ren, Dao-Qing Dai, Hong Yan

    Abstract: Unsupervised domain adaptation is effective in leveraging rich information from a labeled source domain to an unlabeled target domain. Though deep learning and adversarial strategy made a significant breakthrough in the adaptability of features, there are two issues to be further studied. First, hard-assigned pseudo labels on the target domain are arbitrary and error-prone, and direct application… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: To be published in IEEE Transactions on Pattern Analysis and Machine Intelligence

  16. arXiv:2007.06136  [pdf, other

    stat.AP

    Bayesian Bi-clustering Methods with Applications in Computational Biology

    Authors: Han Yan, Jiexing Wu, Yang Li, Jun S. Liu

    Abstract: Bi-clustering is a useful approach in analyzing biological data when observations come from heterogeneous groups and have a large number of features. We outline a general Bayesian approach in tackling bi-clustering problems in moderate to high dimensions, and propose three Bayesian bi-clustering models on categorical data, which increase in complexities in their modeling of the distributions of fe… ▽ More

    Submitted 9 February, 2021; v1 submitted 12 July, 2020; originally announced July 2020.

  17. arXiv:2004.11710  [pdf, other

    stat.AP stat.ME

    Rapid Detection of Hot-spots via Tensor Decomposition with applications to Crime Rate Data

    Authors: Yujie Zhao, Hao Yan, Sarah Holte, Yajun Mei

    Abstract: We propose an efficient statistical method (denoted as SSR-Tensor) to robustly and quickly detect hot-spots that are sparse and temporal-consistent in a spatial-temporal dataset through the tensor decomposition. Our main idea is first to build an SSR model to decompose the tensor data into a Smooth global trend mean, Sparse local hot-spots, and Residuals. Next, tensor decomposition is utilized as… ▽ More

    Submitted 14 May, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: text overlap with arXiv:2001.11685

  18. arXiv:2004.11022  [pdf, other

    cs.LG eess.SP stat.AP

    Long-Short Term Spatiotemporal Tensor Prediction for Passenger Flow Profile

    Authors: Ziyue Li, Hao Yan, Chen Zhang, Fugee Tsung

    Abstract: Spatiotemporal data is very common in many applications, such as manufacturing systems and transportation systems. It is typically difficult to be accurately predicted given intrinsic complex spatial and temporal correlations. Most of the existing methods based on various statistical models and regularization terms, fail to preserve innate features in data alongside their complex correlations. In… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

  19. arXiv:2004.10977  [pdf

    stat.AP cs.CV

    Real-time Detection of Clustered Events in Video-imaging data with Applications to Additive Manufacturing

    Authors: Hao Yan, Marco Grasso, Kamran Paynabar, Bianca Maria Colosimo

    Abstract: The use of video-imaging data for in-line process monitoring applications has become more and more popular in the industry. In this framework, spatio-temporal statistical process monitoring methods are needed to capture the relevant information content and signal possible out-of-control states. Video-imaging data are characterized by a spatio-temporal variability structure that depends on the unde… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

  20. arXiv:2001.11685  [pdf, ps, other

    stat.AP stat.ME

    Rapid Detection of Hot-spot by Tensor Decomposition with Application to Weekly Gonorrhea Data

    Authors: Yujie Zhao, Hao Yan, Sarah E. Holte, Roxanne P. Kerani, Yajun Mei

    Abstract: In many bio-surveillance and healthcare applications, data sources are measured from many spatial locations repeatedly over time, say, daily/weekly/monthly. In these applications, we are typically interested in detecting hot-spots, which are defined as some structured outliers that are sparse over the spatial domain but persistent over time. In this paper, we propose a tensor decomposition method… ▽ More

    Submitted 7 April, 2020; v1 submitted 31 January, 2020; originally announced January 2020.

    Journal ref: The XIIIth International Workshop on Intelligent Statistical Quality Control, pp 289- 310, Hong Kong, 2019

  21. arXiv:1912.05693  [pdf, other

    cs.LG stat.ML

    Tensor Completion for Weakly-dependent Data on Graph for Metro Passenger Flow Prediction

    Authors: Ziyue Li, Nurettin Dorukhan Sergin, Hao Yan, Chen Zhang, Fugee Tsung

    Abstract: Low-rank tensor decomposition and completion have attracted significant interest from academia given the ubiquity of tensor data. However, the low-rank structure is a global property, which will not be fulfilled when the data presents complex and weak dependencies given specific graph structures. One particular application that motivates this study is the spatiotemporal data analysis. As shown in… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

    Comments: Accepted at AAAI 2020

  22. Toward a Better Monitoring Statistic for Profile Monitoring via Variational Autoencoders

    Authors: Nurettin Sergin, Hao Yan

    Abstract: Wide accessibility of imaging and profile sensors in modern industrial systems created an abundance of high-dimensional sensing variables. This led to a a growing interest in the research of high-dimensional process monitoring. However, most of the approaches in the literature assume the in-control population to lie on a linear manifold with a given basis (i.e., spline, wavelet, kernel, etc) or an… ▽ More

    Submitted 10 August, 2022; v1 submitted 1 November, 2019; originally announced November 2019.

    Comments: Journal of Quality Technology 53 (2021) 454-473

  23. arXiv:1910.09979  [pdf, other

    stat.ML cs.LG

    Orthogonal Nonnegative Tucker Decomposition

    Authors: Junjun Pan, Michael K. Ng, Ye Liu, Xiongjun Zhang, Hong Yan

    Abstract: In this paper, we study the nonnegative tensor data and propose an orthogonal nonnegative Tucker decomposition (ONTD). We discuss some properties of ONTD and develop a convex relaxation algorithm of the augmented Lagrangian function to solve the optimization problem. The convergence of the algorithm is given. We employ ONTD on the image data sets from the real world applications including face rec… ▽ More

    Submitted 27 October, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

  24. arXiv:1910.05513  [pdf, other

    cs.LG stat.ML

    On Robustness of Neural Ordinary Differential Equations

    Authors: Hanshu Yan, Jiawei Du, Vincent Y. F. Tan, Jiashi Feng

    Abstract: Neural ordinary differential equations (ODEs) have been attracting increasing attention in various research domains recently. There have been some works studying optimization issues and approximation capabilities of neural ODEs, but their robustness is still yet unclear. In this work, we fill this important gap by exploring robustness properties of neural ODEs both empirically and theoretically. W… ▽ More

    Submitted 3 March, 2022; v1 submitted 12 October, 2019; originally announced October 2019.

  25. arXiv:1910.02119  [pdf, other

    stat.ML cs.LG eess.SP

    AKM$^2$D : An Adaptive Framework for Online Sensing and Anomaly Quantification

    Authors: Hao Yan, Kamran Paynabar, Jianjun Shi

    Abstract: In point-based sensing systems such as coordinate measuring machines (CMM) and laser ultrasonics where complete sensing is impractical due to the high sensing time and cost, adaptive sensing through a systematic exploration is vital for online inspection and anomaly quantification. Most of the existing sequential sampling methodologies focus on reducing the overall fitting error for the entire sam… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: Under review in IISE Transaction

  26. arXiv:1906.03586  [pdf, other

    cs.LG cs.SI stat.ML

    Dynamic Network Embedding via Incremental Skip-gram with Negative Sampling

    Authors: Hao Peng, Jianxin Li, Hao Yan, Qiran Gong, Senzhang Wang, Lin Liu, Lihong Wang, Xiang Ren

    Abstract: Network representation learning, as an approach to learn low dimensional representations of vertices, has attracted considerable research attention recently. It has been proven extremely useful in many machine learning tasks over large graph. Most existing methods focus on learning the structural representations of vertices in a static network, but cannot guarantee an accurate and efficient embedd… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: Accepted by China Science Information Science. arXiv admin note: text overlap with arXiv:1811.05932 by other authors

  27. Structured Point Cloud Data Analysis via Regularized Tensor Regression for Process Modeling and Optimization

    Authors: Hao Yan, Kamran Paynabar, Massimo Pacella

    Abstract: Advanced 3D metrology technologies such as Coordinate Measuring Machine (CMM) and laser 3D scanners have facilitated the collection of massive point cloud data, beneficial for process monitoring, control and optimization. However, due to their high dimensionality and structure complexity, modeling and analysis of point clouds are still a challenge. In this paper, we utilize multilinear algebra tec… ▽ More

    Submitted 1 December, 2018; v1 submitted 25 July, 2018; originally announced July 2018.

    Comments: Technometrics, accepted

    Journal ref: Technometrics 61.3 (2019): 385-395

  28. arXiv:1804.03797  [pdf, other

    stat.ML cs.LG

    Dynamic Multivariate Functional Data Modeling via Sparse Subspace Learning

    Authors: Chen Zhang, Hao Yan, Seungho Lee, Jianjun Shi

    Abstract: Multivariate functional data from a complex system are naturally high-dimensional and have complex cross-correlation structure. The complexity of data structure can be observed as that (1) some functions are strongly correlated with similar features, while some others may have almost no cross-correlations with quite diverse features; and (2) the cross-correlation structure may also change over tim… ▽ More

    Submitted 10 April, 2018; originally announced April 2018.

  29. arXiv:1804.03346  [pdf, other

    cs.LG stat.ML

    Learning Latent Events from Network Message Logs

    Authors: Siddhartha Satpathi, Supratim Deb, R Srikant, He Yan

    Abstract: We consider the problem of separating error messages generated in large distributed data center networks into error events. In such networks, each error event leads to a stream of messages generated by hardware and software components affected by the event. These messages are stored in a giant message log. We consider the unsupervised learning problem of identifying the signatures of events that g… ▽ More

    Submitted 17 July, 2019; v1 submitted 10 April, 2018; originally announced April 2018.

    Comments: To Appear in IEEE Transactions on Networking, Appeared in Workshop on MiLeTS, SIGKDD 2018

  30. A novel approach for fusion of heterogeneous sources of data

    Authors: Mostafa Reisi Gahrooei, Hao Yan, Kamran Paynabar, Jianjun Shi

    Abstract: With advancements in sensor technology, a heterogeneous set of data, containing samples of scalar, waveform signal, image, or even structured point cloud are becoming increasingly popular. Develo** a statistical model, representing the behavior of the underlying system based upon such a heterogeneous set of data can be used in monitoring, control, and optimization of the system. Unfortunately, a… ▽ More

    Submitted 19 April, 2018; v1 submitted 28 February, 2018; originally announced March 2018.

    Journal ref: Technometrics, 2020

  31. arXiv:1612.07439  [pdf, other

    stat.AP

    Estimating fiber orientation distribution from diffusion MRI with spherical needlets

    Authors: Hao Yan, Owen Carmichael, Debashis Paul, Jie Peng

    Abstract: We present a novel method for estimation of the fiber orientation distribution (FOD) function based on diffusion-weighted Magnetic Resonance Imaging (D-MRI) data. We formulate the problem of FOD estimation as a regression problem through spherical deconvolution and a sparse representation of the FOD by a spherical needlets basis that form a multi-resolution tight frame for spherical functions. Thi… ▽ More

    Submitted 21 December, 2016; originally announced December 2016.

  32. arXiv:1110.3257  [pdf, other

    stat.AP

    Modelling the impact of human activity on nitrogen dioxide concentrations in Europe

    Authors: Gavin Shaddick, Haojie Yan, Danielle Vienneau

    Abstract: Ambient concentrations of many pollutants are associated with emissions due to human activity, such as road transport and other combustion sources. In this paper we consider air pollution as a multi--level phenomenon within a Bayesian hierarchical model. We examine different scales of variation in pollution concentrations ranging from large scale transboundary effects to more localised effects whi… ▽ More

    Submitted 14 October, 2011; originally announced October 2011.

    Comments: 22 pages, 5 figures