Showing 1–50 of 75 results for author: He, Z

Searching in archive stat.
  1. arXiv:2406.16525  [pdf, other]

    stat.ML cs.LG

    OAML: Outlier Aware Metric Learning for OOD Detection Enhancement

    Authors: Heng Gao, Zhuolin He, Shoumeng Qiu, Jian Pu

    Abstract: Out-of-distribution (OOD) detection methods have been developed to identify objects that a model has not seen during training. The Outlier Exposure (OE) methods use auxiliary datasets to train OOD detectors directly. However, the collection and learning of representative OOD samples may pose challenges. To tackle these issues, we propose the Outlier Aware Metric Learning (OAML) framework. The main…

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2404.01153  [pdf, other]

    stat.ML cs.DC cs.LG math.ST stat.ME

    TransFusion: Covariate-Shift Robust Transfer Learning for High-Dimensional Regression

    Authors: Zelin He, Ying Sun, Jingyuan Liu, Runze Li

    Abstract: The main challenge that sets transfer learning apart from traditional supervised learning is the distribution shift, reflected as the shift between the source and target models and that between the marginal covariate distributions. In this work, we tackle model shifts in the presence of covariate shifts in the high-dimensional regression setting. Specifically, we propose a two-step method with a n…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by the 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  3. arXiv:2404.00481  [pdf, other]

    stat.ML cs.LG eess.SY

    Convolutional Bayesian Filtering

    Authors: Wenhan Cao, Shiqi Liu, Chang Liu, Zeyu He, Stephen S. -T. Yau, Shengbo Eben Li

    Abstract: Bayesian filtering serves as the mainstream framework of state estimation in dynamic systems. Its standard version utilizes total probability rule and Bayes' law alternatively, where how to define and compute conditional probability is critical to state distribution inference. Previously, the conditional probability is assumed to be exactly known, which represents a measure of the occurrence proba…

    Submitted 30 March, 2024; originally announced April 2024.

  4. arXiv:2403.13565  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    AdaTrans: Feature-wise and Sample-wise Adaptive Transfer Learning for High-dimensional Regression

    Authors: Zelin He, Ying Sun, Jingyuan Liu, Runze Li

    Abstract: We consider the transfer learning problem in the high dimensional setting, where the feature dimension is larger than the sample size. To learn transferable information, which may vary across features or the source samples, we propose an adaptive transfer learning method that can detect and aggregate the feature-wise (F-AdaTrans) or sample-wise (S-AdaTrans) transferable structures. We achieve this…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Technical Report

  5. arXiv:2402.13934  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Do Efficient Transformers Really Save Computation?

    Authors: Kai Yang, Jan Ackermann, Zhenyu He, Guhao Feng, Bohang Zhang, Yunzhen Feng, Qiwei Ye, Di He, Liwei Wang

    Abstract: As transformer-based language models are trained on increasingly large datasets and with vast numbers of parameters, finding more efficient alternatives to the standard Transformer has become very valuable. While many efficient Transformers and Transformer alternatives have been proposed, none provide theoretical guarantees that they are a suitable replacement for the standard Transformer. This ma…

    Submitted 21 February, 2024; originally announced February 2024.

  6. arXiv:2402.12724  [pdf, other]

    stat.ME q-bio.GN stat.AP

    Controlled Variable Selection from Summary Statistics Only? A Solution via GhostKnockoffs and Penalized Regression

    Authors: Zhaomeng Chen, Zihuai He, Benjamin B. Chu, Jiaqi Gu, Tim Morrison, Chiara Sabatti, Emmanuel Candès

    Abstract: Identifying which variables do influence a response while controlling false positives pervades statistics and data science. In this paper, we consider a scenario in which we only have access to summary statistics, such as the values of marginal empirical correlations between each dependent variable of potential interest and the response. This situation may arise due to privacy concerns, e.g., to a…

    Submitted 20 February, 2024; originally announced February 2024.

  7. arXiv:2401.16776  [pdf, other]

    stat.CO cs.LG stat.ML

    Leveraging Nested MLMC for Sequential Neural Posterior Estimation with Intractable Likelihoods

    Authors: Xiliang Yang, Yifei Xiong, Zhijian He

    Abstract: Sequential neural posterior estimation (SNPE) techniques have been recently proposed for dealing with simulation-based models with intractable likelihoods. They are devoted to learning the posterior from adaptively proposed simulations using neural network-based conditional density estimators. As a SNPE technique, the automatic posterior transformation (APT) method proposed by Greenberg et al. (20…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 28 pages, 4 figures

  8. arXiv:2401.16421  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

    Authors: Zhenyu He, Guhao Feng, Shengjie Luo, Kai Yang, Liwei Wang, Jingjing Xu, Zhi Zhang, Hongxia Yang, Di He

    Abstract: In this work, we leverage the intrinsic segmentation of language sequences and design a new positional encoding method called Bilevel Positional Encoding (BiPE). For each position, our BiPE blends an intra-segment encoding and an inter-segment encoding. The intra-segment encoding identifies the locations within a segment and helps the model capture the semantic information therein via absolute pos…

    Submitted 17 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 17 pages, 7 figures, 8 tables; ICML 2024 Camera Ready version; Code: https://github.com/zhenyuhe00/BiPE

  9. arXiv:2401.08941   

    stat.ME

    A Powerful and Precise Feature-level Filter using Group Knockoffs

    Authors: Jiaqi Gu, Zihuai He

    Abstract: Selecting important features that have substantial effects on the response with provable type-I error rate control is a fundamental concern in statistics, with wide-ranging practical applications. Existing knockoff filters, although shown to provide theoretical guarantee on false discovery rate (FDR) control, often struggle to strike a balance between high power and precision in pinpointing import…

    Submitted 27 February, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: We need a major revision of this paper

  10. arXiv:2401.02154  [pdf, other]

    cs.LG cs.AI cs.CR stat.ME

    Disentangle Estimation of Causal Effects from Cross-Silo Data

    Authors: Yuxuan Liu, Haozhao Wang, Shuang Wang, Zhiming He, Wenchao Xu, Jialiang Zhu, Fan Yang

    Abstract: Estimating causal effects among different events is of great importance to critical fields such as drug development. Nevertheless, the data features associated with events may be distributed across various silos and remain private within respective parties, impeding direct information exchange between them. This, in turn, can result in biased estimations of local causal effects, which rely on the…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  11. arXiv:2401.00461  [pdf, other]

    stat.ME

    A Penalized Functional Linear Cox Regression Model for Spatially-defined Environmental Exposure with an Estimated Buffer Distance

    Authors: Jooyoung Lee, Zhibing He, Charlotte Roscoe, Peter James, Li Xu, Donna Spiegelman, David Zucker, Molin Wang

    Abstract: In environmental health research, it is of interest to understand the effect of the neighborhood environment on health. Researchers have shown a protective association between green space around a person's residential address and depression outcomes. In measuring exposure to green space, distance buffers are often used. However, buffer distances differ across studies. Typically, the buffer distanc…

    Submitted 31 December, 2023; originally announced January 2024.

    Comments: 27 pages, 5 figures

  12. arXiv:2311.12530  [pdf, other]

    stat.ML cs.LG stat.CO

    An efficient likelihood-free Bayesian inference method based on sequential neural posterior estimation

    Authors: Yifei Xiong, Xiliang Yang, Sanguo Zhang, Zhijian He

    Abstract: Sequential neural posterior estimation (SNPE) techniques have been recently proposed for dealing with simulation-based models with intractable likelihoods. Unlike approximate Bayesian computation, SNPE techniques learn the posterior from sequential simulation using neural network-based conditional density estimators by minimizing a specific loss function. The SNPE method proposed by Lueckmann et a…

    Submitted 27 November, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: 30 pages, 7 figures

  13. arXiv:2310.15069  [pdf, other]

    stat.ME q-bio.GN stat.AP

    Second-order group knockoffs with applications to GWAS

    Authors: Benjamin B Chu, Jiaqi Gu, Zhaomeng Chen, Tim Morrison, Emmanuel Candes, Zihuai He, Chiara Sabatti

    Abstract: Conditional testing via the knockoff framework allows one to identify -- among large number of possible explanatory variables -- those that carry unique information about an outcome of interest, and also provides a false discovery rate guarantee on the selection. This approach is particularly well suited to the analysis of genome wide association studies (GWAS), which have the goal of identifying…

    Submitted 3 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 46 pages, 10 figures, 2 tables, 3 algorithms

  14. arXiv:2310.09493  [pdf, other]

    stat.ME stat.AP

    Summary Statistics Knockoffs Inference with Family-wise Error Rate Control

    Authors: Catherine Xinrui Yu, Jiaqi Gu, Zhaomeng Chen, Zihuai He

    Abstract: Testing multiple hypotheses of conditional independence with provable error rate control is a fundamental problem with various applications. To infer conditional independence with family-wise error rate (FWER) control when only summary statistics of marginal dependence are accessible, we adopt GhostKnockoff to directly generate knockoff copies of summary statistics and propose a new filter to sele…

    Submitted 14 October, 2023; originally announced October 2023.

    Comments: 35 pages

  15. arXiv:2310.04030  [pdf]

    stat.ME

    Robust inference with GhostKnockoffs in genome-wide association studies

    Authors: Xinran Qi, Michael E. Belloy, Jiaqi Gu, Xiaoxia Liu, Hua Tang, Zihuai He

    Abstract: Genome-wide association studies (GWASs) have been extensively adopted to depict the underlying genetic architecture of complex diseases. Motivated by GWASs' limitations in identifying small effect loci to understand complex traits' polygenicity and fine-mapping putative causal variants from proxy ones, we propose a knockoff-based method which only requires summary statistics from GWASs and demonst…

    Submitted 6 October, 2023; originally announced October 2023.

  16. arXiv:2308.04368  [pdf, other]

    stat.ME

    Multiple Testing of Local Extrema for Detection of Structural Breaks in Piecewise Linear Models

    Authors: Zhibing He, Dan Cheng, Yunpeng Zhao

    Abstract: In this paper, we propose a new generic method for detecting the number and locations of structural breaks or change points in piecewise linear models under stationary Gaussian noise. Our method transforms the change point detection problem into identifying local extrema (local maxima and local minima) through kernel smoothing and differentiation of the data sequence. By computing p-values for all…

    Submitted 4 December, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

  17. arXiv:2308.03785  [pdf, other]

    stat.ME

    Network Inference Using the Hub Model and Variants

    Authors: Zhibing He, Yunpeng Zhao, Peter Bickel, Charles Weko, Dan Cheng, Jirui Wang

    Abstract: Statistical network analysis primarily focuses on inferring the parameters of an observed network. In many applications, especially in the social sciences, the observed data is the groups formed by individual subjects. In these applications, the network is itself a parameter of a statistical model. Zhao and Weko (2019) propose a model-based approach, called the hub model, to infer implicit network…

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2004.09709

  18. arXiv:2307.12189  [pdf, other]

    physics.soc-ph stat.AP

    Speed Limit: Obey, or Not Obey?

    Authors: Zhengbing He, Mirco Nanni, Luca Pappalardo, Paolo Santi, Carlo Ratti

    Abstract: It is commonly expected that drivers maintain a driving speed that is lower than or around the posted speed limit, as failure to obey may result in safety risks and fines. By taking randomly selected road segments as examples, this study compares the percentages of speeding vehicles in five countries worldwide, namely, two European countries (Germany and Italy), two Asian countries (Japan and Chin…

    Submitted 27 November, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

  19. arXiv:2307.07346  [pdf, other]

    cs.LG stat.AP

    A testing-based approach to assess the clusterability of categorical data

    Authors: Lianyu Hu, Junjie Dong, Mudi Jiang, Yan Liu, Zengyou He

    Abstract: The objective of clusterability evaluation is to check whether a clustering structure exists within the data set. As a crucial yet often-overlooked issue in cluster analysis, it is essential to conduct such a test before applying any clustering algorithm. If a data set is unclusterable, any subsequent clustering analysis would not yield valid results. Despite its importance, the majority of existi…

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 19 pages, 13 figures

  20. arXiv:2303.04095  [pdf, other]

    physics.soc-ph stat.AP

    Investigating and modeling day-to-day route choices based on laboratory experiments. Part II: A route-dependent attraction-based stochastic process model

    Authors: Hang Qi, Ning Jia, Xiaobo Qu, Zhengbing He

    Abstract: To explain day-to-day (DTD) route-choice behaviors and traffic dynamics observed in a series of lab experiments, Part I of this research proposed a discrete choice-based analytical dynamic model (Qi et al., 2023). Although the deterministic model could well reproduce the experimental observations, it converges to a stable equilibrium of route flow while the observed DTD evolution is apparently wit…

    Submitted 7 March, 2023; originally announced March 2023.

  21. arXiv:2303.04088  [pdf, other]

    physics.soc-ph stat.AP

    Investigating day-to-day route choices based on multi-scenario laboratory experiments. Part I: Route-dependent attraction and its modeling

    Authors: Hang Qi, Ning Jia, Xiaobo Qu, Zhengbing He

    Abstract: In the area of urban transportation networks, a growing number of day-to-day (DTD) traffic dynamic theories have been proposed to describe the network flow evolution, and an increasing amount of laboratory experiments have been conducted to observe travelers' behavior regularities. However, the "communication" between theorists and experimentalists has not been made well. This paper devotes to 1)…

    Submitted 7 March, 2023; originally announced March 2023.

    Journal ref: Transportation Research Part A, 2023

  22. arXiv:2211.03956  [pdf, other]

    cs.LG stat.AP

    Significance-Based Categorical Data Clustering

    Authors: Lianyu Hu, Mudi Jiang, Yan Liu, Zengyou He

    Abstract: Although numerous algorithms have been proposed to solve the categorical data clustering problem, how to assess the statistical significance of a set of categorical clusters remains unaddressed. To fulfill this void, we employ the likelihood ratio test to derive a test statistic that can serve as a significance-based objective function in categorical data clustering. Consequently, a new clustering…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 36 pages, 6 figures

  23. arXiv:2206.00381  [pdf, ps, other]

    physics.soc-ph cs.SI stat.AP

    The statistical nature of h-index of a network node

    Authors: Yan Liu, Mudi Jiang, Lianyu Hu, Zengyou He

    Abstract: Evaluating the importance of a network node is a crucial task in network science and graph data mining. H-index is a popular centrality measure for this task, however, there is still a lack of its interpretation from a rigorous statistical aspect. Here we show the statistical nature of h-index from the perspective of order statistics, and we obtain a new family of centrality indices by generalizin…

    Submitted 19 May, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

  24. arXiv:2202.10991  [pdf]

    cs.LG cs.AI stat.AP

    Temporal Subtyping of Alzheimer's Disease Using Medical Conditions Preceding Alzheimer's Disease Onset in Electronic Health Records

    Authors: Zhe He, Shubo Tian, Arslan Erdengasileng, Neil Charness, Jiang Bian

    Abstract: Subtyping of Alzheimer's disease (AD) can facilitate diagnosis, treatment, prognosis and disease management. It can also support the testing of new prevention and treatment strategies through clinical trials. In this study, we employed spectral clustering to cluster 29,922 AD patients in the OneFlorida Data Trust using their longitudinal EHR data of diagnosis and conditions into four subtypes. The…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: 10 pages

  25. arXiv:2109.14719  [pdf]

    cs.LG stat.AP stat.CO stat.ML

    Deep neural networks with controlled variable selection for the identification of putative causal genetic variants

    Authors: Peyman H. Kassani, Fred Lu, Yann Le Guen, Zihuai He

    Abstract: Deep neural networks (DNN) have been used successfully in many scientific problems for their high prediction accuracy, but their application to genetic studies remains challenging due to their poor interpretability. In this paper, we consider the problem of scalable, robust variable selection in DNN for the identification of putative causal genetic variants in genome sequencing studies. We identif…

    Submitted 29 September, 2021; originally announced September 2021.

  26. Enhancing Trajectory Prediction using Sparse Outputs: Application to Team Sports

    Authors: Brandon Victor, Aiden Nibali, Zhen He, David L. Carey

    Abstract: Sophisticated trajectory prediction models that effectively mimic team dynamics have many potential uses for sports coaches, broadcasters and spectators. However, through experiments on soccer data we found that it can be surprisingly challenging to train a deep learning model for player trajectory prediction which outperforms linear extrapolation on average distance between predicted and true fut…

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: 10 pages (not including references), 7 figures. Published in Neural Computing and Applications on 20 March 2021

    ACM Class: I.2.6

  27. arXiv:2104.12476  [pdf, other]

    cs.CV stat.ML

    EigenGAN: Layer-Wise Eigen-Learning for GANs

    Authors: Zhenliang He, Meina Kan, Shiguang Shan

    Abstract: Recent studies on Generative Adversarial Network (GAN) reveal that different layers of a generative CNN hold different semantics of the synthesized images. However, few GAN models have explicit dimensions to control the semantic attributes represented in a specific layer. This paper proposes EigenGAN which is able to unsupervisedly mine interpretable and controllable dimensions from different gene…

    Submitted 9 August, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: ICCV 2021. Code: https://github.com/LynnHo/EigenGAN-Tensorflow

  28. arXiv:2103.17236  [pdf, other]

    stat.ML cs.LG math.NA

    High-Dimensional Uncertainty Quantification via Tensor Regression with Rank Determination and Adaptive Sampling

    Authors: Zichang He, Zheng Zhang

    Abstract: Fabrication process variations can significantly influence the performance and yield of nano-scale electronic and photonic circuits. Stochastic spectral methods have achieved great success in quantifying the impact of process variations, but they suffer from the curse of dimensionality. Recently, low-rank tensor methods have been developed to mitigate this issue, but two fundamental challenges rem…

    Submitted 27 June, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: 12 pages, accepted by IEEE Trans. Components, Packaging and Manufacturing Technology

  29. arXiv:2103.12345  [pdf, other]

    stat.ML cs.LG q-fin.PM

    The Success of AdaBoost and Its Application in Portfolio Management

    Authors: Yijian Chuan, Chaoyi Zhao, Zhenrui He, Lan Wu

    Abstract: We develop a novel approach to explain why AdaBoost is a successful classifier. By introducing a measure of the influence of the noise points (ION) in the training data for the binary classification problem, we prove that there is a strong connection between the ION and the test error. We further identify that the ION of AdaBoost decreases as the iteration number or the complexity of the base lear…

    Submitted 23 March, 2021; originally announced March 2021.

  30. arXiv:2007.13140   

    stat.ML cs.LG stat.CO

    Fully Bayesian Analysis of the Relevance Vector Machine Classification for Imbalanced Data

    Authors: Wenyang Wang, Dongchu Sun, Zhuoqiong He

    Abstract: Relevance Vector Machine (RVM) is a supervised learning algorithm extended from Support Vector Machine (SVM) based on the Bayesian sparsity model. Compared with the regression problem, RVM classification is difficult to be conducted because there is no closed-form solution for the weight parameter posterior. Original RVM classification algorithm used Newton's method in optimization to obtain the m…

    Submitted 27 October, 2022; v1 submitted 26 July, 2020; originally announced July 2020.

    Comments: The extended and final version of this paper has been published with open access modality in the CAAI Transactions on Intelligence Technology and can be found at link https://ietresearch.onlinelibrary.wiley.com/doi/full/10.1049/cit2.12111. Please refer to the TRIT published version in your scientific papers

  31. arXiv:2007.12336  [pdf, other]

    cs.LG cs.CR stat.ML

    T-BFA: Targeted Bit-Flip Adversarial Weight Attack

    Authors: Adnan Siraj Rakin, Zhezhi He, Jingtao Li, Fan Yao, Chaitali Chakrabarti, Deliang Fan

    Abstract: Traditional Deep Neural Network (DNN) security is mostly related to the well-known adversarial input example attack. Recently, another dimension of adversarial attack, namely, attack on DNN weight parameters, has been shown to be very powerful. As a representative one, the Bit-Flip-based adversarial weight Attack (BFA) injects an extremely small amount of faults into weight parameters to hijack th…

    Submitted 7 January, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

  32. arXiv:2004.03481  [pdf, other]

    cs.SI cs.CY stat.AP

    Routine pattern discovery and anomaly detection in individual travel behavior

    Authors: Lijun Sun, Xinyu Chen, Zhaocheng He, Luis F. Miranda-Moreno

    Abstract: Discovering patterns and detecting anomalies in individual travel behavior is a crucial problem in both research and practice. In this paper, we address this problem by building a probabilistic framework to model individual spatiotemporal travel behavior data (e.g., trip records and trajectory data). We develop a two-dimensional latent Dirichlet allocation (LDA) model to characterize the generativ…

    Submitted 5 April, 2020; originally announced April 2020.

    Journal ref: Networks and Spatial Economics (2021)

  33. arXiv:2004.02359  [pdf, other]

    cs.LG stat.ME stat.ML

    Deep Neural Network in Cusp Catastrophe Model

    Authors: Ranadeep Daw, Zhuoqiong He

    Abstract: Catastrophe theory was originally proposed to study dynamical systems that exhibit sudden shifts in behavior arising from small changes in input. These models can generate reasonable explanation behind abrupt jumps in nonlinear dynamic models. Among the different catastrophe models, the Cusp Catastrophe model attracted the most attention due to its relatively simpler dynamics and rich domain of a…

    Submitted 21 April, 2020; v1 submitted 5 April, 2020; originally announced April 2020.

  34. arXiv:2002.12663  [pdf, other]

    cs.LG cs.CV stat.ML

    HOTCAKE: Higher Order Tucker Articulated Kernels for Deeper CNN Compression

    Authors: Rui Lin, Ching-Yun Ko, Zhuolun He, Cong Chen, Yuan Cheng, Hao Yu, Graziano Chesi, Ngai Wong

    Abstract: The emerging edge computing has promoted immense interests in compacting a neural network without sacrificing much accuracy. In this regard, low-rank tensor decomposition constitutes a powerful tool to compress convolutional neural networks (CNNs) by decomposing the 4-way kernel tensor into multi-stage smaller ones. Building on top of Tucker-2 decomposition, we propose a generalized Higher Order T…

    Submitted 28 February, 2020; originally announced February 2020.

    Comments: 6 pages, 5 figures

  35. arXiv:2001.06325  [pdf, other]

    cs.LG stat.ML

    Universal Adversarial Attack on Attention and the Resulting Dataset DAmageNet

    Authors: Sizhe Chen, Zhengbao He, Chengjin Sun, Jie Yang, Xiaolin Huang

    Abstract: Adversarial attacks on deep neural networks (DNNs) have been found for several years. However, the existing adversarial attacks have high success rates only when the information of the victim DNN is well-known or could be estimated by the structure similarity or massive queries. In this paper, we propose to Attack on Attention (AoA), a semantic property commonly shared by DNNs. AoA enjoys a signif…

    Submitted 21 October, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  36. arXiv:1912.07160  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    DAmageNet: A Universal Adversarial Dataset

    Authors: Sizhe Chen, Xiaolin Huang, Zhengbao He, Chengjin Sun

    Abstract: It is now well known that deep neural networks (DNNs) are vulnerable to adversarial attack. Adversarial samples are similar to the clean ones, but are able to cheat the attacked DNN to produce incorrect predictions in high confidence. But most of the existing adversarial attacks have high success rate only when the information of the attacked DNN is well-known or could be estimated by massive quer…

    Submitted 15 December, 2019; originally announced December 2019.

  37. arXiv:1910.11148  [pdf]

    eess.IV cs.CV cs.LG stat.ML

    Learning Priors in High-frequency Domain for Inverse Imaging Reconstruction

    Authors: Zhuonan He, Jinjie Zhou, Dong Liang, Yuhao Wang, Qiegen Liu

    Abstract: Ill-posed inverse problems in imaging remain an active research topic in several decades, with new approaches constantly emerging. Recognizing that the popular dictionary learning and convolutional sparse coding are both essentially modeling the high-frequency component of an image, which convey most of the semantic information such as texture details, in this work we propose a novel multi-profile…

    Submitted 23 October, 2019; originally announced October 2019.

  38. arXiv:1910.10897  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

    Authors: Tianhe Yu, Deirdre Quillen, Zhanpeng He, Ryan Julian, Avnish Narayan, Hayden Shively, Adithya Bellathur, Karol Hausman, Chelsea Finn, Sergey Levine

    Abstract: Meta-reinforcement learning algorithms can enable robots to acquire new skills much more quickly, by leveraging prior experience to learn how to learn. However, much of the current research on meta-reinforcement learning focuses on task distributions that are very narrow. For example, a commonly used meta-reinforcement learning benchmark uses different running velocities for a simulated robot as d…

    Submitted 14 June, 2021; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: This is an update version of a manuscript that originally appeared at CoRL 2019. Videos are here: meta-world.github.io, open-sourced code are available at: https://github.com/rlworkgroup/metaworld, and the baselines can be found at https://github.com/rlworkgroup/garage

  39. arXiv:1909.09148  [pdf, other]

    cs.LG cs.CV stat.ML

    Data Augmentation Revisited: Rethinking the Distribution Gap between Clean and Augmented Data

    Authors: Zhuoxun He, Lingxi Xie, Xin Chen, Ya Zhang, Yanfeng Wang, Qi Tian

    Abstract: Data augmentation has been widely applied as an effective methodology to improve generalization in particular when training deep neural networks. Recently, researchers proposed a few intensive data augmentation techniques, which indeed improved accuracy, yet we notice that these methods augment data have also caused a considerable gap between clean and augmented data. In this paper, we revisit thi…

    Submitted 21 November, 2019; v1 submitted 19 September, 2019; originally announced September 2019.

  40. arXiv:1909.02902  [pdf, other]

    cs.LG cs.CV stat.ML

    Dynamic Spatial-Temporal Representation Learning for Traffic Flow Prediction

    Authors: Lingbo Liu, Jiajie Zhen, Guanbin Li, Geng Zhan, Zhaocheng He, Bowen Du, Liang Lin

    Abstract: As a crucial component in intelligent transportation systems, traffic flow prediction has recently attracted widespread research interest in the field of artificial intelligence (AI) with the increasing availability of massive traffic mobility data. Its key challenge lies in how to integrate diverse factors (such as temporal rules and spatial dependencies) to infer the evolution trend of traffic f…

    Submitted 12 June, 2020; v1 submitted 1 September, 2019; originally announced September 2019.

    Comments: Accepted by IEEE Transactions on Intelligent Transportation Systems. arXiv admin note: text overlap with arXiv:1809.00101

  41. arXiv:1908.07232  [pdf, other]

    math.NA stat.CO

    Sensitivity estimation of conditional value at risk using randomized quasi-Monte Carlo

    Authors: Zhijian He

    Abstract: Conditional value at risk (CVaR) is a popular measure for quantifying portfolio risk. Sensitivity analysis of CVaR is very useful in risk management and gradient-based optimization algorithms. In this paper, we study the infinitesimal perturbation analysis estimator for CVaR sensitivity using randomized quasi-Monte Carlo (RQMC) simulation. We first prove that the RQMC-based estimator is strongly c…

    Submitted 21 September, 2020; v1 submitted 20 August, 2019; originally announced August 2019.

  42. arXiv:1908.06951  [pdf, ps, other]

    stat.ML cs.LG

    Gradient Boosting Machine: A Survey

    Authors: Zhiyuan He, Danchen Lin, Thomas Lau, Mike Wu

    Abstract: In this survey, we discuss several different types of gradient boosting algorithms and illustrate their mathematical frameworks in detail: 1. introduction of gradient boosting leads to 2. objective function optimization, 3. loss function estimations, and 4. model constructions. 5. application of boosting in ranking.

    Submitted 19 August, 2019; originally announced August 2019.

  43. arXiv:1907.06356  [pdf, other]

    cs.LG eess.SP stat.ML

    Motorway Traffic Flow Prediction using Advanced Deep Learning

    Authors: Adriana-Simona Mihaita, Haowen Li, Zongyang He, Marian-Andrei Rizoiu

    Abstract: Congestion prediction represents a major priority for traffic management centres around the world to ensure timely incident response handling. The increasing amounts of generated traffic data have been used to train machine learning predictors for traffic, however this is a challenging task due to inter-dependencies of traffic flow both in time and space. Recently, deep learning techniques have sh…

    Submitted 16 July, 2019; v1 submitted 15 July, 2019; originally announced July 2019.

    Comments: Published in the Proceedings of the 22nd IEEE Intelligent Transportation Systems Conference (ITSC'19). Auckland, New Zealand

  44. arXiv:1907.02124  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Non-Structured DNN Weight Pruning -- Is It Beneficial in Any Platform?

    Authors: Xiaolong Ma, Sheng Lin, Shaokai Ye, Zhezhi He, Linfeng Zhang, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma, Yanzhi Wang

    Abstract: Large deep neural network (DNN) models pose the key challenge to energy efficiency due to the significantly higher energy consumption of off-chip DRAM accesses than arithmetic or SRAM operations. It motivates the intensive research on model compression with two main approaches. Weight pruning leverages the redundancy in the number of weights and can be performed in a non-structured, which has high…

    Submitted 7 January, 2020; v1 submitted 3 July, 2019; originally announced July 2019.

  45. arXiv:1906.10163  [pdf]

    stat.AP

    Assessing the Validity of a a priori Patient-Trial Generalizability Score using Real-world Data from a Large Clinical Data Research Network: A Colorectal Cancer Clinical Trial Case Study

    Authors: Qian Li, Zhe He, Yi Guo, Hansi Zhang, Thomas J George Jr, William Hogan, Neil Charness, Jiang Bian

    Abstract: Existing trials had not taken enough consideration of their population representativeness, which can lower the effectiveness when the treatment is applied in real-world clinical practice. We analyzed the eligibility criteria of Bevacizumab colorectal cancer treatment trials, assessed their a priori generalizability, and examined how it affects patient outcomes when applied in real-world clinical s…

    Submitted 24 June, 2019; originally announced June 2019.

  46. arXiv:1906.04734  [pdf]

    cs.LG stat.ML

    Incremental Classifier Learning Based on PEDCC-Loss and Cosine Distance

    Authors: Qiuyu Zhu, Zikuang He, Xin Ye

    Abstract: The main purpose of incremental learning is to learn new knowledge while not forgetting the knowledge which have been learned before. At present, the main challenge in this area is the catastrophe forgetting, namely the network will lose their performance in the old tasks after training for new tasks. In this paper, we introduce an ensemble method of incremental classifier to alleviate this proble…

    Submitted 11 June, 2019; originally announced June 2019.

  47. arXiv:1905.12469  [pdf]

    cs.CY stat.ML

    Understanding Perceptions and Attitudes in Breast Cancer Discussions on Twitter

    Authors: Francois Modave, Yunpeng Zhao, Janice Krieger, Zhe He, Yi Guo, Jinhai Huo, Mattia Prosperi, Jiang Bian

    Abstract: Among American women, the rate of breast cancer is only second to lung cancer. An estimated 12.4% women will develop breast cancer over the course of their lifetime. The widespread use of social media across the socio-economic spectrum offers unparalleled ways to facilitate information sharing, in particular as it pertains to health. Social media is also used by many healthcare stakeholders, rangi…

    Submitted 22 May, 2019; originally announced May 2019.

    Comments: 5 pages, 10 figures, The 17th World Congress of Medical and Health Informatics

  48. Reference-Based Sequence Classification

    Authors: Zengyou He, Guangyao Xu, Chaohua Sheng, Bo Xu, Quan Zou

    Abstract: Sequence classification is an important data mining task in many real world applications. Over the past few decades, many sequence classification methods have been proposed from different aspects. In particular, the pattern-based method is one of the most important and widely studied sequence classification methods in the literature. In this paper, we present a reference-based sequence classificat…

    Submitted 13 December, 2020; v1 submitted 17 May, 2019; originally announced May 2019.

    Journal ref: in IEEE Access, vol. 8, pp. 218199-218214, 2020

  49. arXiv:1905.05849  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Consensus-based Interpretable Deep Neural Networks with Application to Mortality Prediction

    Authors: Shaeke Salman, Seyedeh Neelufar Payrovnaziri, Xiuwen Liu, Pablo Rengifo-Moreno, Zhe He

    Abstract: Deep neural networks have achieved remarkable success in various challenging tasks. However, the black-box nature of such networks is not acceptable to critical applications, such as healthcare. In particular, the existence of adversarial examples and their overgeneralization to irrelevant, out-of-distribution inputs with high confidence makes it difficult, if not impossible, to explain decisions…

    Submitted 11 September, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: 8 pages, 6 figures

  50. arXiv:1904.12383  [pdf]

    cs.LG cs.AI stat.ML

    Enhancing Prediction Models for One-Year Mortality in Patients with Acute Myocardial Infarction and Post Myocardial Infarction Syndrome

    Authors: Seyedeh Neelufar Payrovnaziri, Laura A. Barrett, Daniel Bis, Jiang Bian, Zhe He

    Abstract: Predicting the risk of mortality for patients with acute myocardial infarction (AMI) using electronic health records (EHRs) data can help identify risky patients who might need more tailored care. In our previous work, we built computational models to predict one-year mortality of patients admitted to an intensive care unit (ICU) with AMI or post myocardial infarction syndrome. Our prior work only…

    Submitted 28 April, 2019; originally announced April 2019.