
Showing 1–50 of 61 results for author: Zhou, C

Searching in archive stat. Search in all archives.
  1. arXiv:2404.05185  [pdf, other]

    math.OC cs.LG math.PR stat.ML

    Convergence analysis of controlled particle systems arising in deep learning: from finite to infinite sample size

    Authors: Huafu Liao, Alpár R. Mészáros, Chenchen Mou, Chao Zhou

    Abstract: This paper deals with a class of neural SDEs and studies the limiting behavior of the associated sampled optimal control problems as the sample size grows to infinity. The neural SDEs with N samples can be linked to the N-particle systems with centralized control. We analyze the Hamilton--Jacobi--Bellman equation corresponding to the N-particle system and establish regularity results which are uni… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 45 pages, 2 figures

    MSC Class: 49N80; 65C35; 49L12; 62M45

  2. arXiv:2312.06050  [pdf, other]

    cs.LG eess.IV stat.ML

    Federated Multilinear Principal Component Analysis with Applications in Prognostics

    Authors: Chengyu Zhou, Yuqi Su, Tangbin Xia, Xiaolei Fang

    Abstract: Multilinear Principal Component Analysis (MPCA) is a widely utilized method for the dimension reduction of tensor data. However, the integration of MPCA into federated learning remains unexplored in existing research. To tackle this gap, this article proposes a Federated Multilinear Principal Component Analysis (FMPCA) method, which enables multiple users to collaboratively reduce the dimension of… ▽ More

    Submitted 28 April, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  3. arXiv:2310.03351  [pdf, ps, other]

    stat.CO stat.ME

    Efficiently analyzing large patient registries with Bayesian joint models for longitudinal and time-to-event data

    Authors: P. Miranda Afonso, D. Rizopoulos, A. K. Palipana, G. C. Zhou, C. Brokamp, R. D. Szczesniak, E-R. Andrinopoulou

    Abstract: The joint modeling of longitudinal and time-to-event outcomes has become a popular tool in follow-up studies. However, fitting Bayesian joint models to large datasets, such as patient registries, can require extended computing times. To speed up sampling, we divided a patient registry dataset into subsamples, analyzed them in parallel, and combined the resulting Markov chain Monte Carlo draws into… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  4. arXiv:2307.15004  [pdf, other]

    stat.ME math.ST

    Graphical lasso for extremes

    Authors: Phyllis Wan, Chen Zhou

    Abstract: In this paper we estimate the sparse dependence structure in the tail region of a multivariate random vector, potentially of high dimension. The tail dependence is modeled via a graphical model for extremes embedded in the Huesler-Reiss distribution (Engelke and Hitz, 2020). We propose the extreme graphical lasso procedure to estimate the sparsity in the tail dependence, similar to the Gaussian gr… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    MSC Class: 62G32; 62H12; 62F12

  5. arXiv:2210.12618  [pdf, other]

    stat.ME stat.AP

    Estimating probabilities of multivariate failure sets based on pairwise tail dependence coefficients

    Authors: Anna Kiriliouk, Chen Zhou

    Abstract: An important problem in extreme-value theory is the estimation of the probability that a high-dimensional random vector falls into a given extreme failure set. This paper provides a parametric approach to this problem, based on a generalization of the tail pairwise dependence matrix (TPDM). The TPDM gives a partial summary of tail dependence for all pairs of components of the random vector. We pro… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    MSC Class: 62F10; 62H10; 62H12; 60G70

    ACM Class: G.3

  6. arXiv:2207.11353  [pdf, other]

    cs.LG eess.IV stat.ML

    A Supervised Tensor Dimension Reduction-Based Prognostics Model for Applications with Incomplete Imaging Data

    Authors: Chengyu Zhou, Xiaolei Fang

    Abstract: This paper proposes a supervised dimension reduction methodology for tensor data which has two advantages over most image-based prognostic models. First, the model does not require tensor data to be complete which expands its application to incomplete data. Second, it utilizes time-to-failure (TTF) to supervise the extraction of low-dimensional features which makes the extracted features more effe… ▽ More

    Submitted 4 June, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: 42 pages, 17 figures

  7. arXiv:2112.10329  [pdf, ps, other]

    stat.ME

    Adapting the Hill estimator to distributed inference: dealing with the bias

    Authors: Liujun Chen, Deyuan Li, Chen Zhou

    Abstract: The distributed Hill estimator is a divide-and-conquer algorithm for estimating the extreme value index when data are stored in multiple machines. In applications, estimates based on the distributed Hill estimator can be sensitive to the choice of the number of the exceedance ratios used in each machine. Even when choosing the number at a low level, a high asymptotic bias may arise. We overcome th… ▽ More

    Submitted 19 December, 2021; originally announced December 2021.

  8. arXiv:2112.06868  [pdf, other]

    cs.LG stat.ML

    Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias

    Authors: Frederic Koehler, Viraj Mehta, Chenghui Zhou, Andrej Risteski

    Abstract: Variational Autoencoders are one of the most commonly used generative models, particularly for image data. A prominent difficulty in training VAEs is data that is supported on a lower-dimensional manifold. Recent work by Dai and Wipf (2020) proposes a two-stage training algorithm for VAEs, based on a conjecture that in standard VAE training the generator will converge to a solution with 0 variance… ▽ More

    Submitted 17 May, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted as a conference paper at ICLR 2022

  9. arXiv:2111.11676  [pdf, other]

    stat.ML cs.LG

    RIO: Rotation-equivariance supervised learning of robust inertial odometry

    Authors: Caifa Zhou, Xiya Cao, Dandan Zeng, Yongliang Wang

    Abstract: This paper introduces rotation-equivariance as a self-supervisor to train inertial odometry models. We demonstrate that the self-supervised scheme provides a powerful supervisory signal at training phase as well as at inference stage. It reduces the reliance on massive amounts of labeled data for training a robust model and makes it possible to update the model using various unlabeled data. Furthe… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 12 pages, 17 figures, 2 tables

  10. arXiv:2108.01432  [pdf, other]

    math.ST stat.ME

    Tail inverse regression for dimension reduction with extreme response

    Authors: Anass Aghbalou, François Portier, Anne Sabourin, Chen Zhou

    Abstract: We consider the problem of supervised dimension reduction with a particular focus on extreme values of the target $Y\in\mathbb{R}$ to be explained by a covariate vector $X \in \mathbb{R}^p$. The general purpose is to define and estimate a projection on a lower dimensional subspace of the covariate space which is sufficient for predicting exceedances of the target above high thresholds. We propose… ▽ More

    Submitted 24 February, 2023; v1 submitted 30 July, 2021; originally announced August 2021.

    Comments: main paper: 31 pages + supplementary material: 16 pages

    MSC Class: 62G32; 62H25; 62G08; 62G30

  11. arXiv:2108.01327  [pdf, ps, other]

    stat.ME

    Distributed Inference for Tail Risk

    Authors: Liujun Chen, Deyuan Li, Chen Zhou

    Abstract: For measuring tail risk with scarce extreme events, extreme value analysis is often invoked as the statistical tool to extrapolate to the tail of a distribution. The presence of large datasets benefits tail risk analysis by providing more observations for conducting extreme value analysis. However, large datasets can be stored distributedly preventing the possibility of directly analyzing them. In… ▽ More

    Submitted 15 December, 2023; v1 submitted 3 August, 2021; originally announced August 2021.

  12. arXiv:2103.14224  [pdf, other]

    stat.ML cs.LG

    Active multi-fidelity Bayesian online changepoint detection

    Authors: Gregory W. Gundersen, Diana Cai, Chuteng Zhou, Barbara E. Engelhardt, Ryan P. Adams

    Abstract: Online algorithms for detecting changepoints, or abrupt shifts in the behavior of a time series, are often deployed with limited resources, e.g., to edge computing settings such as mobile phones or industrial sensors. In these scenarios it may be beneficial to trade the cost of collecting an environmental measurement against the quality or "fidelity" of this measurement and how the measurement aff… ▽ More

    Submitted 25 July, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

    Comments: 37th Conference on Uncertainty in Artificial Intelligence

  13. arXiv:2103.11125  [pdf, other]

    stat.AP

    Mining geometric constraints from crowd-sourced radio signals and its application to indoor positioning

    Authors: Caifa Zhou, Zhi Li, Dandan Zeng, Yongliang Wang

    Abstract: Crowd-sourcing has become a promising way to build a feature-based indoor positioning system that has lower labour and time costs. It can make full use of the widely deployed infrastructure as well as built-in sensors on mobile devices. One of the key challenges is to generate the reference feature map (RFM), a database used for localization, by aligning crowd-sourced trajectories according to… ▽ More

    Submitted 20 March, 2021; originally announced March 2021.

    Comments: 20 pages, 11 figures, accepted for publication in IEEE Access

  14. arXiv:2103.00959  [pdf, other]

    cs.SI cs.LG stat.ML

    CogDL: A Comprehensive Library for Graph Deep Learning

    Authors: Yukuo Cen, Zhenyu Hou, Yan Wang, Qibin Chen, Yizhen Luo, Zhongming Yu, Hengrui Zhang, Xingcheng Yao, Aohan Zeng, Shiguang Guo, Yuxiao Dong, Yang Yang, Peng Zhang, Guohao Dai, Yu Wang, Chang Zhou, Hongxia Yang, Jie Tang

    Abstract: Graph neural networks (GNNs) have attracted tremendous attention from the graph learning community in recent years. It has been widely adopted in various real-world applications from diverse domains, such as social networks and biological graphs. The research and applications of graph deep learning present new challenges, including the sparse nature of graph data, complicated training of GNNs, and… ▽ More

    Submitted 17 April, 2023; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: Accepted to WWW 2023. Website: https://github.com/THUDM/cogdl

  15. arXiv:2005.12964  [pdf, other]

    cs.IR cs.LG cs.SI stat.ML

    Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems

    Authors: Chang Zhou, Jianxin Ma, Jianwei Zhang, Jingren Zhou, Hongxia Yang

    Abstract: Deep candidate generation (DCG) that narrows down the collection of relevant items from billions to hundreds via representation learning has become prevalent in industrial recommender systems. Standard approaches approximate maximum likelihood estimation (MLE) through sampling for better scalability and address the problem of DCG in a way similar to language modeling. However, live recommender sys… ▽ More

    Submitted 4 June, 2021; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: Accepted by the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

  16. arXiv:2005.09863  [pdf, other]

    cs.LG stat.ML

    Understanding Negative Sampling in Graph Representation Learning

    Authors: Zhen Yang, Ming Ding, Chang Zhou, Hongxia Yang, Jingren Zhou, Jie Tang

    Abstract: Graph representation learning has been extensively studied in recent years. Despite its potential in generating continuous embeddings for various networks, both the effectiveness and efficiency to infer high-quality representations toward large corpus of nodes are still challenging. Sampling is a critical point to achieve the performance goals. Prior arts usually focus on sampling positive node pa… ▽ More

    Submitted 25 June, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

    Comments: KDD 2020

  17. arXiv:2005.09347  [pdf, other]

    cs.IR cs.LG stat.ML

    Controllable Multi-Interest Framework for Recommendation

    Authors: Yukuo Cen, Jianwei Zhang, Xu Zou, Chang Zhou, Hongxia Yang, Jie Tang

    Abstract: Recently, neural networks have been widely used in e-commerce recommender systems, owing to the rapid development of deep learning. We formalize the recommender system as a sequential recommendation problem, intending to predict the next items that the user might interact with. Recent works usually give an overall embedding from a user's behavior sequence. However, a unified user embedding ca… ▽ More

    Submitted 2 August, 2020; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: Accepted to KDD 2020

  18. arXiv:2004.11848  [pdf]

    cs.CV cs.LG eess.IV stat.ML

    Deep learning for smart fish farming: applications, opportunities and challenges

    Authors: Xinting Yang, Song Zhang, Jintao Liu, Qinfeng Gao, Shuanglin Dong, Chao Zhou

    Abstract: With the rapid emergence of deep learning (DL) technology, it has been successfully used in various fields including aquaculture. This change can create new opportunities and a series of challenges for information and data processing in smart fish farming. This paper focuses on the applications of DL in aquaculture, including live fish identification, species classification, behavioral analysis, f… ▽ More

    Submitted 30 June, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 43 pages, 7 figures

    Journal ref: Reviews in Aquaculture, 2020

  19. Detecting Suspected Epidemic Cases Using Trajectory Big Data

    Authors: Chuansai Zhou, Wen Yuan, Jun Wang, Haiyong Xu, Yong Jiang, Xinmin Wang, Qiuzi Han Wen, Pingwen Zhang

    Abstract: Emerging infectious diseases are existential threats to human health and global stability. The recent outbreaks of the novel coronavirus COVID-19 have rapidly formed a global pandemic, causing hundreds of thousands of infections and huge economic loss. The WHO declares that more precise measures to track, detect and isolate infected people are among the most effective means to quickly contain the… ▽ More

    Submitted 15 April, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Journal ref: CSIAM Transactions on Applied Mathematics, 1 (2020), 186-206

  20. arXiv:2003.04265  [pdf, other]

    math.ST stat.AP

    Spatial dependence and space-time trend in extreme events

    Authors: John H. J. Einmahl, Ana Ferreira, Laurens de Haan, Claudia Neves, Chen Zhou

    Abstract: The statistical theory of extremes is extended to observations that are non-stationary and not independent. The non-stationarity over time and space is controlled via the scedasis (tail scale) in the marginal distributions. Spatial dependence stems from multivariate extreme value theory. We establish asymptotic theory for both the weighted sequential tail empirical process and the weighted tail qu… ▽ More

    Submitted 9 March, 2020; originally announced March 2020.

    Comments: Supporting information: the detailed proof of Theorem 6, referenced in Section 4, as well as simulations showcasing finite sample performance of the proposed methods are available with this paper at https://bit.ly/3aJFM6B

    MSC Class: 62G32; 62G30; 62G05; 62G10; 62G20; 60F17; 60G70

  21. arXiv:2003.02740  [pdf, other]

    cs.LG stat.ML

    Balance Between Efficient and Effective Learning: Dense2Sparse Reward Shaping for Robot Manipulation with Environment Uncertainty

    Authors: Yongle Luo, Kun Dong, Lili Zhao, Zhiyong Sun, Chao Zhou, Bo Song

    Abstract: Efficient and effective learning is one of the ultimate goals of the deep reinforcement learning (DRL), although the compromise has been made in most of the time, especially for the application of robot manipulations. Learning is always expensive for robot manipulation tasks and the learning effectiveness could be affected by the system uncertainty. In order to solve above challenges, in this stud… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

  22. arXiv:2001.10119  [pdf, other]

    cs.LG stat.ML

    Unsupervised Program Synthesis for Images By Sampling Without Replacement

    Authors: Chenghui Zhou, Chun-Liang Li, Barnabas Poczos

    Abstract: Program synthesis has emerged as a successful approach to the image parsing task. Most prior works rely on a two-step scheme involving supervised pretraining of a Seq2Seq model with synthetic programs followed by reinforcement learning (RL) for fine-tuning with real reference images. Fully unsupervised approaches promise to train the model directly on the target images without requiring curated pr… ▽ More

    Submitted 14 June, 2021; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted to UAI 2021

    Journal ref: UAI 2021

  23. arXiv:2001.04974  [pdf, other]

    cs.LG cs.AI cs.AR stat.ML

    Noisy Machines: Understanding Noisy Neural Networks and Enhancing Robustness to Analog Hardware Errors Using Distillation

    Authors: Chuteng Zhou, Prad Kadambi, Matthew Mattina, Paul N. Whatmough

    Abstract: The success of deep learning has brought forth a wave of interest in computer hardware design to better meet the high demands of neural network inference. In particular, analog computing hardware has been heavily motivated specifically for accelerating neural networks, based on either electronic, optical or photonic devices, which may well achieve lower power consumption than conventional digital… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

  24. arXiv:2001.02734  [pdf]

    stat.AP

    Gasoline Pricing Policies for Transportation Safety

    Authors: Nima Safaei, Chao Zhou

    Abstract: Economic factors can have substantial effects on transportation crash trends. This study makes a comprehensive examination of the relationship between the retail gasoline price (including state and federal fuel taxes) and transportation fatal crashes from 2007 to 2016 in the US. Data on motor vehicle, bicycle and pedestrian fatal crashes come from Fatality Analysis Reporting System (FARS) provided… ▽ More

    Submitted 8 January, 2020; originally announced January 2020.

    Comments: 19 pages, 1 figure, 3 tables

  25. arXiv:1912.09301  [pdf, other]

    stat.ML cs.LG eess.SP

    Feature-wise change detection and robust indoor positioning using RANSAC-like approach

    Authors: Caifa Zhou

    Abstract: Fingerprinting-based positioning, one of the promising indoor positioning solutions, has been broadly explored owing to the pervasiveness of sensor-rich mobile devices, the prosperity of opportunistically measurable location-relevant signals and the progress of data-driven algorithms. One critical challenge is to control and improve the quality of the reference fingerprint map (RFM), which is built… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: 36 pages, 20 figures, 2 tables

  26. arXiv:1912.00602  [pdf, other]

    cs.LG stat.ML

    ExperienceThinking: Constrained Hyperparameter Optimization based on Knowledge and Pruning

    Authors: Chunnan Wang, Hongzhi Wang, Chang Zhou, Hanxiao Chen

    Abstract: Machine learning algorithms are very sensitive to the hyperparameters, and their evaluations are generally expensive. Users desperately need intelligent methods to quickly optimize hyperparameter settings according to known evaluation information, and thus reduce computational cost and promote optimization efficiency. Motivated by this, we propose ExperienceThinking algorithm to quickly find the b… ▽ More

    Submitted 4 May, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

  27. arXiv:1911.08820  [pdf, other]

    cs.LG stat.ML

    A Fast Sampling Gradient Tree Boosting Framework

    Authors: Daniel Chao Zhou, Zhongming Jin, Tong Zhang

    Abstract: As an adaptive, interpretable, robust, and accurate meta-algorithm for arbitrary differentiable loss functions, gradient tree boosting is one of the most popular machine learning techniques, though the computational expensiveness severely limits its usage. Stochastic gradient boosting could be adopted to accelerate gradient boosting by uniformly sampling training instances, but its estimator coul… ▽ More

    Submitted 20 November, 2019; originally announced November 2019.

  28. arXiv:1910.14238  [pdf, other]

    cs.LG cs.IR stat.ML

    Learning Disentangled Representations for Recommendation

    Authors: Jianxin Ma, Chang Zhou, Peng Cui, Hongxia Yang, Wenwu Zhu

    Abstract: User behavior data in recommender systems are driven by the complex interactions of many latent factors behind the users' decision making processes. The factors are highly entangled, and may range from high-level ones that govern user intentions, to low-level ones that characterize a user's preference when executing an intention. Learning representations that uncover and disentangle these latent f… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: To appear in the Proceedings of the Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019)

  29. arXiv:1910.02558  [pdf, other]

    cs.LG stat.ML

    Pushing the limits of RNN Compression

    Authors: Urmish Thakker, Igor Fedorov, Jesse Beu, Dibakar Gope, Chu Zhou, Ganesh Dasika, Matthew Mattina

    Abstract: Recurrent Neural Networks (RNN) can be difficult to deploy on resource constrained devices due to their size. As a result, there is a need for compression techniques that can significantly compress RNNs without negatively impacting task accuracy. This paper introduces a method to compress RNNs for resource constrained environments using Kronecker product (KP). KPs can compress RNN layers by 16-38x… ▽ More

    Submitted 9 October, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: 6 pages. arXiv admin note: substantial text overlap with arXiv:1906.02876

    Journal ref: 5th edition of Workshop on Energy Efficient Machine Learning and Cognitive Computing at NeurIPS 2019

  30. arXiv:1909.08417  [pdf, other]

    cs.LG cs.CG math.AT stat.ML

    Persistence B-Spline Grids: Stable Vector Representation of Persistence Diagrams Based on Data Fitting

    Authors: Zhetong Dong, Hongwei Lin, Chi Zhou

    Abstract: Many attempts have been made in recent decades to integrate machine learning (ML) and topological data analysis. A prominent problem in applying persistent homology to ML tasks is finding a vector representation of a persistence diagram (PD), which is a summary diagram for representing topological features. From the perspective of data fitting, a stable vector representation, namely, persistence B… ▽ More

    Submitted 22 April, 2022; v1 submitted 17 September, 2019; originally announced September 2019.

  31. arXiv:1907.05743  [pdf]

    cs.LG stat.ML

    Semi-Supervised Graph Embedding for Multi-Label Graph Node Classification

    Authors: Kaisheng Gao, Jing Zhang, Cangqi Zhou

    Abstract: The graph convolution network (GCN) is a widely-used facility to realize graph-based semi-supervised learning, which usually integrates node features and graph topologic information to build learning models. However, as for multi-label learning tasks, the supervision part of GCN simply minimizes the cross-entropy loss between the last layer outputs and the ground-truth label distribution, which te… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

    Comments: 12 pages

  32. arXiv:1907.02237  [pdf, other]

    cs.LG stat.ML

    Dimensional Reweighting Graph Convolutional Networks

    Authors: Xu Zou, Qiuye Jia, Jianwei Zhang, Chang Zhou, Hongxia Yang, Jie Tang

    Abstract: Graph Convolution Networks (GCNs) are becoming more and more popular for learning node representations on graphs. Though there exist various developments on sampling and aggregation to accelerate the training process and improve the performances, limited works focus on dealing with the dimensional information imbalance of node representations. To bridge the gap, we propose a method named Dimension… ▽ More

    Submitted 29 October, 2020; v1 submitted 4 July, 2019; originally announced July 2019.

    Comments: We decide to drastically modify the article so we don't wish to let this outdated version continue to confuse readers

  33. arXiv:1906.05489  [pdf, other]

    cs.LG cs.CL stat.ML

    Cognitive Knowledge Graph Reasoning for One-shot Relational Learning

    Authors: Zhengxiao Du, Chang Zhou, Ming Ding, Hongxia Yang, Jie Tang

    Abstract: Inferring new facts from existing knowledge graphs (KG) with explainable reasoning processes is a significant problem and has received much attention recently. However, few studies have focused on relation types unseen in the original KG, given only one or a few instances for training. To bridge this gap, we propose CogKR for one-shot KG reasoning. The one-shot relational learning problem is tackl… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  34. arXiv:1906.02876  [pdf, other]

    cs.LG cs.NE stat.ML

    Compressing RNNs for IoT devices by 15-38x using Kronecker Products

    Authors: Urmish Thakker, Jesse Beu, Dibakar Gope, Chu Zhou, Igor Fedorov, Ganesh Dasika, Matthew Mattina

    Abstract: Recurrent Neural Networks (RNN) can be difficult to deploy on resource constrained devices due to their size. As a result, there is a need for compression techniques that can significantly compress RNNs without negatively impacting task accuracy. This paper introduces a method to compress RNNs for resource constrained environments using Kronecker product (KP). KPs can compress RNN layers by 15-38x… ▽ More

    Submitted 31 January, 2020; v1 submitted 6 June, 2019; originally announced June 2019.

  35. arXiv:1905.08022  [pdf, other]

    cs.LG stat.AP stat.ML

    An iterative scheme for feature based positioning using a weighted dissimilarity measure

    Authors: Caifa Zhou, Andreas Wieser

    Abstract: We propose an iterative scheme for feature-based positioning using a new weighted dissimilarity measure with the goal of reducing the impact of large errors among the measured or modeled features. The weights are computed from the location-dependent standard deviations of the features and stored as part of the reference fingerprint map (RFM). Spatial filtering and kernel smoothing of the kinematic… ▽ More

    Submitted 30 May, 2019; v1 submitted 20 May, 2019; originally announced May 2019.

    Comments: 18 pages, 9 figures, and 1 table

  36. arXiv:1904.09981  [pdf, other]

    cs.LG cs.AI stat.ML

    GraphNAS: Graph Neural Architecture Search with Reinforcement Learning

    Authors: Yang Gao, Hong Yang, Peng Zhang, Chuan Zhou, Yue Hu

    Abstract: Graph Neural Networks (GNNs) have been popularly used for analyzing non-Euclidean data such as social network data and biological data. Despite their success, the design of graph neural networks requires a lot of manual work and domain knowledge. In this paper, we propose a Graph Neural Architecture Search method (GraphNAS for short) that enables automatic search of the best graph neural architect… ▽ More

    Submitted 19 August, 2019; v1 submitted 22 April, 2019; originally announced April 2019.

  37. arXiv:1902.11128  [pdf, other]

    cs.CV cs.AR cs.LG stat.ML

    FixyNN: Efficient Hardware for Mobile Computer Vision via Transfer Learning

    Authors: Paul N. Whatmough, Chuteng Zhou, Patrick Hansen, Shreyas Kolala Venkataramanaiah, Jae-sun Seo, Matthew Mattina

    Abstract: The computational demands of computer vision tasks based on state-of-the-art Convolutional Neural Network (CNN) image classification far exceed the energy budgets of mobile devices. This paper proposes FixyNN, which consists of a fixed-weight feature extractor that generates ubiquitous CNN features, and a conventional programmable CNN accelerator which processes a dataset-specific CNN. Image class… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: 10 pages, 8 figures, paper accepted at SysML2019 conference

  38. arXiv:1901.03328  [pdf, other]

    stat.AP eess.SP

    Modified Jaccard Index Analysis and Adaptive Feature Selection for Location Fingerprinting with Limited Computational Complexity

    Authors: Caifa Zhou, Andreas Wieser

    Abstract: We propose an approach for fingerprinting-based positioning which reduces the data requirements and computational complexity of the online positioning stage. It is based on a segmentation of the entire region of interest into subregions, identification of candidate subregions during the online-stage, and position estimation using a preselected subset of relevant features. The subregion selection u… ▽ More

    Submitted 10 January, 2019; originally announced January 2019.

    Comments: 15 pages, 10 figures, 10 tables, revised version for publishing to TLBS. arXiv admin note: text overlap with arXiv:1711.07812

  39. arXiv:1901.01498  [pdf, other]

    cs.LG cs.CV stat.ML

    MAE: Mutual Posterior-Divergence Regularization for Variational AutoEncoders

    Authors: Xuezhe Ma, Chunting Zhou, Eduard Hovy

    Abstract: Variational Autoencoder (VAE), a simple and effective deep generative model, has led to a number of impressive empirical successes and spawned many advanced variants and theoretical investigations. However, recent studies demonstrate that, when equipped with expressive generative distributions (a.k.a. decoders), VAE suffers from learning uninformative latent representations with the observation call… ▽ More

    Submitted 5 January, 2019; originally announced January 2019.

    Comments: Published at ICLR-2019. 12 pages contents + 4 pages appendix, 5 figures

  40. arXiv:1812.01672  [pdf, other]

    cs.LG stat.ML

    Energy Efficient Hardware for On-Device CNN Inference via Transfer Learning

    Authors: Paul Whatmough, Chuteng Zhou, Patrick Hansen, Matthew Mattina

    Abstract: On-device CNN inference for real-time computer vision applications can result in computational demands that far exceed the energy budgets of mobile devices. This paper proposes FixyNN, a co-designed hardware accelerator platform which splits a CNN model into two parts: a set of layers that are fixed in the hardware platform as a front-end fixed-weight feature extractor, and the remaining layers wh… ▽ More

    Submitted 26 February, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: 4 pages, 2 figures, NeurIPS 2018 on-device ML workshop

  41. arXiv:1811.04277  [pdf, other]

    stat.ML cs.LG

    Anomaly Detection via Graphical Lasso

    Authors: Haitao Liu, Randy C. Paffenroth, Jian Zou, Chong Zhou

    Abstract: Anomalies and outliers are common in real-world data, and they can arise from many sources, such as sensor faults. Accordingly, anomaly detection is important both for analyzing the anomalies themselves and for cleaning the data for further analysis of its ambient structure. Nonetheless, a precise definition of anomalies is important for automated detection and herein we approach such problems fro… ▽ More

    Submitted 10 November, 2018; originally announced November 2018.

  42. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  43. arXiv:1810.08880  [pdf, other]

    stat.ME

    High-dimensional Two-sample Precision Matrices Test: An Adaptive Approach through Multiplier Bootstrap

    Authors: Mingjuan Zhang, Yong He, Cheng Zhou, Xinsheng Zhang

    Abstract: Precision matrix, which is the inverse of covariance matrix, plays an important role in statistics, as it captures the partial correlation between variables. Testing the equality of two precision matrices in high dimensional setting is a very challenging but meaningful problem, especially in the differential network modelling. To our best knowledge, existing test is only powerful for sparse altern…

    Submitted 20 October, 2018; originally announced October 2018.

    Comments: 30 pages 4 figures

  44. arXiv:1809.10816  [pdf, other]

    cs.LG stat.ML

    Generative Adversarial Active Learning for Unsupervised Outlier Detection

    Authors: Yezheng Liu, Zhe Li, Chong Zhou, Yuanchun Jiang, Jianshan Sun, Meng Wang, Xiangnan He

    Abstract: Outlier detection is an important topic in machine learning and has been used in a wide range of applications. In this paper, we approach outlier detection as a binary-classification issue by sampling potential outliers from a uniform reference distribution. However, due to the sparsity of data in high-dimensional space, a limited number of potential outliers may fail to provide sufficient informa…

    Submitted 17 March, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: TKDE 2019

  45. arXiv:1809.03672  [pdf, other]

    stat.ML cs.IR cs.LG

    Deep Interest Evolution Network for Click-Through Rate Prediction

    Authors: Guorui Zhou, Na Mou, Ying Fan, Qi Pi, Weijie Bian, Chang Zhou, Xiaoqiang Zhu, Kun Gai

    Abstract: Click-through rate (CTR) prediction, whose goal is to estimate the probability of the user clicks, has become one of the core tasks in advertising systems. For CTR prediction model, it is necessary to capture the latent user interest behind the user behavior data. Besides, considering the changing of the external environment and the internal cognition, user interest evolves over time dynamically.…

    Submitted 16 November, 2018; v1 submitted 10 September, 2018; originally announced September 2018.

    Comments: 9 pages. Accepted by AAAI 2019

    ACM Class: I.2.6

  46. arXiv:1807.00282  [pdf, ps, other]

    stat.ME

    A horse racing between the block maxima method and the peak-over-threshold approach

    Authors: Axel Bücher, Chen Zhou

    Abstract: Classical extreme value statistics consists of two fundamental approaches: the block maxima (BM) method and the peak-over-threshold (POT) approach. It seems to be general consensus among researchers in the field that the POT method makes use of extreme observations more efficiently than the BM method. We shed light on this discussion from three different perspectives. First, based on recent theore…

    Submitted 1 July, 2018; originally announced July 2018.

    Comments: 19 pages

  47. arXiv:1806.06940  [pdf]

    cs.LG stat.ML

    Pressure Predictions of Turbine Blades with Deep Learning

    Authors: Cheng'an Bai, Chao Zhou

    Abstract: Deep learning has been used in many areas, such as feature detections in images and the game of go. This paper presents a study that attempts to use the deep learning method to predict turbomachinery performance. Three different deep neural networks are built and trained to predict the pressure distributions of turbine airfoils. The performance of a library of turbine airfoils were firstly predict…

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: 16 pages, 13 figures

  48. arXiv:1805.06208  [pdf, other]

    stat.AP stat.ML

    CDM: Compound dissimilarity measure and an application to fingerprinting-based positioning

    Authors: Caifa Zhou, Andreas Wieser

    Abstract: A non-vector-based dissimilarity measure is proposed by combining vector-based distance metrics and set operations. This proposed compound dissimilarity measure (CDM) is applicable to quantify similarity of collections of attribute/feature pairs where not all attributes are present in all collections. This is a typical challenge in the context of e.g., fingerprinting-based positioning (FbP). Compa…

    Submitted 26 June, 2018; v1 submitted 16 May, 2018; originally announced May 2018.

    Comments: 7 pages, 5 figures, 3 tables, a paper accepted to be published IPIN 2018, Nantes France

  49. arXiv:1711.07812  [pdf, other]

    stat.ML stat.AP

    Jaccard analysis and LASSO-based feature selection for location fingerprinting with limited computational complexity

    Authors: Caifa Zhou, Andreas Wieser

    Abstract: We propose an approach to reduce both computational complexity and data storage requirements for the online positioning stage of a fingerprinting-based indoor positioning system (FIPS) by introducing segmentation of the region of interest (RoI) into sub-regions, sub-region selection using a modified Jaccard index, and feature selection based on randomized least absolute shrinkage and selection ope…

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: 16 pages, 4 figures, and 2 tables. Accepted to publish on LBS 2018, Zurich

  50. arXiv:1711.07564  [pdf, ps, other]

    math.OC stat.CO

    Unbiased Simulation for Optimizing Stochastic Function Compositions

    Authors: Jose Blanchet, Donald Goldfarb, Garud Iyengar, Fengpei Li, Chaoxu Zhou

    Abstract: In this paper, we introduce an unbiased gradient simulation algorithm for solving convex optimization problems with stochastic function compositions. We show that the unbiased gradient generated from the algorithm has finite variance and finite expected computation cost. We then combined the unbiased gradient simulation with two variance reduced algorithms (namely SVRG and SCSG) and showed that th…

    Submitted 20 November, 2017; originally announced November 2017.