Showing 1–40 of 40 results for author: Cai, H

Searching in archive stat.
  1. arXiv:2406.11092  [pdf, other]

    cs.LG math.NA stat.ML

    Guaranteed Sampling Flexibility for Low-tubal-rank Tensor Completion

    Authors: Bowen Su, Juntao You, HanQin Cai, Longxiu Huang

    Abstract: While Bernoulli sampling is extensively studied in tensor completion, t-CUR sampling approximates low-tubal-rank tensors via lateral and horizontal subtensors. However, both methods lack sufficient flexibility for diverse practical applications. To address this, we introduce Tensor Cross-Concentrated Sampling (t-CCS), a novel and straightforward sampling model that advances the matrix cross-concen…

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2406.07409  [pdf, other]

    stat.ML cs.IT cs.LG eess.SP math.OC

    Accelerating Ill-conditioned Hankel Matrix Recovery via Structured Newton-like Descent

    Authors: HanQin Cai, Longxiu Huang, Xiliang Lu, Juntao You

    Abstract: This paper studies the robust Hankel recovery problem, which simultaneously removes the sparse outliers and fulfills missing entries from the partial observation. We propose a novel non-convex algorithm, coined Hankel Structured Newton-Like Descent (HSNLD), to tackle the robust Hankel recovery problem. HSNLD is highly efficient with linear convergence, and its convergence rate is independent of th…

    Submitted 11 June, 2024; originally announced June 2024.

    MSC Class: 15A29; 15A83; 47B35; 90C17; 90C26; 90C53

  3. arXiv:2401.15566  [pdf, other]

    stat.ML cs.IT cs.LG math.OC

    On the Robustness of Cross-Concentrated Sampling for Matrix Completion

    Authors: HanQin Cai, Longxiu Huang, Chandra Kundu, Bowen Su

    Abstract: Matrix completion is one of the crucial tools in modern data science research. Recently, a novel sampling model for matrix completion coined cross-concentrated sampling (CCS) has caught much attention. However, the robustness of the CCS model against sparse outliers remains unclear in the existing studies. In this paper, we aim to answer this question by exploring a novel Robust CCS Completion pro…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 58th Annual Conference of Information Sciences and Systems

  4. arXiv:2401.10269  [pdf, ps, other]

    cs.IT eess.SP stat.ME

    Robust Multi-Sensor Multi-Target Tracking Using Possibility Labeled Multi-Bernoulli Filter

    Authors: Han Cai, Chenbao Xue, Jeremie Houssineau, Zhirun Xue

    Abstract: With the increasing complexity of multiple target tracking scenes, a single sensor may not be able to effectively monitor a large number of targets. Therefore, it is imperative to extend the single-sensor technique to Multi-Sensor Multi-Target Tracking (MSMTT) for enhanced functionality. Typical MSMTT methods presume complete randomness of all uncertain components, and therefore effective solution…

    Submitted 4 January, 2024; originally announced January 2024.

  5. arXiv:2401.05517  [pdf, other]

    stat.ME econ.EM math.ST

    On Efficient Inference of Causal Effects with Multiple Mediators

    Authors: Haoyu Wei, Hengrui Cai, Chengchun Shi, Rui Song

    Abstract: This paper provides robust estimators and efficient inference of causal effects involving multiple interacting mediators. Most existing works either impose a linear model assumption among the mediators or are restricted to handle conditionally independent mediators given the exposure. To overcome these limitations, we define causal and individual mediation effects in a general setting, and employ…

    Submitted 10 January, 2024; originally announced January 2024.

    MSC Class: 62A09; 62G05; 62G35

  6. arXiv:2401.00139  [pdf, other]

    cs.AI cs.CL cs.LG stat.ME

    Is Knowledge All Large Language Models Needed for Causal Reasoning?

    Authors: Hengrui Cai, Shengjie Liu, Rui Song

    Abstract: This paper explores the causal reasoning of large language models (LLMs) to enhance their interpretability and reliability in advancing artificial intelligence. Despite the proficiency of LLMs in a range of tasks, their potential for understanding causality requires further exploration. We propose a novel causal attribution model that utilizes "do-operators" for constructing counterfactual scenar…

    Submitted 5 June, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: A Python implementation of our proposed method is available at https://github.com/ncsulsj/Causal_LLM

  7. arXiv:2306.14115  [pdf, other]

    cs.LG cs.AI cs.CL stat.ME stat.ML

    Towards Trustworthy Explanation: On Causal Rationalization

    Authors: Wenbo Zhang, Tong Wu, Yunlong Wang, Yong Cai, Hengrui Cai

    Abstract: With recent advances in natural language processing, rationalization becomes an essential self-explaining diagram to disentangle the black box by selecting a subset of input texts to account for the major variation in prediction. Yet, existing association-based approaches on rationalization cannot identify true rationales when two or more snippets are highly inter-correlated and thus provide a sim…

    Submitted 8 September, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

    Comments: In Proceedings of the 40th International Conference on Machine Learning (ICML) GitHub Repository: https://github.com/onepounchman/Causal-Retionalization

  8. arXiv:2306.10475  [pdf, other]

    stat.ME cs.SI physics.soc-ph

    SpreadDetect: Detection of spreading change in a network over time

    Authors: Hanqing Cai, Tengyao Wang

    Abstract: Change-point analysis has been successfully applied to detect changes in multivariate data streams over time. In many applications, when data are observed over a graph/network, change does not occur simultaneously but instead spreads from an initial source coordinate to the neighbouring coordinates over time. We propose a new method, SpreadDetect, that estimates both the source coordinate and t…

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: 26 pages, 3 figures, 2 tables

  9. arXiv:2305.18577  [pdf, other]

    cs.LG math.OC stat.ML

    Towards Constituting Mathematical Structures for Learning to Optimize

    Authors: Jialin Liu, Xiaohan Chen, Zhangyang Wang, Wotao Yin, HanQin Cai

    Abstract: Learning to Optimize (L2O), a technique that utilizes machine learning to learn an optimization algorithm automatically from data, has gained arising attention in recent years. A generic L2O approach parameterizes the iterative update rule and learns the update direction as a black-box network. While the generic approach is widely applicable, the learned model can overfit and may not generalize we…

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  10. arXiv:2303.14281  [pdf, other]

    stat.ML cs.LG

    Sequential Knockoffs for Variable Selection in Reinforcement Learning

    Authors: Tao Ma, Hengrui Cai, Zhengling Qi, Chengchun Shi, Eric B. Laber

    Abstract: In real-world applications of reinforcement learning, it is often challenging to obtain a state representation that is parsimonious and satisfies the Markov property without prior knowledge. Consequently, it is common practice to construct a state which is larger than necessary, e.g., by concatenating measurements over contiguous time points. However, needlessly increasing the dimension of the sta…

    Submitted 24 March, 2023; originally announced March 2023.

  11. arXiv:2301.12389  [pdf, other]

    cs.LG cs.AI stat.AP stat.ME stat.ML

    On Learning Necessary and Sufficient Causal Graphs

    Authors: Hengrui Cai, Yixin Wang, Michael Jordan, Rui Song

    Abstract: The causal revolution has stimulated interest in understanding complex relationships in various fields. Most of the existing methods aim to discover causal relationships among all variables within a complex large-scale graph. However, in practice, only a small subset of variables in the graph are relevant to the outcomes of interest. Consequently, causal estimation with the full causal graph -- pa…

    Submitted 1 November, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: Advances in Neural Information Processing Systems 37 (Spotlight)

  12. arXiv:2301.12383  [pdf, other]

    stat.ME cs.LG stat.AP stat.ML

    On Heterogeneous Treatment Effects in Heterogeneous Causal Graphs

    Authors: Richard A Watson, Hengrui Cai, Xinming An, Samuel McLean, Rui Song

    Abstract: Heterogeneity and comorbidity are two interwoven challenges associated with various healthcare problems that greatly hampered research on developing effective treatment and understanding of the underlying neurobiological mechanism. Very few studies have been conducted to investigate heterogeneous causal effects (HCEs) in graphical contexts due to the lack of statistical methods. To characterize th…

    Submitted 25 June, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: In Proceedings of the 40th International Conference on Machine Learning (ICML) Code implementing the proposed algorithm is open-source and publicly available at: https://github.com/richard-watson/ISL

  13. arXiv:2212.14580  [pdf, ps, other]

    stat.ML cs.LG math.ST stat.ME

    Heterogeneous Synthetic Learner for Panel Data

    Authors: Ye Shen, Runzhe Wan, Hengrui Cai, Rui Song

    Abstract: In the new era of personalization, learning the heterogeneous treatment effect (HTE) becomes an inevitable trend with numerous applications. Yet, most existing HTE estimation methods focus on independently and identically distributed observations and cannot handle the non-stationarity and temporal dependency in the common panel data setting. The treatment evaluators developed for panel data, on th…

    Submitted 29 January, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

  14. arXiv:2206.09042  [pdf, other]

    stat.ML cs.LG math.NA

    Riemannian CUR Decompositions for Robust Principal Component Analysis

    Authors: Keaton Hamm, Mohamed Meskini, HanQin Cai

    Abstract: Robust Principal Component Analysis (PCA) has received massive attention in recent years. It aims to recover a low-rank matrix and a sparse matrix from their sum. This paper proposes a novel nonconvex Robust PCA algorithm, coined Riemannian CUR (RieCUR), which utilizes the ideas of Riemannian optimization and robust CUR decompositions. This algorithm has the same computational complexity as Iterat…

    Submitted 17 June, 2022; originally announced June 2022.

    Journal ref: ICML workshop on Topological, Algebraic and Geometric Learning (2022): 152-160

  15. arXiv:2205.07193  [pdf, other]

    stat.AP stat.ME

    How Much Does Home Field Advantage Matter in Soccer Games? A Causal Inference Approach for English Premier League Analysis

    Authors: Katherine Price, Hengrui Cai, Weining Shen, Guanyu Hu

    Abstract: In many sports, it is commonly believed that the home team has an advantage over the visiting team, known as the home field advantage. Yet its causal effect on team performance is largely unknown. In this paper, we propose a novel causal inference approach to study the causal effect of home field advantage in English Premier League. We develop a hierarchical causal model and show that both league…

    Submitted 15 May, 2022; originally announced May 2022.

  16. arXiv:2203.04695  [pdf, other]

    q-bio.BM cs.LG stat.ML

    Structured Multi-task Learning for Molecular Property Prediction

    Authors: Shengchao Liu, Meng Qu, Zuobai Zhang, Huiyu Cai, Jian Tang

    Abstract: Multi-task learning for molecular property prediction is becoming increasingly important in drug discovery. However, in contrast to other domains, the performance of multi-task learning in drug discovery is still not satisfying as the number of labeled data for each task is too limited, which calls for additional data to complement the data scarcity. In this paper, we study multi-task learning for…

    Submitted 5 October, 2022; v1 submitted 22 February, 2022; originally announced March 2022.

  17. arXiv:2112.04319  [pdf, other]

    cs.SI cs.LG stat.ML

    SCR: Training Graph Neural Networks with Consistency Regularization

    Authors: Chenhui Zhang, Yufei He, Yukuo Cen, Zhenyu Hou, Wenzheng Feng, Yuxiao Dong, Xu Cheng, Hongyun Cai, Feng He, Jie Tang

    Abstract: We present the SCR framework for enhancing the training of graph neural networks (GNNs) with consistency regularization. Regularization is a set of strategies used in Machine Learning to reduce overfitting and improve the generalization ability. However, it is unclear how to best design the generalization strategies in GNNs, as it works in a semi-supervised setting for graph data. The major challe…

    Submitted 13 June, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  18. arXiv:2111.08885  [pdf, other]

    stat.ME cs.LG math.ST stat.ML

    Jump Interval-Learning for Individualized Decision Making

    Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

    Abstract: An individualized decision rule (IDR) is a decision function that assigns each individual a given treatment based on his/her observed characteristics. Most of the existing works in the literature consider settings with binary or finitely many treatment options. In this paper, we focus on the continuous treatment setting and propose a jump interval-learning to develop an individualized interval-val…

    Submitted 28 January, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

  19. arXiv:2110.15501  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Doubly Robust Interval Estimation for Optimal Policy Evaluation in Online Learning

    Authors: Ye Shen, Hengrui Cai, Rui Song

    Abstract: Evaluating the performance of an ongoing policy plays a vital role in many areas such as medicine and economics, to provide crucial instruction on the early-stop of the online experiment and timely feedback from the environment. Policy evaluation in online learning thus attracts increasing attention by inferring the mean outcome of the optimal policy (i.e., the value) in real-time. Yet, such a pro…

    Submitted 28 January, 2023; v1 submitted 28 October, 2021; originally announced October 2021.

  20. arXiv:2110.05636  [pdf, other]

    stat.ML cs.LG stat.AP stat.ME

    CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search

    Authors: Hengrui Cai, Wenbin Lu, Rachel Marceau West, Devan V. Mehrotra, Lingkang Huang

    Abstract: Personalized medicine, a paradigm of medicine tailored to a patient's characteristics, is an increasingly attractive field in health care. An important goal of personalized medicine is to identify a subgroup of patients, based on baseline covariates, that benefits more from the targeted treatment than other comparative treatments. Most of the current subgroup identification methods only focus on o…

    Submitted 28 January, 2023; v1 submitted 11 October, 2021; originally announced October 2021.

  21. arXiv:2107.08724  [pdf, other]

    stat.ME math.ST

    Estimation of high-dimensional change-points under a group sparsity structure

    Authors: Hanqing Cai, Tengyao Wang

    Abstract: Change-points are a routine feature of 'big data' observed in the form of high-dimensional data streams. In many such data streams, the component series possess group structures and it is natural to assume that changes only occur in a small number of all groups. We propose a new change point procedure, called 'groupInspect', that exploits the group sparsity structure to estimate a projection direc…

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: 25 pages, 6 figures

  22. arXiv:2104.10573  [pdf, other]

    stat.ME math.ST stat.AP stat.ML

    GEAR: On Optimal Decision Making with Auxiliary Data

    Authors: Hengrui Cai, Rui Song, Wenbin Lu

    Abstract: Personalized optimal decision making, finding the optimal decision rule (ODR) based on individual characteristics, has attracted increasing attention recently in many fields, such as education, economics, and medicine. Current ODR methods usually require the primary outcome of interest in samples for assessing treatment effects, namely the experimental sample. However, in many studies, treatments…

    Submitted 21 April, 2021; originally announced April 2021.

  23. arXiv:2104.10554  [pdf, other]

    stat.ME math.ST stat.AP stat.ML

    Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome

    Authors: Hengrui Cai, Wenbin Lu, Rui Song

    Abstract: We consider the optimal decision-making problem in a primary sample of interest with multiple auxiliary sources available. The outcome of interest is limited in the sense that it is only observed in the primary sample. In reality, such multiple data sources may belong to heterogeneous studies and thus cannot be combined directly. This paper proposes a new framework to handle heterogeneous samples…

    Submitted 21 September, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

  24. arXiv:2102.10707  [pdf, other]

    math.OC cs.AI cs.LG stat.ML

    A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization

    Authors: HanQin Cai, Yuchen Lou, Daniel McKenzie, Wotao Yin

    Abstract: We consider the zeroth-order optimization problem in the huge-scale setting, where the dimension of the problem is so large that performing even basic vector operations on the decision variables is infeasible. In this paper, we propose a novel algorithm, coined ZO-BCD, that exhibits favorable overall query complexity and has a much smaller per-iteration computational complexity. In addition, we di…

    Submitted 11 June, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Accepted to ICML 2021

    Journal ref: Proceedings of the 38th International Conference on Machine Learning, PMLR 139:1193-1203, 2021

  25. arXiv:2010.15963  [pdf, other]

    stat.ML cs.LG

    Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings

    Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

    Abstract: We consider off-policy evaluation (OPE) in continuous treatment settings, such as personalized dose-finding. In OPE, one aims to estimate the mean outcome under a new treatment decision rule using historical data generated by a different decision rule. Most existing works on OPE focus on discrete treatment settings. To handle continuous treatments, we develop a novel estimation method for OPE usin…

    Submitted 4 November, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

  26. arXiv:2010.07422  [pdf, other]

    stat.ML cs.AI cs.IT cs.LG math.NA math.OC

    Rapid Robust Principal Component Analysis: CUR Accelerated Inexact Low Rank Estimation

    Authors: HanQin Cai, Keaton Hamm, Longxiu Huang, Jiaqi Li, Tao Wang

    Abstract: Robust principal component analysis (RPCA) is a widely used tool for dimension reduction. In this work, we propose a novel non-convex algorithm, coined Iterated Robust CUR (IRCUR), for solving RPCA problems, which dramatically improves the computational efficiency in comparison with the existing algorithms. IRCUR achieves this acceleration by employing CUR decomposition when updating the low rank…

    Submitted 7 February, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Journal ref: IEEE Signal Processing Letters, 28 (2021): 116-120

  27. arXiv:2007.13533  [pdf]

    eess.IV cs.LG stat.ML

    Learning Common Harmonic Waves on Stiefel Manifold -- A New Mathematical Approach for Brain Network Analyses

    Authors: Jiazhou Chen, Guoqiang Han, Hongmin Cai, Defu Yang, Paul J. Laurienti, Martin Styner, Guorong Wu, Alzheimer's Disease Neuroimaging Initiative ADNI

    Abstract: Converging evidence shows that disease-relevant brain alterations do not appear in random brain locations, instead, its spatial pattern follows large scale brain networks. In this context, a powerful network analysis approach with a mathematical foundation is indispensable to understand the mechanism of neuropathological events spreading throughout the brain. Indeed, the topology of each brain net…

    Submitted 1 July, 2020; originally announced July 2020.

  28. arXiv:2006.08509  [pdf, other]

    cs.LG cs.CV stat.ML

    APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

    Authors: Tianzhe Wang, Kuan Wang, Han Cai, Ji Lin, Zhijian Liu, Song Han

    Abstract: We present APQ for efficient deep learning inference on resource-constrained hardware. Unlike previous methods that separately search the neural architecture, pruning policy, and quantization policy, we optimize them in a joint manner. To deal with the larger design space it brings, a promising approach is to train a quantization-aware accuracy predictor to quickly get the accuracy of the quantize…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: Accepted by CVPR 2020

  29. arXiv:1909.06189  [pdf, other]

    cs.CR stat.ML

    Machine Learning in/for Blockchain: Future and Challenges

    Authors: Fang Chen, Hong Wan, Hua Cai, Guang Cheng

    Abstract: Machine learning and blockchain are two of the most noticeable technologies in recent years. The first one is the foundation of artificial intelligence and big data, and the second one has significantly disrupted the financial industry. Both technologies are data-driven, and thus there are rapidly growing interests in integrating them for more secure and efficient data sharing and analysis. In thi…

    Submitted 8 December, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

  30. arXiv:1908.09791  [pdf, other]

    cs.LG cs.CV stat.ML

    Once-for-All: Train One Network and Specialize it for Efficient Deployment

    Authors: Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, Song Han

    Abstract: We address the challenging problem of efficient inference across many devices and resource constraints, especially on edge devices. Conventional approaches either manually design or use neural architecture search (NAS) to find a specialized neural network and train it from scratch for each case, which is computationally prohibitive (causing $CO_2$ emission as much as 5 cars' lifetime) thus unscala…

    Submitted 29 April, 2020; v1 submitted 26 August, 2019; originally announced August 2019.

    Comments: ICLR 2020

  31. arXiv:1905.03920  [pdf, other]

    cs.LG stat.ML

    Integrating Tensor Similarity to Enhance Clustering Performance

    Authors: Hong Peng, Yu Hu, Jiazhou Chen, Haiyan Wang, Yang Li, Hongmin Cai

    Abstract: The performance of most the clustering methods hinges on the used pairwise affinity, which is usually denoted by a similarity matrix. However, the pairwise similarity is notoriously known for its vulnerability of noise contamination or the imbalance in samples or features, and thus hinders accurate clustering. To tackle this issue, we propose to use information among samples to boost the clusterin…

    Submitted 26 June, 2020; v1 submitted 9 May, 2019; originally announced May 2019.

    Comments: 10 pages, 7 figures, 2 tables, 4 pages supplementary information appendix

    MSC Class: 68U99

  32. arXiv:1904.10616  [pdf, other]

    cs.LG stat.ML

    Design Automation for Efficient Deep Learning Computing

    Authors: Song Han, Han Cai, Ligeng Zhu, Ji Lin, Kuan Wang, Zhijian Liu, Yujun Lin

    Abstract: Efficient deep learning computing requires algorithm and hardware co-design to enable specialization: we usually need to change the algorithm to reduce memory footprint and improve energy efficiency. However, the extra degree of freedom from the algorithm makes the design space much larger: it's not only about designing the hardware but also about how to tweak the algorithm to best fit the hardwar…

    Submitted 23 April, 2019; originally announced April 2019.

  33. arXiv:1812.00332  [pdf, other]

    cs.LG cs.CV stat.ML

    ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

    Authors: Han Cai, Ligeng Zhu, Song Han

    Abstract: Neural architecture search (NAS) has a great impact by automatically designing effective neural network architectures. However, the prohibitive computational demand of conventional NAS algorithms (e.g. $10^4$ GPU hours) makes it difficult to directly search the architectures on large-scale tasks (e.g. ImageNet). Differentiable NAS can reduce the cost of GPU hours via a continuous representa…

    Submitted 22 February, 2019; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: ICLR 2019

  34. arXiv:1811.05869  [pdf, other]

    cs.LG cs.AI stat.ML

    Large-scale Interactive Recommendation with Tree-structured Policy Gradient

    Authors: Haokun Chen, Xinyi Dai, Han Cai, Weinan Zhang, Xuejian Wang, Ruiming Tang, Yuzhou Zhang, Yong Yu

    Abstract: Reinforcement learning (RL) has recently been introduced to interactive recommender systems (IRS) because of its nature of learning from dynamic interactions and planning for long-run performance. As IRS is always with thousands of items to recommend (i.e., thousands of actions), most existing RL-based methods, however, fail to handle such a large discrete action space problem and thus become inef…

    Submitted 14 November, 2018; originally announced November 2018.

  35. arXiv:1808.03587  [pdf]

    stat.AP

    A simplified convolutional sparse filter for impulsive signature enhancement and its application to the prognostic of rotating machinery

    Authors: Xiaodong Jia, Ming Zhao, Haoshu Cai, Jay Lee

    Abstract: Impulsive signature enhancement (ISE) is an important topic in the monitoring of rotating machinery and many different methods have been proposed. Even though, the topic of how to leverage these ISE techniques to improve the data quality in terms of prognostics and health management (PHM) still needs to be investigated. In this work, a systematic view for data quality enhancement is presented. The…

    Submitted 10 August, 2018; originally announced August 2018.

  36. arXiv:1806.02639  [pdf, other]

    cs.LG cs.AI stat.ML

    Path-Level Network Transformation for Efficient Architecture Search

    Authors: Han Cai, Jiacheng Yang, Weinan Zhang, Song Han, Yong Yu

    Abstract: We introduce a new function-preserving transformation for efficient neural architecture search. This network transformation allows reusing previously trained networks and existing successful architectures that improves sample efficiency. We aim to address the limitation of current network transformation operations that can only perform layer-level architecture modifications, such as adding (prunin…

    Submitted 7 June, 2018; originally announced June 2018.

    Comments: ICML 2018

  37. Deep Video Generation, Prediction and Completion of Human Action Sequences

    Authors: Haoye Cai, Chunyan Bai, Yu-Wing Tai, Chi-Keung Tang

    Abstract: Current deep learning results on video generation are limited while there are only a few first results on video prediction and no relevant significant results on video completion. This is due to the severe ill-posedness inherent in these three problems. In this paper, we focus on human action videos, and propose a general, two-stage deep framework to generate human action videos with no constraint…

    Submitted 8 December, 2017; v1 submitted 23 November, 2017; originally announced November 2017.

    Comments: Under review for CVPR 2018. Haoye and Chunyan have equal contribution

  38. arXiv:1705.05085  [pdf, other]

    cs.LG stat.ML

    Active Learning for Graph Embedding

    Authors: Hongyun Cai, Vincent W. Zheng, Kevin Chen-Chuan Chang

    Abstract: Graph embedding provides an efficient solution for graph analysis by converting the graph into a low-dimensional space which preserves the structure information. In contrast to the graph structure data, the i.i.d. node embedding can be processed efficiently in terms of both time and space. Current semi-supervised graph embedding algorithms assume the labelled nodes are given, which may not be alwa…

    Submitted 15 May, 2017; originally announced May 2017.

    Comments: Technical Report

  39. arXiv:1703.02000  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Activation Maximization Generative Adversarial Nets

    Authors: Zhiming Zhou, Han Cai, Shu Rong, Yuxuan Song, Kan Ren, Weinan Zhang, Yong Yu, Jun Wang

    Abstract: Class labels have been empirically shown useful in improving the sample quality of generative adversarial nets (GANs). In this paper, we mathematically study the properties of the current variants of GANs that make use of class label information. With class aware gradient and cross-entropy decomposition, we reveal how class labels and associated losses influence GAN's training. Based on that, we p…

    Submitted 16 November, 2018; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: Accepted as a conference paper on ICLR 2018

  40. A Threshold-free Prospective Prediction Accuracy Measure for Censored Time to Event Data

    Authors: Yan Yuan, Qian M. Zhou, Bingying Li, Hengrui Cai, Eric J. Chow, Gregory T. Armstrong

    Abstract: Prediction performance of a risk scoring system needs to be carefully assessed before its adoption in clinical practice. Clinical preventive care often uses risk scores to screen asymptomatic population. The primary clinical interest is to predict the risk of having an event by a pre-specified future time $t_0$. Prospective accuracy measures such as positive predictive values have been recommended…

    Submitted 14 September, 2016; v1 submitted 13 June, 2016; originally announced June 2016.

    Comments: 17 pages, 2 figures, 3 tables

    Journal ref: Statistics in Medicine 37(10):1671-1681, 2018