Skip to main content

Showing 1–50 of 63 results for author: Lu, W

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.01908  [pdf, other

    math.ST stat.ML

    A Full Adagrad algorithm with O(Nd) operations

    Authors: Antoine Godichon-Baggioni, Wei Lu, Bruno Portier

    Abstract: A novel approach is given to overcome the computational challenges of the full-matrix Adaptive Gradient algorithm (Full AdaGrad) in stochastic optimization. By develo** a recursive method that estimates the inverse of the square root of the covariance of the gradient, alongside a streaming variant for parameter updates, the study offers efficient and practical algorithms for large-scale applicat… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  2. arXiv:2404.12597  [pdf, other

    cs.LG math.ST stat.ML

    The phase diagram of kernel interpolation in large dimensions

    Authors: Haobo Zhang, Weihao Lu, Qian Lin

    Abstract: The generalization ability of kernel interpolation in large dimensions (i.e., $n \asymp d^γ$ for some $γ>0$) might be one of the most interesting problems in the recent renaissance of kernel regression, since it may help us understand the 'benign overfitting phenomenon' reported in the neural networks literature. Focusing on the inner product kernel on the sphere, we fully characterized the exact… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 18 pages, 1 figure

  3. arXiv:2403.08757  [pdf, other

    stat.ML cs.LG math.CO physics.app-ph

    Efficient Combinatorial Optimization via Heat Diffusion

    Authors: Hengyuan Ma, Wenlian Lu, Jianfeng Feng

    Abstract: Combinatorial optimization problems are widespread but inherently challenging due to their discrete nature.The primary limitation of existing methods is that they can only access a small fraction of the solution space at each iteration, resulting in limited efficiency for searching the global optimal. To overcome this challenge, diverging from conventional efforts of expanding the solver's search… ▽ More

    Submitted 14 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Code is available in https://github.com/AwakerMhy/HeO

  4. arXiv:2402.10456  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    Generative Modeling for Tabular Data via Penalized Optimal Transport Network

    Authors: Wenhui Sophia Lu, Chenyang Zhong, Wing Hung Wong

    Abstract: The task of precisely learning the probability distribution of rows within tabular data and producing authentic synthetic samples is both crucial and non-trivial. Wasserstein generative adversarial network (WGAN) marks a notable improvement in generative modeling, addressing the challenges faced by its predecessor, generative adversarial network. However, due to the mixed data types and multimodal… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 37 pages, 23 figures

  5. arXiv:2401.10923  [pdf, other

    math.OC stat.ML

    Online estimation of the inverse of the Hessian for stochastic optimization with application to universal stochastic Newton algorithms

    Authors: Antoine Godichon-Baggioni, Wei Lu, Bruno Portier

    Abstract: This paper addresses second-order stochastic optimization for estimating the minimizer of a convex function written as an expectation. A direct recursive estimation technique for the inverse Hessian matrix using a Robbins-Monro procedure is introduced. This approach enables to drastically reduces computational complexity. Above all, it allows to develop universal stochastic Newton methods and inve… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  6. arXiv:2401.00104  [pdf, other

    cs.LG cs.AI stat.ME

    Causal State Distillation for Explainable Reinforcement Learning

    Authors: Wenhao Lu, Xufeng Zhao, Thilo Fryen, Jae Hee Lee, Mengdi Li, Sven Magg, Stefan Wermter

    Abstract: Reinforcement learning (RL) is a powerful technique for training intelligent agents, but understanding why these agents make specific decisions can be quite challenging. This lack of transparency in RL models has been a long-standing problem, making it difficult for users to grasp the reasons behind an agent's behaviour. Various approaches have been explored to address this problem, with one promi… ▽ More

    Submitted 1 April, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: https://lukaswill.github.io/; Accepted as oral by CLeaR 2024

  7. arXiv:2311.13958  [pdf, other

    stat.ML cs.CV cs.LG

    Handling The Non-Smooth Challenge in Tensor SVD: A Multi-Objective Tensor Recovery Framework

    Authors: **g**g Zheng, Wanglong Lu, Wenzhe Wang, Yankai Cao, Xiaoqin Zhang, Xianta Jiang

    Abstract: Recently, numerous tensor singular value decomposition (t-SVD)-based tensor recovery methods have shown promise in processing visual data, such as color images and videos. However, these methods often suffer from severe performance degradation when confronted with tensor data exhibiting non-smooth changes. It has been commonly observed in real-world scenarios but ignored by the traditional t-SVD-b… ▽ More

    Submitted 31 March, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

  8. arXiv:2309.04268  [pdf, other

    stat.ML cs.LG math.ST

    Optimal Rate of Kernel Regression in Large Dimensions

    Authors: Weihao Lu, Haobo Zhang, Yicheng Li, Manyun Xu, Qian Lin

    Abstract: We perform a study on kernel regression for large-dimensional data (where the sample size $n$ is polynomially depending on the dimension $d$ of the samples, i.e., $n\asymp d^γ$ for some $γ>0$ ). We first build a general tool to characterize the upper bound and the minimax lower bound of kernel regression for large dimensional data through the Mendelson complexity $\varepsilon_{n}^{2}$ and the metr… ▽ More

    Submitted 28 June, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    MSC Class: 62G08; 46E22; 68T07

  9. arXiv:2308.09444  [pdf, other

    cs.LG stat.ML

    An Efficient 1 Iteration Learning Algorithm for Gaussian Mixture Model And Gaussian Mixture Embedding For Neural Network

    Authors: Weiguo Lu, Xuan Wu, Deng Ding, Gangnan Yuan

    Abstract: We propose an Gaussian Mixture Model (GMM) learning algorithm, based on our previous work of GMM expansion idea. The new algorithm brings more robustness and simplicity than classic Expectation Maximization (EM) algorithm. It also improves the accuracy and only take 1 iteration for learning. We theoretically proof that this new algorithm is guarantee to converge regardless the parameters initialis… ▽ More

    Submitted 6 September, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

  10. arXiv:2307.13290  [pdf, other

    stat.ML cs.LG math.OC

    Modify Training Directions in Function Space to Reduce Generalization Error

    Authors: Yi Yu, Wenlian Lu, Boyu Chen

    Abstract: We propose theoretical analyses of a modified natural gradient descent method in the neural network function space based on the eigendecompositions of neural tangent kernel and Fisher information matrix. We firstly present analytical expression for the function learned by this modified natural gradient under the assumptions of Gaussian distribution and infinite width limit. Thus, we explicitly der… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

  11. arXiv:2306.07456  [pdf

    stat.AP stat.ME

    On the Temporal-spatial Analysis of Estimating Urban Traffic Patterns Via GPS Trace Data of Car-hailing Vehicles

    Authors: Jiannan Mao, Lan Liu, Hao Huang, Weike Lu, Kaiyu Yang, Tianli Tang, Haotian Shi

    Abstract: Car-hailing services have become a prominent data source for urban traffic studies. Extracting useful information from car-hailing trace data is essential for effective traffic management, while discrepancies between car-hailing vehicles and urban traffic should be considered. This paper proposes a generic framework for estimating and analyzing urban traffic patterns using car-hailing trace data.… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  12. arXiv:2304.00770  [pdf, other

    stat.ML

    Online stochastic Newton methods for estimating the geometric median and applications

    Authors: Antoine Godichon-Baggioni, Wei Lu

    Abstract: In the context of large samples, a small number of individuals might spoil basic statistical indicators like the mean. It is difficult to detect automatically these atypical individuals, and an alternative strategy is using robust approaches. This paper focuses on estimating the geometric median of a random variable, which is a robust indicator of central tendency. In order to deal with large samp… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  13. arXiv:2303.11536  [pdf, other

    cs.LG cs.AI cs.CV math.ST stat.ML

    Indeterminate Probability Neural Network

    Authors: Tao Yang, Chuang Liu, Xiaofeng Ma, Weijia Lu, Ning Wu, Bingyang Li, Zhifei Yang, Peng Liu, Lin Sun, Xiaodong Zhang, Can Zhang

    Abstract: We propose a new general model called IPNN - Indeterminate Probability Neural Network, which combines neural network and probability theory together. In the classical probability theory, the calculation of probability is based on the occurrence of events, which is hardly used in current neural networks. In this paper, we propose a new general probability theory, which is an extension of classical… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 13 pages

  14. arXiv:2212.12845  [pdf, ps, other

    stat.ME cs.LG

    Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies

    Authors: Runzhe Wan, Yingying Li, Wenbin Lu, Rui Song

    Abstract: Latent factor model estimation typically relies on either using domain knowledge to manually pick several observed covariates as factor proxies, or purely conducting multivariate analysis such as principal component analysis. However, the former approach may suffer from the bias while the latter can not incorporate additional information. We propose to bridge these two approaches while allowing th… ▽ More

    Submitted 2 January, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

  15. arXiv:2203.06509  [pdf, other

    stat.CO

    Distributed Community Detection in Large Networks

    Authors: Sheng Zhang, Rui Song, Wenbin Lu, Ji Zhu

    Abstract: Community detection for large networks is a challenging task due to the high computational cost as well as the heterogeneous community structure. Stochastic block model (SBM) is a popular model to analyze community structure where nodes belonging to the same communities are connected with equal probability. Modularity optimization methods provide a fast and effective way for community detection un… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

  16. arXiv:2203.02318  [pdf, ps, other

    stat.ME stat.ML

    Adaptive Semi-Supervised Inference for Optimal Treatment Decisions with Electronic Medical Record Data

    Authors: Kevin Gunn, Wenbin Lu, Rui Song

    Abstract: A treatment regime is a rule that assigns a treatment to patients based on their covariate information. Recently, estimation of the optimal treatment regime that yields the greatest overall expected clinical outcome of interest has attracted a lot of attention. In this work, we consider estimation of the optimal treatment regime with electronic medical record data under a semi-supervised setting.… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

  17. arXiv:2202.12440  [pdf, other

    stat.ML cs.LG

    On Learning and Testing of Counterfactual Fairness through Data Preprocessing

    Authors: Haoyu Chen, Wenbin Lu, Rui Song, Pulak Ghosh

    Abstract: Machine learning has become more important in real-life decision-making but people are concerned about the ethical problems it may bring when used improperly. Recent work brings the discussion of machine learning fairness into the causal framework and elaborates on the concept of Counterfactual Fairness. In this paper, we develop the Fair Learning through dAta Preprocessing (FLAP) algorithm to lea… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  18. arXiv:2201.06229  [pdf, ps, other

    stat.ME stat.AP stat.ML

    Targeted Optimal Treatment Regime Learning Using Summary Statistics

    Authors: Jianing Chu, Wenbin Lu, Shu Yang

    Abstract: Personalized decision-making, aiming to derive optimal treatment regimes based on individual characteristics, has recently attracted increasing attention in many fields, such as medicine, social services, and economics. Current literature mainly focuses on estimating treatment regimes from a single source population. In real-world applications, the distribution of a target population can be differ… ▽ More

    Submitted 25 February, 2023; v1 submitted 17 January, 2022; originally announced January 2022.

  19. arXiv:2111.08885  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Jump Interval-Learning for Individualized Decision Making

    Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

    Abstract: An individualized decision rule (IDR) is a decision function that assigns each individual a given treatment based on his/her observed characteristics. Most of the existing works in the literature consider settings with binary or finitely many treatment options. In this paper, we focus on the continuous treatment setting and propose a jump interval-learning to develop an individualized interval-val… ▽ More

    Submitted 28 January, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

  20. arXiv:2111.03943  [pdf, ps, other

    cs.LG stat.CO stat.ML

    A Probit Tensor Factorization Model For Relational Learning

    Authors: Ye Liu, Rui Song, Wenbin Lu, Yanghua Xiao

    Abstract: With the proliferation of knowledge graphs, modeling data with complex multirelational structure has gained increasing attention in the area of statistical relational learning. One of the most important goals of statistical relational learning is link prediction, i.e., predicting whether certain relations exist in the knowledge graph. A large number of models and algorithms have been proposed to p… ▽ More

    Submitted 8 November, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: 30 pages

  21. arXiv:2110.05636  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search

    Authors: Hengrui Cai, Wenbin Lu, Rachel Marceau West, Devan V. Mehrotra, Lingkang Huang

    Abstract: Personalized medicine, a paradigm of medicine tailored to a patient's characteristics, is an increasingly attractive field in health care. An important goal of personalized medicine is to identify a subgroup of patients, based on baseline covariates, that benefits more from the targeted treatment than other comparative treatments. Most of the current subgroup identification methods only focus on o… ▽ More

    Submitted 28 January, 2023; v1 submitted 11 October, 2021; originally announced October 2021.

  22. arXiv:2109.00712  [pdf, other

    stat.ME

    Online Testing of Subgroup Treatment Effects Based on Value Difference

    Authors: Miao Yu, Wenbin Lu, Rui Song

    Abstract: Online A/B testing plays a critical role in the high-tech industry to guide product development and accelerate innovation. It performs a null hypothesis statistical test to determine which variant is better. However, a typical A/B test presents two problems: (i) a fixed-horizon framework inflates the false-positive errors under continuous monitoring; (ii) the homogeneous effects assumption fails t… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  23. arXiv:2107.04839  [pdf, other

    stat.ME stat.AP

    On Estimating Optimal Regime for Treatment Initiation Time Based on Restricted Mean Residual Lifetime

    Authors: Xin Chen, Rui Song, Jiajia Zhang, Swann Arp Adams, Liuquan Sun, Wenbin Lu

    Abstract: When to initiate treatment on patients is an important problem in many medical studies such as AIDS and cancer. In this article, we formulate the treatment initiation time problem for time-to-event data and propose an optimal individualized regime that determines the best treatment initiation time for individual patients based on their characteristics. Different from existing optimal treatment reg… ▽ More

    Submitted 29 September, 2021; v1 submitted 10 July, 2021; originally announced July 2021.

  24. arXiv:2107.01152  [pdf, other

    stat.ML cs.AI cs.CV cs.IT cs.LG

    Simpler, Faster, Stronger: Breaking The log-K Curse On Contrastive Learners With FlatNCE

    Authors: Junya Chen, Zhe Gan, Xuan Li, Qing Guo, Liqun Chen, Shuyang Gao, Tagyoung Chung, Yi Xu, Belinda Zeng, Wenlian Lu, Fan Li, Lawrence Carin, Chenyang Tao

    Abstract: InfoNCE-based contrastive representation learners, such as SimCLR, have been tremendously successful in recent years. However, these contrastive schemes are notoriously resource demanding, as their effectiveness breaks down with small-batch training (i.e., the log-K curse, whereas K is the batch-size). In this work, we reveal mathematically why contrastive learners fail in the small-batch-size reg… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

  25. arXiv:2104.10573  [pdf, other

    stat.ME math.ST stat.AP stat.ML

    GEAR: On Optimal Decision Making with Auxiliary Data

    Authors: Hengrui Cai, Rui Song, Wenbin Lu

    Abstract: Personalized optimal decision making, finding the optimal decision rule (ODR) based on individual characteristics, has attracted increasing attention recently in many fields, such as education, economics, and medicine. Current ODR methods usually require the primary outcome of interest in samples for assessing treatment effects, namely the experimental sample. However, in many studies, treatments… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

  26. arXiv:2104.10554  [pdf, other

    stat.ME math.ST stat.AP stat.ML

    Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome

    Authors: Hengrui Cai, Wenbin Lu, Rui Song

    Abstract: We consider the optimal decision-making problem in a primary sample of interest with multiple auxiliary sources available. The outcome of interest is limited in the sense that it is only observed in the primary sample. In reality, such multiple data sources may belong to heterogeneous studies and thus cannot be combined directly. This paper proposes a new framework to handle heterogeneous samples… ▽ More

    Submitted 21 September, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

  27. arXiv:2103.11147  [pdf, ps, other

    math.ST stat.OT

    A unified approach for covariance matrix estimation under Stein loss

    Authors: Anis M. Haddouche, Wei Lu

    Abstract: In this paper, we address the problem of estimating a covariance matrix of a multivariate Gaussian distribution, relative to a Stein loss function, from a decision theoretic point of view. We investigate the case where the covariance matrix is invertible and the case when it is non--invertible in a unified approach.

    Submitted 20 March, 2021; originally announced March 2021.

  28. arXiv:2011.14542  [pdf, other

    math.ST math.PR stat.ME

    Calibration for multivariate Lévy-driven Ornstein-Uhlenbeck processes with applications to weak subordination

    Authors: Kevin W. Lu

    Abstract: Consider a multivariate Lévy-driven Ornstein-Uhlenbeck process where the stationary distribution or background driving Lévy process is from a parametric family. We derive the likelihood function assuming that the innovation term is absolutely continuous. Two examples are studied in detail: the process where the stationary distribution or background driving Lévy process is given by a weak variance… ▽ More

    Submitted 31 August, 2021; v1 submitted 29 November, 2020; originally announced November 2020.

    MSC Class: 62M05; 60G51; 60G10

  29. arXiv:2010.15963  [pdf, other

    stat.ML cs.LG

    Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings

    Authors: Hengrui Cai, Chengchun Shi, Rui Song, Wenbin Lu

    Abstract: We consider off-policy evaluation (OPE) in continuous treatment settings, such as personalized dose-finding. In OPE, one aims to estimate the mean outcome under a new treatment decision rule using historical data generated by a different decision rule. Most existing works on OPE focus on discrete treatment settings. To handle continuous treatments, we develop a novel estimation method for OPE usin… ▽ More

    Submitted 4 November, 2021; v1 submitted 29 October, 2020; originally announced October 2020.

  30. Statistical Inference for Online Decision Making via Stochastic Gradient Descent

    Authors: Haoyu Chen, Wenbin Lu, Rui Song

    Abstract: Online decision making aims to learn the optimal decision rule by making personalized decisions and updating the decision rule recursively. It has become easier than before with the help of big data, but new challenges also come along. Since the decision rule should be updated once per step, an offline update which uses all the historical data is inefficient in computation and storage. To this end… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted by the Journal of the American Statistical Association

  31. Statistical Inference for Online Decision-Making: In a Contextual Bandit Setting

    Authors: Haoyu Chen, Wenbin Lu, Rui Song

    Abstract: Online decision-making problem requires us to make a sequence of decisions based on incremental information. Common solutions often need to learn a reward model of different actions given the contextual information and then maximize the long-term reward. It is meaningful to know if the posited model is reasonable and how the model performs in the asymptotic sense. We study this problem under the s… ▽ More

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Accepted by the Journal of the American Statistical Association

  32. arXiv:2008.08931  [pdf, other

    cs.SI cs.LG stat.ML

    A Deep Prediction Network for Understanding Advertiser Intent and Satisfaction

    Authors: Liyi Guo, Rui Lu, Haoqi Zhang, Junqi **, Zhenzhe Zheng, Fan Wu, ** Li, Haiyang Xu, Han Li, Wenkai Lu, Jian Xu, Kun Gai

    Abstract: For e-commerce platforms such as Taobao and Amazon, advertisers play an important role in the entire digital ecosystem: their behaviors explicitly influence users' browsing and shop** experience; more importantly, advertiser's expenditure on advertising constitutes a primary source of platform revenue. Therefore, providing better services for advertisers is essential for the long-term prosperity… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Journal ref: CIKM 2020, Virtual Event, Ireland

  33. Predicting heave and surge motions of a semi-submersible with neural networks

    Authors: Xiaoxian Guo, Xiantao Zhang, Xinliang Tian, Xin Li, Wenyue Lu

    Abstract: Real-time motion prediction of a vessel or a floating platform can help to improve the performance of motion compensation systems. It can also provide useful early-warning information for offshore operations that are critical with regard to motion. In this study, a long short-term memory (LSTM) -based machine learning model was developed to predict heave and surge motions of a semi-submersible. Th… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 16 pages, 22 figures, submitted to Applied Ocean Research

  34. arXiv:2007.09812  [pdf, ps, other

    stat.ME

    Causal Effect Estimation and Optimal Dose Suggestions in Mobile Health

    Authors: Liangyu Zhu, Wenbin Lu, Rui Song

    Abstract: In this article, we propose novel structural nested models to estimate causal effects of continuous treatments based on mobile health data. To find the treatment regime that optimizes the expected short-term outcomes for patients, we define a weighted lag-K advantage as the value function. The optimal treatment regime is then defined to be the one that maximizes the value function. Our method impo… ▽ More

    Submitted 23 July, 2020; v1 submitted 19 July, 2020; originally announced July 2020.

    Comments: Accepted for ICML 2020

  35. arXiv:2007.09811  [pdf, ps, other

    stat.ME math.ST stat.AP stat.ML

    Kernel Assisted Learning for Personalized Dose Finding

    Authors: Liangyu Zhu, Wenbin Lu, Michael R. Kosorok, Rui Song

    Abstract: An individualized dose rule recommends a dose level within a continuous safe dose range based on patient level information such as physical conditions, genetic factors and medication histories. Traditionally, personalized dose finding process requires repeating clinical visits of the patient and frequent adjustments of the dosage. Thus the patient is constantly exposed to the risk of underdosing a… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: Accepted for KDD 2020

  36. arXiv:2003.10838  [pdf, other

    cs.CY cs.LG stat.ML

    Prob2Vec: Mathematical Semantic Embedding for Problem Retrieval in Adaptive Tutoring

    Authors: Du Su, Ali Yekkehkhany, Yi Lu, Wenmiao Lu

    Abstract: We propose a new application of embedding techniques for problem retrieval in adaptive tutoring. The objective is to retrieve problems whose mathematical concepts are similar. There are two challenges: First, like sentences, problems helpful to tutoring are never exactly the same in terms of the underlying concepts. Instead, good problems mix concepts in innovative ways, while still displaying con… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

  37. arXiv:2002.03277  [pdf, ps, other

    stat.ME stat.AP

    A New Framework for Online Testing of Heterogeneous Treatment Effect

    Authors: Miao Yu, Wenbin Lu, Rui Song

    Abstract: We propose a new framework for online testing of heterogeneous treatment effects. The proposed test, named sequential score test (SST), is able to control type I error under continuous monitoring and detect multi-dimensional heterogeneous treatment effects. We provide an online p-value calculation for SST, making it convenient for continuous monitoring, and extend our tests to online multiple test… ▽ More

    Submitted 8 February, 2020; originally announced February 2020.

    Comments: 8 pages, no figures. To be published on AAAI 2020 proceedings

  38. arXiv:2002.01927  [pdf, other

    cs.CE cs.LG stat.ML

    Self-Directed Online Machine Learning for Topology Optimization

    Authors: Changyu Deng, Yizhou Wang, Can Qin, Yun Fu, Wei Lu

    Abstract: Topology optimization by optimally distributing materials in a given domain requires non-gradient optimizers to solve highly complicated problems. However, with hundreds of design variables or more involved, solving such problems would require millions of Finite Element Method (FEM) calculations whose computational cost is huge and impractical. Here we report Self-directed Online Learning Optimiza… ▽ More

    Submitted 25 January, 2022; v1 submitted 4 February, 2020; originally announced February 2020.

  39. arXiv:2002.01751  [pdf, other

    stat.ML cs.LG

    Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making

    Authors: Chengchun Shi, Runzhe Wan, Rui Song, Wenbin Lu, Ling Leng

    Abstract: The Markov assumption (MA) is fundamental to the empirical validity of reinforcement learning. In this paper, we propose a novel Forward-Backward Learning procedure to test MA in sequential decision making. The proposed test does not assume any parametric form on the joint distribution of the observed data and plays an important role for identifying the optimal policy in high-order Markov decision… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

  40. arXiv:2001.04515  [pdf, other

    stat.ML cs.LG

    Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings

    Authors: C. Shi, S. Zhang, W. Lu, R. Song

    Abstract: Reinforcement learning is a general technique that allows an agent to learn an optimal policy and interact with an environment in sequential decision making problems. The goodness of a policy is measured by its value function starting from some initial state. The focus of this paper is to construct confidence intervals (CIs) for a policy's value in infinite horizon settings where the number of dec… ▽ More

    Submitted 20 June, 2021; v1 submitted 13 January, 2020; originally announced January 2020.

  41. arXiv:1911.05531  [pdf, other

    q-bio.BM cs.LG stat.ML

    Accurate Protein Structure Prediction by Embeddings and Deep Learning Representations

    Authors: Iddo Drori, Darshan Thaker, Arjun Srivatsa, Daniel Jeong, Yueqi Wang, Linyong Nan, Fan Wu, Dimitri Leggas, **hao Lei, Weiyi Lu, Weilong Fu, Yuan Gao, Sashank Karri, Anand Kannan, Antonio Moretti, Mohammed AlQuraishi, Chen Keasar, Itsik Pe'er

    Abstract: Proteins are the major building blocks of life, and actuators of almost all chemical and biophysical events in living organisms. Their native structures in turn enable their biological functions which have a fundamental role in drug design. This motivates predicting the structure of a protein from its sequence of amino acids, a fundamental problem in computational biology. In this work, we demonst… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Journal ref: Machine Learning in Computational Biology, 2019

  42. arXiv:1910.13632  [pdf

    stat.ME q-bio.QM stat.AP

    RCRnorm: An integrated system of random-coefficient hierarchical regression models for normalizing NanoString nCounter data

    Authors: Gaoxiang Jia, Xinlei Wang, Qiwei Li, Wei Lu, Ximing Tang, Ignacio Wistuba, Yang Xie

    Abstract: Formalin-fixed paraffin-embedded (FFPE) samples have great potential for biomarker discovery, retrospective studies and diagnosis or prognosis of diseases. Their application, however, is hindered by the unsatisfactory performance of traditional gene expression profiling techniques on damaged RNAs. NanoString nCounter platform is well suited for profiling of FFPE samples and measures gene expressio… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    MSC Class: 97K80

    Journal ref: Ann. Appl. Stat. 13 (2019), no. 3, 1617--1647. https://projecteuclid.org/euclid.aoas/1571277766

  43. arXiv:1910.06444  [pdf, other

    cs.CV cs.LG eess.IV stat.ML

    Building Damage Detection in Satellite Imagery Using Convolutional Neural Networks

    Authors: Joseph Z. Xu, Wenhan Lu, Zebo Li, Pranav Khaitan, Valeriya Zaytseva

    Abstract: In all types of disasters, from earthquakes to armed conflicts, aid workers need accurate and timely data such as damage to buildings and population displacement to mount an effective response. Remote sensing provides this data at an unprecedented scale, but extracting operationalizable information from satellite images is slow and labor-intensive. In this work, we use machine learning to automate… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

  44. The Learning of Fuzzy Cognitive Maps With Noisy Data: A Rapid and Robust Learning Method With Maximum Entropy

    Authors: Guoliang Feng, Wei Lu, Witold Pedrycz, Jianhua Yang, Xiaodong Liu

    Abstract: Numerous learning methods for fuzzy cognitive maps (FCMs), such as the Hebbian-based and the population-based learning methods, have been developed for modeling and simulating dynamic systems. However, these methods are faced with several obvious limitations. Most of these models are extremely time consuming when learning the large-scale FCMs with hundreds of nodes. Furthermore, the FCMs learned b… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: The manuscript has been published on IEEE Transactions on Cybernetics

  45. arXiv:1811.07342  [pdf, other

    cs.LG cs.AI stat.ML

    Transform-Based Multilinear Dynamical System for Tensor Time Series Analysis

    Authors: Weijun Lu, Xiao-Yang Liu, Qingwei Wu, Yue Sun, Anwar Walid

    Abstract: We propose a novel multilinear dynamical system (MLDS) in a transform domain, named $\mathcal{L}$-MLDS, to model tensor time series. With transformations applied to a tensor data, the latent multidimensional correlations among the frontal slices are built, and thus resulting in the computational independence in the transform domain. This allows the exact separability of the multi-dimensional probl… ▽ More

    Submitted 18 November, 2018; originally announced November 2018.

  46. arXiv:1807.05666  [pdf, other

    cs.LG stat.ML

    Scene Learning: Deep Convolutional Networks For Wind Power Prediction by Embedding Turbines into Grid Space

    Authors: Ruiguo Yu, Zhiqiang Liu, Xuewei Li, Wenhuan Lu, Mei Yu, Jianrong Wang, Bin Li

    Abstract: Wind power prediction is of vital importance in wind power utilization. There have been a lot of researches based on the time series of the wind power or speed, but In fact, these time series cannot express the temporal and spatial changes of wind, which fundamentally hinders the advance of wind power prediction. In this paper, a new kind of feature that can describe the process of temporal and sp… ▽ More

    Submitted 17 July, 2018; v1 submitted 15 July, 2018; originally announced July 2018.

    Comments: 11 pages

  47. arXiv:1806.06304  [pdf, other

    stat.ME

    Post-Lasso Inference for High-Dimensional Regression

    Authors: X. Jessie Jeng, Huimin Peng, Wenbin Lu

    Abstract: Among the most popular variable selection procedures in high-dimensional regression, Lasso provides a solution path to rank the variables and determines a cut-off position on the path to select variables and estimate coefficients. In this paper, we consider variable selection from a new perspective motivated by the frequently occurred phenomenon that relevant variables are not completely distingui… ▽ More

    Submitted 16 June, 2018; originally announced June 2018.

  48. arXiv:1805.08462  [pdf, other

    cs.LG stat.ML

    Meta-Learning with Hessian-Free Approach in Deep Neural Nets Training

    Authors: Boyu Chen, Wenlian Lu, Ernest Fokoue

    Abstract: Meta-learning is a promising method to achieve efficient training method towards deep neural net and has been attracting increases interests in recent years. But most of the current methods are still not capable to train complex neuron net model with long-time training process. In this paper, a novel second-order meta-optimizer, named Meta-learning with Hessian-Free(MLHF) approach, is proposed bas… ▽ More

    Submitted 7 September, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

  49. arXiv:1805.08309  [pdf, ps, other

    cs.LG cs.DC eess.IV stat.ML

    AxTrain: Hardware-Oriented Neural Network Training for Approximate Inference

    Authors: Xin He, Liu Ke, Wenyan Lu, Guihai Yan, Xuan Zhang

    Abstract: The intrinsic error tolerance of neural network (NN) makes approximate computing a promising technique to improve the energy efficiency of NN inference. Conventional approximate computing focuses on balancing the efficiency-accuracy trade-off for existing pre-trained networks, which can lead to suboptimal solutions. In this paper, we propose AxTrain, a hardware-oriented training framework to facil… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: In International Symposium on Low Power Electronics and Design (ISLPED) 2018

  50. arXiv:1803.01541  [pdf, other

    cs.CV cs.LG stat.ML

    Improving the Improved Training of Wasserstein GANs: A Consistency Term and Its Dual Effect

    Authors: Xiang Wei, Boqing Gong, Zixia Liu, Wei Lu, Liqiang Wang

    Abstract: Despite being impactful on a variety of problems and applications, the generative adversarial nets (GANs) are remarkably difficult to train. This issue is formally analyzed by \cite{arjovsky2017towards}, who also propose an alternative direction to avoid the caveats in the minmax two-player training of GANs. The corresponding algorithm, called Wasserstein GAN (WGAN), hinges on the 1-Lipschitz cont… ▽ More

    Submitted 5 March, 2018; originally announced March 2018.

    Comments: Accepted as a conference paper in International Conference on Learning Representation(ICLR). Xiang Wei and Boqing Gong contributed equally in this work