Skip to main content

Showing 1–50 of 102 results for author: Xu, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.16605  [pdf, other

    cs.CL cs.AI cs.LG stat.ME

    CLEAR: Can Language Models Really Understand Causal Graphs?

    Authors: Sirui Chen, Mengying Xu, Kun Wang, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Chaochao Lu

    Abstract: Causal reasoning is a cornerstone of how humans interpret the world. To model and reason about causality, causal graphs offer a concise yet effective solution. Given the impressive advancements in language models, a crucial question arises: can they really understand causal graphs? To this end, we pioneer an investigation into language models' understanding of causal graphs. Specifically, we devel… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.03681  [pdf, other

    stat.ME math.ST

    Multiscale Tests for Point Processes and Longitudinal Networks

    Authors: Youmeng Jiang, Min Xu

    Abstract: We propose a new testing framework applicable to both the two-sample problem on point processes and the community detection problem on rectangular arrays of point processes, which we refer to as longitudinal networks; the latter problem is useful in situations where we observe interactions among a group of individuals over time. Our framework is based on a multiscale discretization scheme that con… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 59 pages, 9 figures

    MSC Class: 62Mxx

  3. arXiv:2403.16688  [pdf, other

    math.ST stat.ME stat.ML

    Optimal convex $M$-estimation via score matching

    Authors: Oliver Y. Feng, Yu-Chun Kao, Min Xu, Richard J. Samworth

    Abstract: In the context of linear regression, we construct a data-driven convex loss function with respect to which empirical risk minimisation yields optimal asymptotic variance in the downstream estimation of the regression coefficients. Our semiparametric approach targets the best decreasing approximation of the derivative of the log-density of the noise distribution. At the population level, this fitti… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 69 pages, 12 figures and 4 tables

  4. arXiv:2401.17504  [pdf, other

    cs.LG stat.ME

    CaMU: Disentangling Causal Effects in Deep Model Unlearning

    Authors: Shaofei Shen, Chenhao Zhang, Alina Bialkowski, Weitong Chen, Miao Xu

    Abstract: Machine unlearning requires removing the information of forgetting data while kee** the necessary information of remaining data. Despite recent advancements in this area, existing methodologies mainly focus on the effect of removing forgetting data without considering the negative impact this can have on the information of the remaining data, resulting in significant performance degradation afte… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Full version of the paper accepted for the SDM 24 conference

  5. arXiv:2312.15469  [pdf, other

    stat.ML cs.LG stat.ME

    Efficient Estimation of the Central Mean Subspace via Smoothed Gradient Outer Products

    Authors: Gan Yuan, Mingyue Xu, Samory Kpotufe, Daniel Hsu

    Abstract: We consider the problem of sufficient dimension reduction (SDR) for multi-index models. The estimators of the central mean subspace in prior works either have slow (non-parametric) convergence rates, or rely on stringent distributional conditions (e.g., the covariate distribution $P_{\mathbf{X}}$ being elliptical symmetric). In this paper, we show that a fast parametric convergence rate of form… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    MSC Class: 62B05; 62G08

  6. arXiv:2311.08254  [pdf, other

    stat.ME

    Identifiable and interpretable nonparametric factor analysis

    Authors: Maoran Xu, Amy H. Herring, David B. Dunson

    Abstract: Factor models have been widely used to summarize the variability of high-dimensional data through a set of factors with much lower dimensionality. Gaussian linear factor models have been particularly popular due to their interpretability and ease of computation. However, in practice, data often violate the multivariate Gaussian assumption. To characterize higher-order dependence and nonlinearity,… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 50 pages, 17 figures

  7. arXiv:2310.20030  [pdf, other

    cs.LG math.DG stat.ML

    Scaling Riemannian Diffusion Models

    Authors: Aaron Lou, Minkai Xu, Stefano Ermon

    Abstract: Riemannian diffusion models draw inspiration from standard Euclidean space diffusion models to learn distributions on general manifolds. Unfortunately, the additional geometric complexity renders the diffusion transition term inexpressible in closed form, so prior methods resort to imprecise approximations of the score matching training objective that degrade performance and preclude applications… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  8. arXiv:2309.04594  [pdf, ps, other

    stat.AP

    A Comparison between Markov Switching Zero-inflated and Hurdle Models for Spatio-temporal Infectious Disease Counts

    Authors: Mingchi Xu, Dirk Douwes-Schultz, Alexandra M. Schmidt

    Abstract: In epidemiological studies, zero-inflated and hurdle models are commonly used to handle excess zeros in reported infectious disease cases. However, they can not model the persistence (from presence to presence) and reemergence (from absence to presence) of a disease separately. Covariates can sometimes have different effects on the reemergence and persistence of a disease. Recently, a zero-inflate… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  9. arXiv:2309.04268  [pdf, other

    stat.ML cs.LG math.ST

    Optimal Rate of Kernel Regression in Large Dimensions

    Authors: Weihao Lu, Haobo Zhang, Yicheng Li, Manyun Xu, Qian Lin

    Abstract: We perform a study on kernel regression for large-dimensional data (where the sample size $n$ is polynomially depending on the dimension $d$ of the samples, i.e., $n\asymp d^γ$ for some $γ>0$ ). We first build a general tool to characterize the upper bound and the minimax lower bound of kernel regression for large dimensional data through the Mendelson complexity $\varepsilon_{n}^{2}$ and the metr… ▽ More

    Submitted 28 June, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    MSC Class: 62G08; 46E22; 68T07

  10. arXiv:2308.08046  [pdf, ps, other

    cs.LG stat.ML

    Regret Lower Bounds in Multi-agent Multi-armed Bandit

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: Multi-armed Bandit motivates methods with provable upper bounds on regret and also the counterpart lower bounds have been extensively studied in this context. Recently, Multi-agent Multi-armed Bandit has gained significant traction in various domains, where individual clients face bandit problems in a distributed manner and the objective is the overall system performance, typically measured by reg… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 10 pages

  11. arXiv:2306.10395  [pdf, other

    stat.ML cs.LG

    Distributed Semi-Supervised Sparse Statistical Inference

    Authors: Jiyuan Tu, Weidong Liu, Xiaojun Mao, Mingyue Xu

    Abstract: The debiased estimator is a crucial tool in statistical inference for high-dimensional model parameters. However, constructing such an estimator involves estimating the high-dimensional inverse Hessian matrix, incurring significant computational costs. This challenge becomes particularly acute in distributed setups, where traditional methods necessitate computing a debiased estimator on every mach… ▽ More

    Submitted 15 December, 2023; v1 submitted 17 June, 2023; originally announced June 2023.

    Comments: IEEE Transactions on Information Theory, 2023

  12. arXiv:2306.05579  [pdf, other

    cs.LG stat.ML

    Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: We study a decentralized multi-agent multi-armed bandit problem in which multiple clients are connected by time dependent random graphs provided by an environment. The reward distributions of each arm vary across clients and rewards are generated independently over time by an environment based on distributions that include both sub-exponential and sub-gaussian distributions. Each client pulls an a… ▽ More

    Submitted 17 October, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 58 pages, to appear at Advances in Neural Information Processing Systems (NeurIPS 2023 Spotlight)

  13. arXiv:2304.02127  [pdf, other

    stat.ME

    A Bayesian Collocation Integral Method for Parameter Estimation in Ordinary Differential Equations

    Authors: Mingwei Xu, Samuel W. K. Wong, Peijun Sang

    Abstract: Inferring the parameters of ordinary differential equations (ODEs) from noisy observations is an important problem in many scientific fields. Currently, most parameter estimation methods that bypass numerical integration tend to rely on basis functions or Gaussian processes to approximate the ODE solution and its derivatives. Due to the sensitivity of the ODE solution to its derivatives, these met… ▽ More

    Submitted 23 October, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  14. arXiv:2303.01992  [pdf, other

    math.ST stat.ME

    Choosing the $p$ in $L_p$ loss: rate adaptivity on the symmetric location problem

    Authors: Yu-Chun Kao, Min Xu, Cun-Hui Zhang

    Abstract: Given univariate random variables $Y_1, \ldots, Y_n$ with the $\text{Uniform}(θ_0 - 1, θ_0 + 1)$ distribution, the sample midrange $\frac{Y_{(n)}+Y_{(1)}}{2}$ is the MLE for $θ_0$ and estimates $θ_0$ with error of order $1/n$, which is much smaller compared with the $1/\sqrt{n}$ error rate of the usual sample mean estimator. However, the sample midrange performs poorly when the data has say the Ga… ▽ More

    Submitted 16 August, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: 60 pages; 7 figures

    MSC Class: 62F10

  15. arXiv:2302.05933  [pdf, other

    stat.ML cs.LG

    Generalization Ability of Wide Neural Networks on $\mathbb{R}$

    Authors: Jianfa Lai, Manyun Xu, Rui Chen, Qian Lin

    Abstract: We perform a study on the generalization ability of the wide two-layer ReLU neural network on $\mathbb{R}$. We first establish some spectral properties of the neural tangent kernel (NTK): $a)$ $K_{d}$, the NTK defined on $\mathbb{R}^{d}$, is positive definite; $b)$ $λ_{i}(K_{1})$, the $i$-th largest eigenvalue of $K_{1}$, is proportional to $i^{-2}$. We then show that: $i)$ when the width… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

    Comments: 47 pages, 4 figures

    MSC Class: 62G08 (Primary); 68T07 (secondary); 46E22 ACM Class: G.3

  16. arXiv:2302.05549  [pdf, other

    stat.ME cs.DC

    Balancing Approach for Causal Inference at Scale

    Authors: Sicheng Lin, Meng Xu, Xi Zhang, Shih-Kang Chao, Ying-Kai Huang, Xiaolin Shi

    Abstract: With the modern software and online platforms to collect massive amount of data, there is an increasing demand of applying causal inference methods at large scale when randomized experimentation is not viable. Weighting methods that directly incorporate covariate balancing have recently gained popularity for estimating causal effects in observational studies. These methods reduce the manual effort… ▽ More

    Submitted 3 August, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

  17. arXiv:2212.00884  [pdf, other

    cs.LG stat.ML

    Pareto Regret Analyses in Multi-objective Multi-armed Bandit

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: We study Pareto optimality in multi-objective multi-armed bandit by providing a formulation of adversarial multi-objective multi-armed bandit and defining its Pareto regrets that can be applied to both stochastic and adversarial settings. The regrets do not rely on any scalarization functions and reflect Pareto optimality compared to scalarized regrets. We also present new algorithms assuming both… ▽ More

    Submitted 30 May, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 19 pages; accepted at ICML 2023 and to be published in Proceedings of Machine Learning Research (PMLR)

  18. arXiv:2210.13843  [pdf, ps, other

    econ.EM stat.ME

    GLS under Monotone Heteroskedasticity

    Authors: Yoichi Arai, Taisuke Otsu, Mengshan Xu

    Abstract: The generalized least square (GLS) is one of the most basic tools in regression analyses. A major issue in implementing the GLS is estimation of the conditional variance function of the error term, which typically requires a restrictive functional form assumption for parametric estimation or smoothing parameters for nonparametric estimation. In this paper, we propose an alternative approach to est… ▽ More

    Submitted 22 January, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

  19. arXiv:2210.08461  [pdf, other

    cs.LG stat.ML

    Positive-Unlabeled Learning using Random Forests via Recursive Greedy Risk Minimization

    Authors: Jonathan Wilton, Abigail M. Y. Koay, Ryan K. L. Ko, Miao Xu, Nan Ye

    Abstract: The need to learn from positive and unlabeled data, or PU learning, arises in many applications and has attracted increasing interest. While random forests are known to perform well on many tasks with positive and negative data, recent PU algorithms are generally based on deep neural networks, and the potential of tree-based PU learning is under-explored. In this paper, we propose new random fores… ▽ More

    Submitted 16 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS 2022

  20. arXiv:2209.07306  [pdf, other

    stat.AP cs.CY math.ST

    Statistical Modeling of Data Breach Risks: Time to Identification and Notification

    Authors: Maochao Xu, Quynh Nhu Nguyen

    Abstract: It is very challenging to predict the cost of a cyber incident owing to the complex nature of cyber risk. However, it is inevitable for insurance companies who offer cyber insurance policies. The time to identifying an incident and the time to noticing the affected individuals are two important components in determining the cost of a cyber incident. In this work, we initialize the study on those t… ▽ More

    Submitted 24 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

  21. arXiv:2207.08868  [pdf, ps, other

    econ.EM stat.ME

    Isotonic propensity score matching

    Authors: Mengshan Xu, Taisuke Otsu

    Abstract: We propose a one-to-many matching estimator of the average treatment effect based on propensity scores estimated by isotonic regression. The method relies on the monotonicity assumption on the propensity score function, which can be justified in many applications in economics. We show that the nature of the isotonic estimator can help us to fix many problems of existing matching methods, including… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  22. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  23. Modeling Ride-Sourcing Matching and Pickup Processes based on Additive Gaussian Process Models

    Authors: Zheng Zhu, Meng Xu, Yining Di, Xiqun Chen, **gru Yu

    Abstract: Matching and pickup processes are core features of ride-sourcing services. Previous studies have adopted abundant analytical models to depict the two processes and obtain operational insights; while the goodness of fit between models and data was dismissed. To simultaneously consider the fitness between models and data and analytically tractable formations, we propose a data-driven approach based… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

    Comments: 30 pages, 8 figures, 4 tables. Submitted and under review in Transportmetrica B: Transport Dynamics

  24. Random Matrix Time Series

    Authors: Peiyuan Teng, Min Xu

    Abstract: In this paper, a time series model with coefficients that take values from random matrix ensembles is proposed. Formal definitions, theoretical solutions, and statistical properties are derived. Estimation and forecast methodologies for random matrix time series are discussed with examples. Random matrix differential equations and potential applications of the time series model are suggested at th… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: 15 pages

    Journal ref: Journal of Statistical Theory and Practice, 17, Article number: 42 (2023)

  25. arXiv:2203.10975  [pdf, other

    stat.ML cs.LG stat.AP stat.ME

    GCF: Generalized Causal Forest for Heterogeneous Treatment Effect Estimation in Online Marketplace

    Authors: Shu Wan, Chen Zheng, Zhonggen Sun, Mengfan Xu, Xiaoqing Yang, Hongtu Zhu, Jiecheng Guo

    Abstract: Uplift modeling is a rapidly growing approach that utilizes causal inference and machine learning methods to directly estimate the heterogeneous treatment effects, which has been widely applied to various online marketplaces to assist large-scale decision-making in recent years. The existing popular models, like causal forest (CF), are limited to either discrete treatments or posing parametric ass… ▽ More

    Submitted 23 September, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  26. arXiv:2111.07052   

    physics.ao-ph stat.AP

    Distribution and Determinants of Correlation between PM2.5 and O3 in China Mainland: Dynamitic simil-Hu Lines

    Authors: Chenru Chen, Miaoqing Xu, Shuyi Liu, Dehai Zhu, Jianyu Yang, Bingbo Gao, Ziyue Chen

    Abstract: In recent years, China has made great efforts to control air pollution. During the governance process, it is found that fine particulate matter (PM2.5) and ozone (O3) change in the same trend among some areas and the opposite in others, which brings some difficulties to take measures in a planned way. Therefore, this study adopted multi-year and large-scale air quality data to explore the distribu… ▽ More

    Submitted 30 September, 2022; v1 submitted 13 November, 2021; originally announced November 2021.

    Comments: Our research group have decided to withdraw this preprint

  27. arXiv:2108.04851  [pdf, other

    stat.ME

    Bayesian Inference using the Proximal Map**: Uncertainty Quantification under Varying Dimensionality

    Authors: Maoran Xu, Hua Zhou, Yujie Hu, Leo L. Duan

    Abstract: In statistical applications, it is common to encounter parameters supported on a varying or unknown dimensional space. Examples include the fused lasso regression, the matrix recovery under an unknown low rank, etc. Despite the ease of obtaining a point estimate via the optimization, it is much more challenging to quantify their uncertainty -- in the Bayesian framework, a major difficulty is that… ▽ More

    Submitted 2 October, 2022; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: 26 pages, 4 figures

  28. arXiv:2107.00153  [pdf, other

    stat.ME math.PR stat.CO

    Root and community inference on the latent growth process of a network

    Authors: Harry Crane, Min Xu

    Abstract: Many existing statistical models for networks overlook the fact that many real world networks are formed through a growth process. To address this, we introduce the PAPER (Preferential Attachment Plus Erdős--Rényi) model for random networks, where we let a random network G be the union of a preferential attachment (PA) tree T and additional Erdős--Rényi (ER) random edges. The PA tree component cap… ▽ More

    Submitted 7 February, 2023; v1 submitted 30 June, 2021; originally announced July 2021.

    Comments: 69 pages; 29 figures

    MSC Class: 62M99; 05C80

  29. arXiv:2103.08450  [pdf, other

    stat.AP stat.ML

    Modeling Multivariate Cyber Risks: Deep Learning Dating Extreme Value Theory

    Authors: Mingyue Zhang Wu, **zhu Luo, Xing Fang, Maochao Xu, Peng Zhao

    Abstract: Modeling cyber risks has been an important but challenging task in the domain of cyber security. It is mainly because of the high dimensionality and heavy tails of risk patterns. Those obstacles have hindered the development of statistical modeling of the multivariate cyber risks. In this work, we propose a novel approach for modeling the multivariate cyber risks which relies on the deep learning… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 25 pages

  30. arXiv:2102.03895  [pdf, other

    stat.ML cs.LG stat.AP

    Functional optimal transport: map estimation and domain adaptation for functional data

    Authors: Jiacheng Zhu, Aritra Guha, Dat Do, Mengdi Xu, XuanLong Nguyen, Ding Zhao

    Abstract: We introduce a formulation of optimal transport problem for distributions on function spaces, where the stochastic map between functional domains can be partially represented in terms of an (infinite-dimensional) Hilbert-Schmidt operator map** a Hilbert space of functions to another. For numerous machine learning tasks, data can be naturally viewed as samples drawn from spaces of functions, such… ▽ More

    Submitted 28 August, 2023; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 48 pages, 10 figures, 3 tables

  31. arXiv:2012.03420  [pdf, other

    cs.LG stat.ML

    Towards Generalized Implementation of Wasserstein Distance in GANs

    Authors: Minkai Xu, Zhiming Zhou, Guansong Lu, Jian Tang, Weinan Zhang, Yong Yu

    Abstract: Wasserstein GANs (WGANs), built upon the Kantorovich-Rubinstein (KR) duality of Wasserstein distance, is one of the most theoretically sound GAN models. However, in practice it does not always outperform other variants of GANs. This is mostly due to the imperfect implementation of the Lipschitz condition required by the KR duality. Extensive work has been done in the community with different imple… ▽ More

    Submitted 12 January, 2021; v1 submitted 6 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI 2021

  32. arXiv:2011.14437  [pdf, other

    stat.AP

    How to Measure Your App: A Couple of Pitfalls and Remedies in Measuring App Performance in Online Controlled Experiments

    Authors: Yuxiang Xie, Meng Xu, Evan Chow, Xiaolin Shi

    Abstract: Effectively measuring, understanding, and improving mobile app performance is of paramount importance for mobile app developers. Across the mobile Internet landscape, companies run online controlled experiments (A/B tests) with thousands of performance metrics in order to understand how app performance causally impacts user retention and to guard against service or app regressions that degrade use… ▽ More

    Submitted 29 November, 2020; originally announced November 2020.

    Comments: WSDM '21: Proceedings of the 14th International Conference on Web Search and Data Mining

  33. arXiv:2010.01875  [pdf, other

    cs.LG stat.ML

    Pointwise Binary Classification with Pairwise Confidence Comparisons

    Authors: Lei Feng, Senlin Shu, Nan Lu, Bo Han, Miao Xu, Gang Niu, Bo An, Masashi Sugiyama

    Abstract: To alleviate the data requirement for training effective binary classifiers in binary classification, many weakly supervised learning settings have been proposed. Among them, some consider using pairwise but not pointwise labels, when pointwise labels are not accessible due to privacy, confidentiality, or security reasons. However, as a pairwise label denotes whether or not two data points share a… ▽ More

    Submitted 13 January, 2022; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted to ICML 2021

  34. arXiv:2010.01819  [pdf, other

    cs.LG stat.ML

    Bigeminal Priors Variational auto-encoder

    Authors: Xuming Ran, Mingkun Xu, Qi Xu, Huihui Zhou, Quanying Liu

    Abstract: Variational auto-encoders (VAEs) are an influential and generally-used class of likelihood-based generative models in unsupervised learning. The likelihood-based generative models have been reported to be highly robust to the out-of-distribution (OOD) inputs and can be a detector by assuming that the model assigns higher likelihoods to the samples from the in-distribution (ID) dataset than an OOD… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

  35. arXiv:2009.09538  [pdf, other

    cs.LG cs.AI stat.ML

    Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms

    Authors: Mengfan Xu, Diego Klabjan

    Abstract: We study the challenging exploration incentive problem in both bandit and reinforcement learning, where the rewards are scale-free and potentially unbounded, driven by real-world scenarios and differing from existing work. Past works in reinforcement learning either assume costly interactions with an environment or propose algorithms finding potentially low quality local maxima. Motivated by EXP-t… ▽ More

    Submitted 3 May, 2024; v1 submitted 20 September, 2020; originally announced September 2020.

    Comments: 40 pages, 8 figures

  36. arXiv:2008.05095  [pdf, ps, other

    cs.LG stat.ML

    Experimental Analysis of Legendre Decomposition in Machine Learning

    Authors: Jianye Pang, Kai Yi, Wanguang Yin, Min Xu

    Abstract: In this technical report, we analyze Legendre decomposition for non-negative tensor in theory and application. In theory, the properties of dual parameters and dually flat manifold in Legendre decomposition are reviewed, and the process of tensor projection and parameter updating is analyzed. In application, a series of verification experiments and clustering experiments with parameters on submani… ▽ More

    Submitted 21 September, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

  37. arXiv:2007.15840  [pdf

    cs.LG cs.CV stat.ML

    A Survey on Concept Factorization: From Shallow to Deep Representation Learning

    Authors: Zhao Zhang, Yan Zhang, Mingliang Xu, Li Zhang, Yi Yang, Shuicheng Yan

    Abstract: The quality of learned features by representation learning determines the performance of learning algorithms and the related application tasks (such as high-dimensional data clustering). As a relatively new paradigm for representation learning, Concept Factorization (CF) has attracted a great deal of interests in the areas of machine learning and data mining for over a decade. Lots of effective CF… ▽ More

    Submitted 31 January, 2021; v1 submitted 31 July, 2020; originally announced July 2020.

    Comments: Please cite this work as: Zhao Zhang, Yan Zhang, Mingliang Xu, Li Zhang, Yi Yang and Shuicheng Yan, "A Survey on Concept Factorization: From Shallow to Deep Representation Learning," Information Processing and Management (IPM), Jan 2021

  38. arXiv:2007.08929  [pdf, other

    cs.LG stat.ML

    Provably Consistent Partial-Label Learning

    Authors: Lei Feng, Jiaqi Lv, Bo Han, Miao Xu, Gang Niu, Xin Geng, Bo An, Masashi Sugiyama

    Abstract: Partial-label learning (PLL) is a multi-class classification problem, where each training example is associated with a set of candidate labels. Even though many practical PLL methods have been proposed in the last two decades, there lacks a theoretical understanding of the consistency of those methods-none of the PLL methods hitherto possesses a generation process of candidate label sets, and then… ▽ More

    Submitted 23 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: NeurIPS 2020 camera-ready version

  39. arXiv:2007.08128  [pdf, other

    cs.LG stat.ML

    Detecting Out-of-distribution Samples via Variational Auto-encoder with Reliable Uncertainty Estimation

    Authors: Xuming Ran, Mingkun Xu, Lingrui Mei, Qi Xu, Quanying Liu

    Abstract: Variational autoencoders (VAEs) are influential generative models with rich representation capabilities from the deep neural network architecture and Bayesian method. However, VAE models have a weakness that assign a higher likelihood to out-of-distribution (OOD) inputs than in-distribution (ID) inputs. To address this problem, a reliable uncertainty estimation is considered to be critical for in-… ▽ More

    Submitted 1 November, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

  40. arXiv:2007.00454  [pdf, other

    cs.SI stat.ME

    Pricing cyber insurance for a large-scale network

    Authors: Lei Hua, Maochao Xu

    Abstract: Facing the lack of cyber insurance loss data, we propose an innovative approach for pricing cyber insurance for a large-scale network based on synthetic data. The synthetic data is generated by the proposed risk spreading and recovering algorithm that allows infection and recovery events to occur sequentially, and allows dependence of random waiting time to infection for different nodes. The scale… ▽ More

    Submitted 29 June, 2020; originally announced July 2020.

  41. arXiv:2006.16723  [pdf, other

    cs.LG cs.AI cs.DB cs.LO stat.ML

    Neural Datalog Through Time: Informed Temporal Modeling via Logical Specification

    Authors: Hongyuan Mei, Guanghui Qin, Minjie Xu, Jason Eisner

    Abstract: Learning how to predict future events from patterns of past events is difficult when the set of possible event types is large. Training an unrestricted neural model might overfit to spurious patterns. To exploit domain-specific knowledge of how past events might affect an event's present probability, we propose using a temporal deductive database to track structured facts over time. Rules serve to… ▽ More

    Submitted 16 August, 2020; v1 submitted 30 June, 2020; originally announced June 2020.

    Comments: ICML 2020 camera-ready (new Appendix A.3, rewritten Appendix F)

  42. arXiv:2006.16312  [pdf, other

    cs.LG cs.DS cs.IR eess.SY stat.ML

    Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

    Authors: Xiaotian Hao, Zhaoqing Peng, Yi Ma, Guan Wang, Junqi **, Jianye Hao, Shan Chen, Rongquan Bai, Mingzhou Xie, Miao Xu, Zhenzhe Zheng, Chuan Yu, Han Li, Jian Xu, Kun Gai

    Abstract: In E-commerce, advertising is essential for merchants to reach their target users. The typical objective is to maximize the advertiser's cumulative revenue over a period of time under a budget constraint. In real applications, an advertisement (ad) usually needs to be exposed to the same user multiple times until the user finally contributes revenue (e.g., places an order). However, existing adver… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: accepted by ICML 2020

  43. arXiv:2006.11441  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes

    Authors: Mengdi Xu, Wenhao Ding, Jiacheng Zhu, Zuxin Liu, Baiming Chen, Ding Zhao

    Abstract: Continuously learning to solve unseen tasks with limited experience has been extensively pursued in meta-learning and continual learning, but with restricted assumptions such as accessible task distributions, independently and identically distributed tasks, and clear task delineations. However, real-world physical tasks frequently violate these assumptions, resulting in performance degradation. Th… ▽ More

    Submitted 30 November, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 16 pages, 6 figures

  44. arXiv:2006.06983  [pdf, other

    cs.LG cs.DC stat.ML

    Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data

    Authors: Chengxu Yang, Qipeng Wang, Mengwei Xu, Zhenpeng Chen, Kaigui Bian, Yunxin Liu, Xuanzhe Liu

    Abstract: Federated learning (FL) is an emerging, privacy-preserving machine learning paradigm, drawing tremendous attention in both academia and industry. A unique characteristic of FL is heterogeneity, which resides in the various hardware specifications and dynamic states across the participating devices. Theoretically, heterogeneity can exert a huge influence on the FL training process, e.g., causing a… ▽ More

    Submitted 12 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

  45. arXiv:2006.01340  [pdf, other

    stat.ME

    Bayesian Inference with the l1-ball Prior: Solving Combinatorial Problems with Exact Zeros

    Authors: Maoran Xu, Leo L. Duan

    Abstract: The l1-regularization is very popular in high dimensional statistics -- it changes a combinatorial problem of choosing which subset of the parameter are zero, into a simple continuous optimization. Using a continuous prior concentrated near zero, the Bayesian counterparts are successful in quantifying the uncertainty in the variable selection problems; nevertheless, the lack of exact zeros makes i… ▽ More

    Submitted 20 February, 2023; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: 44 pages, 15 figures

  46. arXiv:2005.08794  [pdf, other

    math.PR math.ST stat.CO

    Inference on the History of a Randomly Growing Tree

    Authors: Harry Crane, Min Xu

    Abstract: The spread of infectious disease in a human community or the proliferation of fake news on social media can be modeled as a randomly growing tree-shaped graph. The history of the random growth process is often unobserved but contains important information such as the source of the infection. We consider the problem of statistical inference on aspects of the latent history using only a single snaps… ▽ More

    Submitted 13 January, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 36 pages; 7 figures; 5 tables

    MSC Class: 90B15; 62M15

  47. arXiv:2005.06546  [pdf

    cs.LG stat.ML

    Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

    Authors: Forrest Sheng Bao, Youbiao He, Jie Liu, Yuanfang Chen, Qian Li, Christina R. Zhang, Lei Han, Baoli Zhu, Yaorong Ge, Shi Chen, Ming Xu, Liu Ouyang

    Abstract: The COVID-19 is swee** the world with deadly consequences. Its contagious nature and clinical similarity to other pneumonias make separating subjects contracted with COVID-19 and non-COVID-19 viral pneumonia a priority and a challenge. However, COVID-19 testing has been greatly limited by the availability and cost of existing methods, even in developed countries like the US. Intrigued by the wid… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    ACM Class: I.5.4

  48. arXiv:2005.05784  [pdf, other

    q-bio.NC cs.LG eess.SP stat.ML

    A Graph Gaussian Embedding Method for Predicting Alzheimer's Disease Progression with MEG Brain Networks

    Authors: Mengjia Xu, David Lopez Sanz, Pilar Garces, Fernando Maestu, Quanzheng Li, Dimitrios Pantazis

    Abstract: Characterizing the subtle changes of functional brain networks associated with the pathological cascade of Alzheimer's disease (AD) is important for early diagnosis and prediction of disease progression prior to clinical symptoms. We developed a new deep learning method, termed multiple graph Gaussian embedding model (MG2G), which can learn highly informative network features by map** high-dimen… ▽ More

    Submitted 10 November, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

  49. arXiv:2005.05441  [pdf, other

    cs.LG cs.MA stat.ML

    Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments

    Authors: Baiming Chen, Mengdi Xu, Zuxin Liu, Liang Li, Ding Zhao

    Abstract: Action and observation delays exist prevalently in the real-world cyber-physical systems which may pose challenges in reinforcement learning design. It is particularly an arduous task when handling multi-agent systems where the delay of one agent could spread to other agents. To resolve this problem, this paper proposes a novel framework to deal with delays as well as the non-stationary training i… ▽ More

    Submitted 28 August, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

  50. arXiv:2005.05440  [pdf, other

    cs.LG cs.AI stat.ML

    Delay-Aware Model-Based Reinforcement Learning for Continuous Control

    Authors: Baiming Chen, Mengdi Xu, Liang Li, Ding Zhao

    Abstract: Action delays degrade the performance of reinforcement learning in many real-world systems. This paper proposes a formal definition of delay-aware Markov Decision Process and proves it can be transformed into standard MDP with augmented states using the Markov reward process. We develop a delay-aware model-based reinforcement learning framework that can incorporate the multi-step delay into the le… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Journal ref: Neurocomputing Volume 450, 25 August 2021, Pages 119-128